Computer Science – Databases
Scientific paper
2012-03-29
Proceedings of the VLDB Endowment (PVLDB), Vol. 5, No. 7, pp. 610-621 (2012)
VLDB2012
Clustering uncertain data has emerged as a challenging task in uncertain data management and mining. Thanks to a computational complexity advantage over other clustering paradigms, partitional clustering has been particularly studied and a number of algorithms have been developed. While existing proposals differ mainly in the notions of cluster centroid and clustering objective function, little attention has been given to an analysis of their characteristics and limits. In this work, we theoretically investigate major existing methods of partitional clustering, and alternatively propose a well-founded approach to clustering uncertain data based on a novel notion of cluster centroid. A cluster centroid is seen as an uncertain object defined in terms of a random variable whose realizations are derived based on all deterministic representations of the objects to be clustered. As demonstrated theoretically and experimentally, this allows for better representing a cluster of uncertain objects, thus supporting a consistently improved clustering performance while maintaining comparable efficiency with existing partitional clustering algorithms.
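The centroid notion described in the abstract can be illustrated with a minimal sketch. It assumes, for illustration only, that each uncertain object is given as a finite set of equally likely deterministic points, and that one centroid realization is the coordinate-wise mean of one point drawn from every object in the cluster. The function names `sample_centroid_realizations` and `mean_point` are hypothetical and are not taken from the paper, whose exact definition may differ.

```python
import random


def sample_centroid_realizations(cluster, n_draws=1000, seed=0):
    """Draw realizations of a hypothetical uncertain centroid.

    cluster: list of uncertain objects, each a list of equally likely
             deterministic points (tuples of floats, same dimension).
    Assumption: one centroid realization is the coordinate-wise mean of
    one point drawn independently from every object in the cluster.
    """
    rng = random.Random(seed)
    dim = len(cluster[0][0])
    realizations = []
    for _ in range(n_draws):
        # Pick one deterministic representation per uncertain object.
        picks = [rng.choice(obj) for obj in cluster]
        # Average the picked points coordinate by coordinate.
        realizations.append(
            tuple(sum(p[d] for p in picks) / len(picks) for d in range(dim))
        )
    return realizations


def mean_point(points):
    """Coordinate-wise mean of a list of points."""
    dim = len(points[0])
    return tuple(sum(p[d] for p in points) / len(points) for d in range(dim))


# Two toy uncertain objects, each with two possible positions.
cluster = [
    [(0.0, 0.0), (1.0, 0.0)],
    [(4.0, 2.0), (4.0, 4.0)],
]
centroid_samples = sample_centroid_realizations(cluster, n_draws=2000)
expected_centroid = mean_point(centroid_samples)
```

In this sketch the centroid is itself a distribution over points (here, four possible positions) rather than a single point, which is the sense in which the abstract calls the centroid an uncertain object.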
Gullo Francesco
Tagarelli Andrea
Uncertain Centroid based Partitional Clustering of Uncertain Data