Information theoretic model validation for clustering

Computer Science – Information Theory

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

9 pages, 2 figures, International Symposium on Information Theory 2010 (ISIT10 E-Mo-4.2), June 13-18 in Austin, TX}

Scientific paper

Model selection in clustering requires (i) to specify a suitable clustering principle and (ii) to control the model order complexity by choosing an appropriate number of clusters depending on the noise level in the data. We advocate an information theoretic perspective where the uncertainty in the measurements quantizes the set of data partitionings and, thereby, induces uncertainty in the solution space of clusterings. A clustering model, which can tolerate a higher level of fluctuations in the measurements than alternative models, is considered to be superior provided that the clustering solution is equally informative. This tradeoff between \emph{informativeness} and \emph{robustness} is used as a model selection criterion. The requirement that data partitionings should generalize from one data set to an equally probable second data set gives rise to a new notion of structure induced information.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Information theoretic model validation for clustering does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Information theoretic model validation for clustering, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Information theoretic model validation for clustering will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-512831

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.