Biology – Quantitative Biology – Quantitative Methods
Scientific paper: Information based clustering
2005-11-26
To appear in Proceedings of the National Academy of Sciences USA, 11 pages, 9 figures
10.1073/pnas.0507432102
In an age of increasingly large data sets, investigators in many different disciplines have turned to clustering as a tool for data analysis and exploration. Existing clustering methods, however, typically depend on several nontrivial assumptions about the structure of the data. Here we reformulate the clustering problem from an information-theoretic perspective that avoids many of these assumptions. In particular, our formulation obviates the need for defining a cluster "prototype", does not require an a priori similarity metric, is invariant to changes in the representation of the data, and naturally captures nonlinear relations. We apply this approach to different domains and find that it consistently produces clusters that are more coherent than those extracted by existing algorithms. Finally, our approach provides a way of clustering based on collective notions of similarity rather than the traditional pairwise measures.
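Two of the properties claimed in the abstract, invariance to the representation of the data and sensitivity to nonlinear relations, are general properties of mutual information and can be illustrated on a toy example. The sketch below is not the authors' clustering algorithm; it only compares a plug-in mutual information estimate with the Pearson correlation on a small, made-up discrete data set. The function names `mutual_information` and `pearson`, and the data, are assumptions introduced purely for illustration.

```python
import math
from collections import Counter


def mutual_information(xs, ys):
    """Plug-in estimate of I(X;Y) in bits for two equal-length discrete sequences."""
    n = len(xs)
    count_x = Counter(xs)
    count_y = Counter(ys)
    count_xy = Counter(zip(xs, ys))
    mi = 0.0
    for (x, y), c in count_xy.items():
        # p(x,y) * log2( p(x,y) / (p(x) * p(y)) ), written in terms of counts
        mi += (c / n) * math.log2(c * n / (count_x[x] * count_y[y]))
    return mi


def pearson(xs, ys):
    """Pearson correlation coefficient, for comparison with a linear measure."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# Toy data (illustrative assumption): y depends on x nonlinearly and non-monotonically.
xs = [-2, -1, 0, 1, 2] * 20
ys = [x * x for x in xs]

# An invertible re-representation of x (a relabeling of its values).
relabeled = [x ** 3 + 7 for x in xs]

r = pearson(xs, ys)
mi = mutual_information(xs, ys)
mi_relabeled = mutual_information(relabeled, ys)

print("Pearson correlation:", r)
print("Mutual information (bits):", mi)
print("Mutual information after relabeling x (bits):", mi_relabeled)
```

On this toy data the linear correlation vanishes even though y is fully determined by x, while the mutual information is about 1.52 bits and is unchanged when x is relabeled. How such a measure is turned into a clustering objective is the subject of the paper itself and is not reproduced here.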
Atwal Gurinder Singh
Bialek William
Slonim Noam
Tkacik Gasper