Computer Science – Computer Vision and Pattern Recognition
Scientific paper
2009-05-20
Computer Science
Computer Vision and Pattern Recognition
LateX 14 pages, Submitted to a technical journal
Scientific paper
Information distance is a parameter-free similarity measure based on compression, used in pattern recognition, data mining, phylogeny, clustering, and classification. The notion of information distance is extended from pairs to multiples (finite lists). We study maximal overlap, metricity, universality, minimal overlap, additivity, and normalized information distance in multiples. We use the theoretical notion of Kolmogorov complexity which for practical purposes is approximated by the length of the compressed version of the file involved, using a real-world compression program. {\em Index Terms}-- Information distance, multiples, pattern recognition, data mining, similarity, Kolmogorov complexity
No associations
LandOfFree
Information Distance in Multiples does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.
If you have personal experience with Information Distance in Multiples, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Information Distance in Multiples will most certainly appreciate the feedback.
Profile ID: LFWR-SCP-O-243372