Making the Most of Missing Values: Object Clustering with Partial Data in Astronomy

Other

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Scientific paper

Modern classification and clustering techniques analyze collections of objects that are described by a set of useful features or parameters. Clustering methods group the objects in that feature space to identify distinct, well separated subsets of the data set.
However, real observational data may contain missing values for some features. A ``shape'' feature may not be well defined for objects close to the detection limit, and objects of extreme color may be unobservable at some wavelengths.
The usual methods for handling data with missing values, such as imputation (estimating the missing values) or marginalization (deleting all objects with missing values), rely on the assumption that missing values occur by random chance. While this is a reasonable assumption in other disciplines, the fact that a value is missing in an astronomical catalog may be physically meaningful.
We demonstrate a clustering analysis algorithm, KSC, that a) does not impute values and b) does not discard the partially observed objects. KSC uses soft constraints defined by the fully observed objects to assist in the grouping of objects with missing values. We present an analysis of objects taken from the Sloan Digital Sky Survey to demonstrate how imputing the values can be misleading and why the KSC approach can produce more appropriate results.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Making the Most of Missing Values: Object Clustering with Partial Data in Astronomy does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Making the Most of Missing Values: Object Clustering with Partial Data in Astronomy, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Making the Most of Missing Values: Object Clustering with Partial Data in Astronomy will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-1034515

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.