A Geometric Analysis of Subspace Clustering with Outliers

Computer Science – Information Theory

Scientific paper

This paper considers the problem of clustering a collection of unlabeled data points assumed to lie near a union of lower-dimensional planes. As is common in computer vision or unsupervised learning applications, we do not know in advance how many subspaces there are, nor do we have any information about their dimensions. We develop a novel geometric analysis of an algorithm named Sparse Subspace Clustering (SSC) [Elhamifar, 09], which significantly broadens the range of problems where it is provably effective. For instance, we show that SSC can recover multiple subspaces, each of dimension comparable to the ambient dimension. We also prove that SSC can correctly cluster data points even when the subspaces of interest intersect. Further, we develop an extension of SSC that succeeds when the data set is corrupted with possibly overwhelmingly many outliers. Underlying our analysis are clear geometric insights, which may bear on other sparse recovery problems. A numerical study complements our theoretical analysis and demonstrates the effectiveness of these methods.
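
To make the idea behind SSC concrete, here is a minimal sketch, not the paper's implementation. It expresses each point as a sparse combination of the other points and then groups points whose coefficients link them. Several things are assumptions made for illustration: the sparse step is a Lasso relaxation solved with plain ISTA, spectral clustering is swapped for simple connected components on the affinity graph, and the toy data (two orthogonal planes in four dimensions, no noise, no outliers) is an easy case far from the intersecting-subspace and outlier settings the abstract discusses. All function names and parameter values (`lam`, `iters`, `tol`) are hypothetical.

```python
import math

def soft_threshold(v, t):
    """Proximal operator of t * |.|: shrink v toward zero by t."""
    return math.copysign(max(abs(v) - t, 0.0), v)

def sparse_self_representation(points, lam=0.05, iters=500):
    """For each unit-norm point x_i, approximately solve (with ISTA)
        min_c 0.5 * ||x_i - sum_j c_j x_j||^2 + lam * ||c||_1,  with c_i = 0.
    Returns the coefficient matrix C as a list of rows."""
    n, dim = len(points), len(points[0])
    # With unit-norm points the gradient's Lipschitz constant is at most n,
    # so 1/n is a safe (conservative) step size.
    step = 1.0 / n
    C = []
    for i, x in enumerate(points):
        c = [0.0] * n
        for _ in range(iters):
            # Residual r = sum_j c_j x_j - x_i, computed once per iteration.
            r = [sum(c[j] * points[j][k] for j in range(n)) - x[k]
                 for k in range(dim)]
            for j in range(n):
                if j == i:
                    continue  # a point may not represent itself
                grad = sum(points[j][k] * r[k] for k in range(dim))
                c[j] = soft_threshold(c[j] - step * grad, step * lam)
        C.append(c)
    return C

def connected_components(C, tol=1e-6):
    """Label points by connected component of the graph whose edge weights
    are |C[u][v]| + |C[v][u]| (a stand-in for spectral clustering)."""
    n = len(C)
    labels = [-1] * n
    current = 0
    for start in range(n):
        if labels[start] != -1:
            continue
        labels[start] = current
        stack = [start]
        while stack:
            u = stack.pop()
            for v in range(n):
                if labels[v] == -1 and abs(C[u][v]) + abs(C[v][u]) > tol:
                    labels[v] = current
                    stack.append(v)
        current += 1
    return labels

# Toy data (assumed): three unit vectors in each of two orthogonal planes of R^4.
angles = [math.radians(t) for t in (0.0, 50.0, 100.0)]
plane_a = [(math.cos(t), math.sin(t), 0.0, 0.0) for t in angles]
plane_b = [(0.0, 0.0, math.cos(t), math.sin(t)) for t in angles]
points = plane_a + plane_b

labels = connected_components(sparse_self_representation(points))
print(labels)
```

On this toy input the first three points and the last three points fall into separate groups, because a point's sparse representation only uses points from its own plane. Note that the number of subspaces is never passed in; it falls out of the affinity graph, which mirrors the abstract's setting where the number and dimensions of the subspaces are unknown.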
