Intrinsic dimension estimation of data by principal component analysis

Computer Science – Computer Vision and Pattern Recognition

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

8 pages, submitted for publication

Scientific paper

Estimating intrinsic dimensionality of data is a classic problem in pattern recognition and statistics. Principal Component Analysis (PCA) is a powerful tool in discovering dimensionality of data sets with a linear structure; it, however, becomes ineffective when data have a nonlinear structure. In this paper, we propose a new PCA-based method to estimate intrinsic dimension of data with nonlinear structures. Our method works by first finding a minimal cover of the data set, then performing PCA locally on each subset in the cover and finally giving the estimation result by checking up the data variance on all small neighborhood regions. The proposed method utilizes the whole data set to estimate its intrinsic dimension and is convenient for incremental learning. In addition, our new PCA procedure can filter out noise in data and converge to a stable estimation with the neighborhood region size increasing. Experiments on synthetic and real world data sets show effectiveness of the proposed method.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Intrinsic dimension estimation of data by principal component analysis does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Intrinsic dimension estimation of data by principal component analysis, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Intrinsic dimension estimation of data by principal component analysis will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-238551

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.