Mathematics – Statistics Theory
Scientific paper
2009-11-19
Annals of Statistics 2009, Vol. 37, No. 6B, 4104-4130
Published in the Annals of Statistics (http://www.imstat.org/aos/) at http://dx.doi.org/10.1214/09-AOS709 by the Institute of
10.1214/09-AOS709
Principal Component Analysis (PCA) is an important tool for dimension reduction, especially when the dimension (or the number of variables) is very high. Asymptotic studies in which the sample size is fixed and the dimension grows [i.e., High Dimension, Low Sample Size (HDLSS)] are becoming increasingly relevant. We investigate the asymptotic behavior of the Principal Component (PC) directions. HDLSS asymptotics are used to study consistency, strong inconsistency and subspace consistency. We show that if the first few eigenvalues of a population covariance matrix are large enough compared to the others, then the corresponding estimated PC directions are consistent or converge to the appropriate subspace (subspace consistency), and most other PC directions are strongly inconsistent. Broad sets of sufficient conditions for each of these cases are specified, and the main theorem gives a catalogue of possible combinations. In preparation for these results, we show that the geometric representation of HDLSS data holds under general conditions, which include a $\rho$-mixing condition and a broad range of sphericity measures of the covariance matrix.
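The contrast between consistent and strongly inconsistent PC directions can be illustrated with a small simulation. The following is a minimal sketch, not the authors' method. It assumes a single-spike Gaussian model with population covariance diag(d**alpha, 1, ..., 1), a known zero mean, and a fixed sample size n = 20. The exponent `alpha`, the helper names `first_pc_angle` and `mean_angle`, and all numeric settings are assumptions made for illustration. Under these assumptions one would expect the angle between the first sample PC direction and the true direction to shrink toward 0 degrees as d grows when the spike is large (alpha = 1.5), and to approach 90 degrees when the spike is small (alpha = 0.3).

```python
import numpy as np


def first_pc_angle(d, n, alpha, rng):
    """Angle in degrees between the first sample PC direction and e_1."""
    # Assumed population covariance: diag(d**alpha, 1, ..., 1),
    # so the true first PC direction is the first coordinate axis e_1.
    scale = np.ones(d)
    scale[0] = np.sqrt(float(d) ** alpha)
    # n independent mean-zero Gaussian observations, one per row.
    x = rng.standard_normal((n, d)) * scale
    # The rows of vt are eigenvectors of x.T @ x, i.e. the sample PC
    # directions (no centering, since the mean is zero by construction).
    _, _, vt = np.linalg.svd(x, full_matrices=False)
    cosine = min(abs(vt[0, 0]), 1.0)
    return float(np.degrees(np.arccos(cosine)))


def mean_angle(d, n, alpha, rng, reps=5):
    """Average the angle over a few independent replications."""
    angles = [first_pc_angle(d, n, alpha, rng) for _ in range(reps)]
    return float(np.mean(angles))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    n = 20  # sample size stays fixed while the dimension d grows
    for alpha in (0.3, 1.5):
        for d in (100, 1000, 10000):
            angle = mean_angle(d, n, alpha, rng)
            print(f"alpha={alpha}, d={d}: mean angle {angle:.1f} degrees")
```

Working with the thin SVD of the n-by-d data matrix avoids forming the d-by-d sample covariance matrix, which is convenient in the HDLSS setting where d is much larger than n.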
Jung Sungkyu
Marron Stephen J.
PCA consistency in high dimension, low sample size context