Sparse linear discriminant analysis by thresholding for high dimensional data

Mathematics – Statistics Theory

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Published in at http://dx.doi.org/10.1214/10-AOS870 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of

Scientific paper

10.1214/10-AOS870

In many social, economical, biological and medical studies, one objective is to classify a subject into one of several classes based on a set of variables observed from the subject. Because the probability distribution of the variables is usually unknown, the rule of classification is constructed using a training sample. The well-known linear discriminant analysis (LDA) works well for the situation where the number of variables used for classification is much smaller than the training sample size. Because of the advance in technologies, modern statistical studies often face classification problems with the number of variables much larger than the sample size, and the LDA may perform poorly. We explore when and why the LDA has poor performance and propose a sparse LDA that is asymptotically optimal under some sparsity conditions on the unknown parameters. For illustration of application, we discuss an example of classifying human cancer into two classes of leukemia based on a set of 7,129 genes and a training sample of size 72. A simulation is also conducted to check the performance of the proposed method.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Sparse linear discriminant analysis by thresholding for high dimensional data does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Sparse linear discriminant analysis by thresholding for high dimensional data, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Sparse linear discriminant analysis by thresholding for high dimensional data will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-53708

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.