Statistics – Applications
Scientific paper
2009-03-11
Annals of Applied Statistics 2010, Vol. 4, No. 1, 503-519
Statistics
Applications
Published in at http://dx.doi.org/10.1214/09-AOAS277 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Ins
Scientific paper
10.1214/09-AOAS277
We revisit the problem of feature selection in linear discriminant analysis (LDA), that is, when features are correlated. First, we introduce a pooled centroids formulation of the multiclass LDA predictor function, in which the relative weights of Mahalanobis-transformed predictors are given by correlation-adjusted $t$-scores (cat scores). Second, for feature selection we propose thresholding cat scores by controlling false nondiscovery rates (FNDR). Third, training of the classifier is based on James--Stein shrinkage estimates of correlations and variances, where regularization parameters are chosen analytically without resampling. Overall, this results in an effective and computationally inexpensive framework for high-dimensional prediction with natural feature selection. The proposed shrinkage discriminant procedures are implemented in the R package ``sda'' available from the R repository CRAN.
Ahdesmäki Miika
Strimmer Korbinian
No associations
LandOfFree
Feature selection in omics prediction problems using cat scores and false nondiscovery rate control does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.
If you have personal experience with Feature selection in omics prediction problems using cat scores and false nondiscovery rate control, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Feature selection in omics prediction problems using cat scores and false nondiscovery rate control will most certainly appreciate the feedback.
Profile ID: LFWR-SCP-O-124198