Robustness and accuracy of methods for high dimensional data analysis based on Student's t statistic

Statistics – Methodology

Scientific paper


Details

37 pages, 5 figures


Student's $t$ statistic is finding applications today that were never envisaged when it was introduced more than a century ago. Many of these applications rely on properties, for example robustness against heavy-tailed sampling distributions, that were not explicitly considered until relatively recently. In this paper we explore these features of the $t$ statistic in the context of its application to very high-dimensional problems, including feature selection and ranking, highly multiple hypothesis testing, and sparse, high-dimensional signal detection. Robustness properties of the $t$-ratio are highlighted, and it is established that those properties are preserved under applications of the bootstrap. In particular, bootstrap methods correct for skewness, and therefore lead to second-order accuracy, even in the extreme tails. Indeed, it is shown that the bootstrap, and also the more popular but less accurate $t$-distribution and normal approximations, are more effective in the tails than towards the middle of the distribution. These properties motivate new methods, for example bootstrap-based techniques for signal detection, that confine attention to the significant tail of a statistic.
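As an illustration of the kind of bootstrap calibration the abstract refers to, the following is a minimal sketch of an upper-tail p-value for a one-sample $t$ statistic, computed from its bootstrap distribution. It is not taken from the paper: the function names `t_stat` and `bootstrap_t_pvalue`, the parameters `mu0`, `n_boot` and `rng`, and the add-one p-value convention are all assumptions chosen for the example, and the paper's own procedures for high-dimensional problems may differ.

```python
import numpy as np


def t_stat(x, mu0=0.0):
    """One-sample Student's t statistic for H0: mean == mu0."""
    x = np.asarray(x, dtype=float)
    n = x.size
    return np.sqrt(n) * (x.mean() - mu0) / x.std(ddof=1)


def bootstrap_t_pvalue(x, mu0=0.0, n_boot=2000, rng=None):
    """Upper-tail p-value from the bootstrap distribution of the t statistic.

    Hypothetical helper for illustration only; not the paper's code.
    """
    rng = np.random.default_rng(rng)
    x = np.asarray(x, dtype=float)
    n = x.size
    t_obs = t_stat(x, mu0)

    # Draw n_boot resamples of size n, with replacement, from the data.
    idx = rng.integers(0, n, size=(n_boot, n))
    xb = x[idx]

    # Studentize each resample, centring at the original sample mean so
    # that the bootstrap statistic mimics the t statistic under H0.
    sd = xb.std(axis=1, ddof=1)
    valid = sd > 0  # skip degenerate resamples with zero spread
    t_boot = np.sqrt(n) * (xb[valid].mean(axis=1) - x.mean()) / sd[valid]

    # Add-one convention (an assumption) keeps the p-value above zero.
    return (1 + np.sum(t_boot >= t_obs)) / (t_boot.size + 1)


if __name__ == "__main__":
    data = np.random.default_rng(0).exponential(size=30)  # skewed sample
    print(bootstrap_t_pvalue(data, mu0=1.0, rng=1))
```

Because each resample is studentized in the same way as the original statistic, the bootstrap distribution reflects the skewness of the data, which the $t$-distribution and normal approximations do not. In a multiple-testing setting one would apply such a calibration to each feature separately; that step is omitted here.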
