Generalized Error Exponents for Sparse Sample Goodness of Fit Tests

Mathematics – Statistics Theory

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

47 pages, 5 figure, submitted to The Annals of Statistics

Scientific paper

We investigate the sparse sample goodness-of-fit problem, where the number of samples $n$ is smaller than the size of the alphabet $m$. The goal of this work is to find an appropriate criterion to analyze statistical tests in this setting. A suitable model for analysis is the high-dimensional model in which both $n$ and $m$ tend to infinity, and $n=o(m)$. We propose a new performance criterion based on large deviation analysis, which generalizes the classical error exponent applicable for large sample problems (in which $m=O(n)$). This new criterion provides insights that are not available from asymptotic consistency or CLT analysis. The main results are: (i) The best achievable probability of error $P_e$ decays as $-\log(P_e)=(n^2/m)(1+o(1))J$ for some $J>0$. (ii) A well-known coincidence-based test attains the optimal generalized error exponent. (iii) The widely used Pearson's chi-square test has J=0. (iv) The contributions (i)-(iii) are established under the assumption that the distribution under the null hypothesis is uniform. For the non-uniform case, a new test is proposed, with a non-zero generalized error exponent.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Generalized Error Exponents for Sparse Sample Goodness of Fit Tests does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Generalized Error Exponents for Sparse Sample Goodness of Fit Tests, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Generalized Error Exponents for Sparse Sample Goodness of Fit Tests will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-36876

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.