Testing Closeness of Discrete Distributions

Computer Science – Data Structures and Algorithms

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

26 pages, A preliminary version of this paper appeared in the 41st Symposium on Foundations of Computer Science, 2000, Redondo

Scientific paper

Given samples from two distributions over an $n$-element set, we wish to test whether these distributions are statistically close. We present an algorithm which uses sublinear in $n$, specifically, $O(n^{2/3}\epsilon^{-8/3}\log n)$, independent samples from each distribution, runs in time linear in the sample size, makes no assumptions about the structure of the distributions, and distinguishes the cases when the distance between the distributions is small (less than $\max\{\epsilon^{4/3}n^{-1/3}/32, \epsilon n^{-1/2}/4\}$) or large (more than $\epsilon$) in $\ell_1$ distance. This result can be compared to the lower bound of $\Omega(n^{2/3}\epsilon^{-2/3})$ for this problem given by Valiant. Our algorithm has applications to the problem of testing whether a given Markov process is rapidly mixing. We present sublinear for several variants of this problem as well.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Testing Closeness of Discrete Distributions does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Testing Closeness of Discrete Distributions, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Testing Closeness of Discrete Distributions will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-692585

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.