Estimating the joint distribution of independent categorical variables via model selection

Mathematics – Statistics Theory

Scientific paper

Details

Published in Bernoulli (http://isi.cbs.nl/bernoulli/) at http://dx.doi.org/10.3150/08-BEJ155 by the International Statisti

10.3150/08-BEJ155

Assume one observes independent categorical variables or, equivalently, the corresponding multinomial variables. Estimating the distribution of the observed sequence amounts to estimating the expectation of the multinomial sequence. A new estimator for this mean is proposed that is nonparametric, non-asymptotic and implementable even for large sequences. It is a penalized least-squares estimator based on wavelets, with a penalization term inspired by papers of Birgé and Massart. The estimator is proved to satisfy an oracle inequality and to be adaptive in the minimax sense over a class of Besov bodies. The method is embedded in a general framework which also allows us to recover an existing method for segmentation. Beyond theoretical results, a simulation study is reported and an application to real data is provided.
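
To make the idea in the abstract concrete, here is a minimal Python sketch. It is not the estimator of the paper. It assumes a sequence length that is a power of 2, uses the Haar wavelet, and replaces the Birgé and Massart style penalty with a hypothetical penalty `lam` times the number of retained detail coefficients. The function names (`one_hot`, `haar_forward`, `haar_inverse`, `penalized_haar_estimate`) are illustrative only. Under that simplified penalty, minimizing the penalized least-squares criterion over subsets of detail coefficients reduces to keeping a coefficient index when its squared magnitude, summed over categories, exceeds `lam`.

```python
import numpy as np

def one_hot(y, r):
    """Encode categories y[i] in {0, ..., r-1} as an (r, n) 0/1 matrix.
    Column i is one multinomial draw whose mean is the distribution of y[i]."""
    x = np.zeros((r, len(y)))
    x[y, np.arange(len(y))] = 1.0
    return x

def haar_forward(v):
    """Orthonormal Haar transform; len(v) is assumed to be a power of 2."""
    v = np.asarray(v, dtype=float).copy()
    details = []  # detail coefficients, finest level first
    while len(v) > 1:
        avg = (v[0::2] + v[1::2]) / np.sqrt(2.0)
        det = (v[0::2] - v[1::2]) / np.sqrt(2.0)
        details.append(det)
        v = avg
    return v, details

def haar_inverse(approx, details):
    """Invert haar_forward."""
    v = np.asarray(approx, dtype=float)
    for det in reversed(details):
        out = np.empty(2 * len(v))
        out[0::2] = (v + det) / np.sqrt(2.0)
        out[1::2] = (v - det) / np.sqrt(2.0)
        v = out
    return v

def penalized_haar_estimate(y, r, lam):
    """Simplified penalized least squares (an assumption, not the paper's penalty):
    minimize  ||x - proj_m(x)||^2 + lam * |m|  over sets m of detail indices.
    The same index set m is used for every category row."""
    x = one_hot(y, r)
    transforms = [haar_forward(x[j]) for j in range(r)]
    n_levels = len(transforms[0][1])
    keep = []
    for lev in range(n_levels):
        # Energy of each coefficient index, summed over the r categories.
        energy = sum(t[1][lev] ** 2 for t in transforms)
        keep.append(energy > lam)
    est = np.empty_like(x)
    for j, (approx, details) in enumerate(transforms):
        kept = [np.where(k, d, 0.0) for k, d in zip(keep, details)]
        est[j] = haar_inverse(approx, kept)
    return est

# Usage: 8 observations of a variable with 3 categories.
y = np.array([0, 0, 0, 0, 1, 1, 2, 2])
p_hat = penalized_haar_estimate(y, r=3, lam=0.3)
```

Because the same coefficients are retained for every category, each column of `p_hat` still sums to 1. Entries are not guaranteed to lie in [0, 1], so a practical version would need an additional projection step.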
