Indirect Cross-validation for Density Estimation

Statistics – Methodology

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

26 pages, 10 figures

Scientific paper

A new method of bandwidth selection for kernel density estimators is proposed. The method, termed indirect cross-validation, or ICV, makes use of so-called selection kernels. Least squares cross-validation (LSCV) is used to select the bandwidth of a selection-kernel estimator, and this bandwidth is appropriately rescaled for use in a Gaussian kernel estimator. The proposed selection kernels are linear combinations of two Gaussian kernels, and need not be unimodal or positive. Theory is developed showing that the relative error of ICV bandwidths can converge to 0 at a rate of $n^{-1/4}$, which is substantially better than the $n^{-1/10}$ rate of LSCV. Interestingly, the selection kernels that are best for purposes of bandwidth selection are very poor if used to actually estimate the density function. This property appears to be part of the larger and well-documented paradox to the effect that "the harder the estimation problem, the better cross-validation performs." The ICV method uniformly outperforms LSCV in a simulation study, a real data example, and a simulated example in which bandwidths are chosen locally.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Indirect Cross-validation for Density Estimation does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Indirect Cross-validation for Density Estimation, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Indirect Cross-validation for Density Estimation will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-139685

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.