Concentration inequalities of the cross-validation estimator for Empirical Risk Minimiser

Statistics – Machine Learning

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

24 pages, 2 figures

Scientific paper

In this article, we derive concentration inequalities for the cross-validation estimate of the generalization error for empirical risk minimizers. In the general setting, we prove sanity-check bounds in the spirit of \cite{KR99} \textquotedblleft\textit{bounds showing that the worst-case error of this estimate is not much worse that of training error estimate} \textquotedblright . General loss functions and class of predictors with finite VC-dimension are considered. We closely follow the formalism introduced by \cite{DUD03} to cover a large variety of cross-validation procedures including leave-one-out cross-validation, $k$% -fold cross-validation, hold-out cross-validation (or split sample), and the leave-$\upsilon$-out cross-validation. In particular, we focus on proving the consistency of the various cross-validation procedures. We point out the interest of each cross-validation procedure in terms of rate of convergence. An estimation curve with transition phases depending on the cross-validation procedure and not only on the percentage of observations in the test sample gives a simple rule on how to choose the cross-validation. An interesting consequence is that the size of the test sample is not required to grow to infinity for the consistency of the cross-validation procedure.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Concentration inequalities of the cross-validation estimator for Empirical Risk Minimiser does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Concentration inequalities of the cross-validation estimator for Empirical Risk Minimiser, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Concentration inequalities of the cross-validation estimator for Empirical Risk Minimiser will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-405406

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.