Resampling methods for parameter-free and robust feature selection with mutual information

Computer Science – Learning

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Scientific paper

10.1016/j.neucom.2006.11.019

Combining the mutual information criterion with a forward feature selection strategy offers a good trade-off between optimality of the selected feature subset and computation time. However, it requires to set the parameter(s) of the mutual information estimator and to determine when to halt the forward procedure. These two choices are difficult to make because, as the dimensionality of the subset increases, the estimation of the mutual information becomes less and less reliable. This paper proposes to use resampling methods, a K-fold cross-validation and the permutation test, to address both issues. The resampling methods bring information about the variance of the estimator, information which can then be used to automatically set the parameter and to calculate a threshold to stop the forward procedure. The procedure is illustrated on a synthetic dataset as well as on real-world examples.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Resampling methods for parameter-free and robust feature selection with mutual information does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Resampling methods for parameter-free and robust feature selection with mutual information, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Resampling methods for parameter-free and robust feature selection with mutual information will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-651662

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.