Mathematics – Statistics Theory
Scientific paper
2007-08-14
Annals of Statistics 2007, Vol. 35, No. 2, 543-574
Published at http://dx.doi.org/10.1214/009053606000001415 in the Annals of Statistics (http://www.imstat.org/aos/) by the Inst
10.1214/009053606000001415
We investigate the problem of finding confidence sets for split points in decision trees (CART). Our main results establish the asymptotic distribution of the least squares estimators and some associated residual sum of squares statistics in a binary decision tree approximation to a smooth regression curve. Cube-root asymptotics with nonnormal limit distributions are involved. We study various confidence sets for the split point, one calibrated using the subsampling bootstrap, and others calibrated using plug-in estimates of some nuisance parameters. The performance of the confidence sets is assessed in a simulation study. A motivation for developing such confidence sets comes from the problem of phosphorus pollution in the Everglades. Ecologists have suggested that split points provide a phosphorus threshold at which biological imbalance occurs, and the lower endpoint of the confidence set may be interpreted as a level that is protective of the ecosystem. This is illustrated using data from a Duke University Wetlands Center phosphorus dosing study in the Everglades.
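As an illustration of the least squares split point estimator mentioned in the abstract, the following is a minimal sketch, not the authors' code. It fits a single-split tree (a stump) to paired observations by scanning every candidate split and keeping the one with the smallest residual sum of squares. The function name `least_squares_split` and the convention of reporting the split as the largest x value in the left group are assumptions made here for illustration; the minimizer is not unique between adjacent x values.

```python
def least_squares_split(x, y):
    """Fit a one-split stump by least squares.

    Returns (split, left_mean, right_mean, rss), where observations with
    x <= split are fitted by left_mean and the rest by right_mean.
    Illustrative helper; name and return convention are assumptions.
    """
    pairs = sorted(zip(x, y))
    n = len(pairs)
    if n < 2:
        raise ValueError("need at least two observations")
    xs = [p[0] for p in pairs]
    ys = [p[1] for p in pairs]
    total = sum(ys)
    total_sq = sum(v * v for v in ys)

    best = None
    left_sum = 0.0
    for i in range(1, n):  # i = number of points in the left group
        left_sum += ys[i - 1]
        if xs[i] == xs[i - 1]:
            continue  # cannot place a split between tied x values
        right_sum = total - left_sum
        # RSS of the two-mean fit: sum(y^2) - S_left^2/n_left - S_right^2/n_right
        rss = total_sq - left_sum ** 2 / i - right_sum ** 2 / (n - i)
        if best is None or rss < best[3]:
            best = (xs[i - 1], left_sum / i, right_sum / (n - i), rss)
    if best is None:
        raise ValueError("all x values are identical")
    return best


if __name__ == "__main__":
    # Toy data with a jump between x = 3 and x = 4 (made up for illustration).
    print(least_squares_split([1, 2, 3, 4, 5, 6], [0, 0, 0, 1, 1, 1]))
```

This sketch covers only the point estimate. The confidence sets studied in the paper additionally require calibrating the cube-root-rate limit distribution, by subsampling or by plug-in estimates of nuisance parameters, which is not attempted here.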
Banerjee Moulinath
McKeague Ian W.
Confidence sets for split points in decision trees