BART: Bayesian additive regression trees

Statistics – Methodology

Scientific paper

Details

Published at http://dx.doi.org/10.1214/09-AOAS285 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Ins

10.1214/09-AOAS285

We develop a Bayesian "sum-of-trees" model where each tree is constrained by a regularization prior to be a weak learner, and fitting and inference are accomplished via an iterative Bayesian backfitting MCMC algorithm that generates samples from a posterior. Effectively, BART is a nonparametric Bayesian regression approach which uses dimensionally adaptive random basis elements. Motivated by ensemble methods in general, and boosting algorithms in particular, BART is defined by a statistical model: a prior and a likelihood. This approach enables full posterior inference including point and interval estimates of the unknown regression function as well as the marginal effects of potential predictors. By keeping track of predictor inclusion frequencies, BART can also be used for model-free variable selection. BART's many features are illustrated with a bake-off against competing methods on 42 different data sets, with a simulation experiment and on a drug discovery classification problem.
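
The backfitting structure described in the abstract can be illustrated with a small deterministic sketch. This is not the paper's MCMC sampler: it swaps posterior draws of each tree for a greedy least-squares stump (a one-split tree) and stands in for the regularization prior with a fixed shrinkage factor, so it produces a single point fit rather than posterior samples. The function names and the parameters `m`, `sweeps`, and `shrink` are hypothetical choices made for this illustration only.

```python
def fit_stump(x, r):
    """Least-squares one-split tree on 1-D inputs x for targets r.

    Returns (threshold, left_value, right_value).
    """
    pairs = sorted(zip(x, r))
    n = len(pairs)
    total = sum(v for _, v in pairs)
    best_gain = -1.0
    best = (pairs[0][0], total / n, total / n)  # fallback: no usable split
    left_sum = 0.0
    for k in range(1, n):
        left_sum += pairs[k - 1][1]
        if pairs[k - 1][0] == pairs[k][0]:
            continue  # cannot split between identical x values
        right_sum = total - left_sum
        # Maximizing this quantity is equivalent to minimizing squared error.
        gain = left_sum ** 2 / k + right_sum ** 2 / (n - k)
        if gain > best_gain:
            threshold = 0.5 * (pairs[k - 1][0] + pairs[k][0])
            best_gain = gain
            best = (threshold, left_sum / k, right_sum / (n - k))
    return best


def predict_stump(stump, xi):
    threshold, left, right = stump
    return left if xi <= threshold else right


def backfit(x, y, m=20, sweeps=10, shrink=0.3):
    """Fit a sum of m shrunken stumps by cycling over partial residuals."""
    n = len(x)
    stumps = [(0.0, 0.0, 0.0)] * m
    fits = [[0.0] * n for _ in range(m)]  # fits[j][i] = tree j evaluated at x[i]
    total = [0.0] * n                     # current sum over all trees
    for _ in range(sweeps):
        for j in range(m):
            # Partial residual: the response minus every tree except tree j.
            resid = [y[i] - (total[i] - fits[j][i]) for i in range(n)]
            t, left, right = fit_stump(x, resid)
            # Shrinking the leaf values keeps each tree a weak learner.
            stumps[j] = (t, shrink * left, shrink * right)
            new_fit = [predict_stump(stumps[j], xi) for xi in x]
            total = [total[i] - fits[j][i] + new_fit[i] for i in range(n)]
            fits[j] = new_fit
    return stumps


def predict(stumps, xi):
    """Sum-of-trees prediction at a single input."""
    return sum(predict_stump(s, xi) for s in stumps)


if __name__ == "__main__":
    # Toy data: a step function on an evenly spaced grid.
    x = [(i + 0.5) / 40 for i in range(40)]
    y = [-1.0 if xi < 0.5 else 1.0 for xi in x]
    model = backfit(x, y)
    print(predict(model, 0.1), predict(model, 0.9))
```

In the sketch, no single stump reproduces the response because each leaf value is shrunk, so the fit emerges only from the sum. In BART itself, each update is instead a random draw of a tree conditional on the same kind of partial residual, and the collected draws are what support the interval estimates and predictor inclusion frequencies mentioned above.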
