Variable selection in high-dimensional linear models: partially faithful distributions and the PC-simple algorithm

Statistics – Methodology

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

20 pages, 3 figures

Scientific paper

10.1093/biomet/asq008

We consider variable selection in high-dimensional linear models where the number of covariates greatly exceeds the sample size. We introduce the new concept of partial faithfulness and use it to infer associations between the covariates and the response. Under partial faithfulness, we develop a simplified version of the PC algorithm (Spirtes et al., 2000), the PC-simple algorithm, which is computationally feasible even with thousands of covariates and provides consistent variable selection under conditions on the random design matrix that are of a different nature than coherence conditions for penalty-based approaches like the Lasso. Simulations and application to real data show that our method is competitive compared to penalty-based approaches. We provide an efficient implementation of the algorithm in the R-package pcalg.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Variable selection in high-dimensional linear models: partially faithful distributions and the PC-simple algorithm does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Variable selection in high-dimensional linear models: partially faithful distributions and the PC-simple algorithm, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Variable selection in high-dimensional linear models: partially faithful distributions and the PC-simple algorithm will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-223416

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.