Treelets--An adaptive multi-scale basis for sparse unordered data

Statistics – Methodology

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

This paper commented in: [arXiv:0807.4011], [arXiv:0807.4016], [arXiv:0807.4018], [arXiv:0807.4019], [arXiv:0807.4023], [arXiv

Scientific paper

10.1214/07-AOAS137

In many modern applications, including analysis of gene expression and text documents, the data are noisy, high-dimensional, and unordered--with no particular meaning to the given order of the variables. Yet, successful learning is often possible due to sparsity: the fact that the data are typically redundant with underlying structures that can be represented by only a few features. In this paper we present treelets--a novel construction of multi-scale bases that extends wavelets to nonsmooth signals. The method is fully adaptive, as it returns a hierarchical tree and an orthonormal basis which both reflect the internal structure of the data. Treelets are especially well-suited as a dimensionality reduction and feature selection tool prior to regression and classification, in situations where sample sizes are small and the data are sparse with unknown groupings of correlated or collinear variables. The method is also simple to implement and analyze theoretically. Here we describe a variety of situations where treelets perform better than principal component analysis, as well as some common variable selection and cluster averaging schemes. We illustrate treelets on a blocked covariance model and on several data sets (hyperspectral image data, DNA microarray data, and internet advertisements) with highly complex dependencies between variables.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Treelets--An adaptive multi-scale basis for sparse unordered data does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Treelets--An adaptive multi-scale basis for sparse unordered data, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Treelets--An adaptive multi-scale basis for sparse unordered data will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-180074

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.