Statistics – Machine Learning
Scientific paper
2009-02-08
Statistics
Machine Learning
26 pages, 6 figures, submitted
Scientific paper
In this paper we propose a computationally efficient algorithm for on-line variable selection in multivariate regression problems involving high dimensional data streams. The algorithm recursively extracts all the latent factors of a partial least squares solution and selects the most important variables for each factor. This is achieved by means of only one sparse singular value decomposition which can be efficiently updated on-line and in an adaptive fashion. Simulation results based on artificial data streams demonstrate that the algorithm is able to select important variables in dynamic settings where the correlation structure among the observed streams is governed by a few hidden components and the importance of each variable changes over time. We also report on an application of our algorithm to a multivariate version of the "enhanced index tracking" problem using financial data streams. The application consists of performing on-line asset allocation with the objective of overperforming two benchmark indices simultaneously.
McWilliams Brian
Montana Giovanni
No associations
LandOfFree
Sparse partial least squares for on-line variable selection in multivariate data streams does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.
If you have personal experience with Sparse partial least squares for on-line variable selection in multivariate data streams, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Sparse partial least squares for on-line variable selection in multivariate data streams will most certainly appreciate the feedback.
Profile ID: LFWR-SCP-O-581221