Closing the Learning-Planning Loop with Predictive State Representations

Computer Science – Learning

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Scientific paper

A central problem in artificial intelligence is that of planning to maximize future reward under uncertainty in a partially observable environment. In this paper we propose and demonstrate a novel algorithm which accurately learns a model of such an environment directly from sequences of action-observation pairs. We then close the loop from observations to actions by planning in the learned model and recovering a policy which is near-optimal in the original environment. Specifically, we present an efficient and statistically consistent spectral algorithm for learning the parameters of a Predictive State Representation (PSR). We demonstrate the algorithm by learning a model of a simulated high-dimensional, vision-based mobile robot planning task, and then perform approximate point-based planning in the learned PSR. Analysis of our results shows that the algorithm learns a state space which efficiently captures the essential features of the environment. This representation allows accurate prediction with a small number of parameters, and enables successful and efficient planning.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Closing the Learning-Planning Loop with Predictive State Representations does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Closing the Learning-Planning Loop with Predictive State Representations, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Closing the Learning-Planning Loop with Predictive State Representations will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-532907

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.