Efficient Inference in Markov Control Problems

Computer Science – Systems and Control

Scientific paper

Markov control algorithms that perform smooth, non-greedy updates of the policy have been shown to be very general and versatile, with policy gradient and Expectation Maximisation algorithms being particularly popular. For these algorithms, marginal inference of the reward-weighted trajectory distribution is required to perform policy updates. We discuss a new exact inference algorithm for these marginals in the finite-horizon case that is more efficient than the standard approach based on classical forward-backward recursions. We also provide a principled extension to infinite-horizon Markov Decision Problems that explicitly accounts for the infinite horizon. This extension provides a novel algorithm for both policy gradients and Expectation Maximisation in infinite-horizon problems.
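
For orientation, the following is a minimal sketch of the classical forward-backward baseline that the abstract compares against, not the new algorithm proposed in the paper. It assumes a small tabular, undiscounted, finite-horizon problem with a stationary stochastic policy; the function name `reward_weighted_marginals` and the array layouts `P[s, a, s_next]`, `R[s, a]` and `pi[s, a]` are illustrative assumptions, not notation taken from the paper.

```python
import numpy as np


def reward_weighted_marginals(p0, P, R, pi, H):
    """Classical forward-backward pass (baseline sketch, assumed layout).

    p0[s]            initial state distribution
    P[s, a, s_next]  transition probabilities
    R[s, a]          non-negative rewards
    pi[s, a]         stationary policy, rows sum to 1
    H                horizon, time steps t = 0 .. H-1
    """
    S, A = R.shape

    # Forward pass: alpha[t, s] is the state marginal p(s_t = s).
    alpha = np.zeros((H, S))
    alpha[0] = p0
    for t in range(1, H):
        joint = alpha[t - 1][:, None] * pi            # p(s_{t-1}, a_{t-1})
        alpha[t] = np.einsum("sa,sat->t", joint, P)   # sum out s_{t-1}, a_{t-1}

    # Backward pass: Q[t, s, a] is the expected reward from step t onward.
    Q = np.zeros((H, S, A))
    Q[H - 1] = R
    for t in range(H - 2, -1, -1):
        V_next = (pi * Q[t + 1]).sum(axis=1)          # value at step t+1
        Q[t] = R + P @ V_next                         # one-step backup

    # Unnormalised reward-weighted state-action marginal at each step.
    marg = alpha[:, :, None] * pi[None, :, :] * Q
    return alpha, Q, marg


# Small random problem (all sizes and values are arbitrary illustrations).
rng = np.random.default_rng(0)
S, A, H = 4, 2, 5
P = rng.random((S, A, S))
P /= P.sum(axis=2, keepdims=True)
R = rng.random((S, A))
pi = rng.random((S, A))
pi /= pi.sum(axis=1, keepdims=True)
p0 = np.full(S, 1.0 / S)

alpha, Q, marg = reward_weighted_marginals(p0, P, R, pi, H)

# Expected total reward computed directly from the forward marginals.
expected_return = sum((alpha[t][:, None] * pi * R).sum() for t in range(H))
```

In this sketch the two passes each cost on the order of H times the size of the transition array. Summing `marg[0]` over states and actions recovers the expected total reward, which gives a simple consistency check on the recursions.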
