A Unified View of TD Algorithms; Introducing Full-Gradient TD and Equi-Gradient Descent TD

Computer Science – Learning

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Scientific paper

This paper addresses the issue of policy evaluation in Markov Decision Processes, using linear function approximation. It provides a unified view of algorithms such as TD(lambda), LSTD(lambda), iLSTD, residual-gradient TD. It is asserted that they all consist in minimizing a gradient function and differ by the form of this function and their means of minimizing it. Two new schemes are introduced in that framework: Full-gradient TD which uses a generalization of the principle introduced in iLSTD, and EGD TD, which reduces the gradient by successive equi-gradient descents. These three algorithms form a new intermediate family with the interesting property of making much better use of the samples than TD while keeping a gradient descent scheme, which is useful for complexity issues and optimistic policy iteration.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

A Unified View of TD Algorithms; Introducing Full-Gradient TD and Equi-Gradient Descent TD does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with A Unified View of TD Algorithms; Introducing Full-Gradient TD and Equi-Gradient Descent TD, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and A Unified View of TD Algorithms; Introducing Full-Gradient TD and Equi-Gradient Descent TD will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-372579

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.