Computer Science – Robotics
Scientific paper
2011-08-23
Technical report accompanying a paper accepted to CDC 2011
We consider the problem of finding a control policy for a Markov Decision Process (MDP) that maximizes the probability of reaching some states while avoiding certain other states. This problem is motivated by applications in robotics, where such problems arise naturally when probabilistic models of robot motion are required to satisfy temporal logic task specifications. We transform this problem into a Stochastic Shortest Path (SSP) problem and develop a new approximate dynamic programming algorithm to solve it. This algorithm is of the actor-critic type and uses a least-squares temporal difference learning method. It operates on sample paths of the system and optimizes the policy within a pre-specified class parameterized by a parsimonious set of parameters. We show that it converges to a policy corresponding to a stationary point in the parameter space. Simulation results confirm the effectiveness of the proposed solution.
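To make the abstract's method concrete, the sketch below illustrates the general pattern of an actor-critic scheme with a least-squares temporal difference (LSTD) critic on a toy "reach the goal, avoid the trap" chain MDP. It is a minimal illustration under stated assumptions, not the paper's algorithm: the chain dynamics, the tabular softmax policy, the discount factor, the batch sizes, and the step size are all assumptions introduced here, and the paper's SSP transformation and parsimonious policy parameterization are not reproduced.

```python
import numpy as np

# Minimal sketch: LSTD actor-critic on a toy "reach the goal, avoid the
# trap" chain MDP. All names, sizes, and step rules are illustrative
# assumptions, not the paper's exact construction.

rng = np.random.default_rng(0)

N = 10                      # states 0..N; 0 is the trap, N is the goal
TRAP, GOAL = 0, N
ACTIONS = (-1, +1)          # move left / move right
SLIP = 0.2                  # probability the chosen move is reversed
GAMMA = 0.99                # discount; the paper's SSP version uses total cost

def step(s, a):
    """One transition: intended move with prob. 1-SLIP, else reversed."""
    move = ACTIONS[a] if rng.random() > SLIP else -ACTIONS[a]
    s2 = min(max(s + move, TRAP), GOAL)
    # Reward 1 only on reaching the goal, so the return from the start
    # state approximates the probability of reaching GOAL before TRAP.
    r = 1.0 if s2 == GOAL else 0.0
    return s2, r, s2 in (TRAP, GOAL)

def policy(theta, s):
    """Softmax (Boltzmann) policy over the two actions in state s."""
    p = np.exp(theta[s] - theta[s].max())
    return p / p.sum()

def compat_features(theta, s, a):
    """Compatible features psi(s,a) = grad_theta log pi(a|s), flattened."""
    psi = np.zeros_like(theta)
    psi[s, a] = 1.0
    psi[s] -= policy(theta, s)
    return psi.ravel()

theta = np.zeros((N + 1, 2))      # tabular policy parameters
alpha = 0.5                       # actor step size (assumed)

for it in range(200):
    # --- critic: LSTD over a batch of on-policy sample transitions ---
    d = theta.size
    A = 1e-3 * np.eye(d)          # small ridge term keeps A invertible
    b = np.zeros(d)
    samples = []
    for _ in range(50):           # 50 episodes per policy-gradient step
        s = N // 2
        a = rng.choice(2, p=policy(theta, s))
        done = False
        while not done:
            s2, r, done = step(s, a)
            psi = compat_features(theta, s, a)
            if done:              # terminal next value is zero
                A += np.outer(psi, psi)
            else:
                a2 = rng.choice(2, p=policy(theta, s2))
                psi2 = compat_features(theta, s2, a2)
                A += np.outer(psi, psi - GAMMA * psi2)
                s, a = s2, a2
            b += r * psi
            samples.append(psi)
    w = np.linalg.solve(A, b)     # LSTD estimate: Q(s,a) ~ psi . w

    # --- actor: policy-gradient step using the critic's Q estimates ---
    grad = np.mean([psi * (psi @ w) for psi in samples], axis=0)
    theta += alpha * grad.reshape(theta.shape)

# Estimate the reach probability under the learned policy.
wins = 0
for _ in range(1000):
    s, done = N // 2, False
    while not done:
        a = rng.choice(2, p=policy(theta, s))
        s, r, done = step(s, a)
    wins += s == GOAL
print(f"estimated P(reach goal before trap) = {wins / 1000:.2f}")
```

The critic approximates the Q-function over the compatible features psi = grad log pi, a standard choice in LSTD actor-critic methods, so that the actor update is an unbiased estimate of the policy gradient; the paper's specific critic, feature class, and convergence argument are in the full text.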
Calin A. Belta
Xu Chu Ding
Reza Moazzez Estanjini
Morteza Lahijanian
Ioannis Ch. Paschalidis