Training Reinforcement Neurocontrollers Using the Polytope Algorithm

Computer Science – Neural and Evolutionary Computing

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Scientific paper

A new training algorithm is presented for delayed reinforcement learning problems that does not assume the existence of a critic model and employs the polytope optimization algorithm to adjust the weights of the action network so that a simple direct measure of the training performance is maximized. Experimental results from the application of the method to the pole balancing problem indicate improved training performance compared with critic-based and genetic reinforcement approaches.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Training Reinforcement Neurocontrollers Using the Polytope Algorithm does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Training Reinforcement Neurocontrollers Using the Polytope Algorithm, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Training Reinforcement Neurocontrollers Using the Polytope Algorithm will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-208471

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.