Computer Science – Neural and Evolutionary Computing
Scientific paper
1998-12-03
Computer Science
Neural and Evolutionary Computing
Scientific paper
A new training algorithm is presented for delayed reinforcement learning problems that does not assume the existence of a critic model and employs the polytope optimization algorithm to adjust the weights of the action network so that a simple direct measure of the training performance is maximized. Experimental results from the application of the method to the pole balancing problem indicate improved training performance compared with critic-based and genetic reinforcement approaches.
Lagaris I. E.
Likas A.
No associations
LandOfFree
Training Reinforcement Neurocontrollers Using the Polytope Algorithm does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.
If you have personal experience with Training Reinforcement Neurocontrollers Using the Polytope Algorithm, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Training Reinforcement Neurocontrollers Using the Polytope Algorithm will most certainly appreciate the feedback.
Profile ID: LFWR-SCP-O-208471