Accelerating Reinforcement Learning through Implicit Imitation

Computer Science – Learning

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Scientific paper

10.1613/jair.898

Imitation can be viewed as a means of enhancing learning in multiagent environments. It augments an agent's ability to learn useful behaviors by making intelligent use of the knowledge implicit in behaviors demonstrated by cooperative teachers or other more experienced agents. We propose and study a formal model of implicit imitation that can accelerate reinforcement learning dramatically in certain cases. Roughly, by observing a mentor, a reinforcement-learning agent can extract information about its own capabilities in, and the relative value of, unvisited parts of the state space. We study two specific instantiations of this model, one in which the learning agent and the mentor have identical abilities, and one designed to deal with agents and mentors with different action sets. We illustrate the benefits of implicit imitation by integrating it with prioritized sweeping, and demonstrating improved performance and convergence through observation of single and multiple mentors. Though we make some stringent assumptions regarding observability and possible interactions, we briefly comment on extensions of the model that relax these restricitions.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Accelerating Reinforcement Learning through Implicit Imitation does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Accelerating Reinforcement Learning through Implicit Imitation, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Accelerating Reinforcement Learning through Implicit Imitation will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-224079

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.