MDPs with Unawareness

Computer Science – Artificial Intelligence

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

11 pages

Scientific paper

Markov decision processes (MDPs) are widely used for modeling decision-making problems in robotics, automated control, and economics. Traditional MDPs assume that the decision maker (DM) knows all states and actions. However, this may not be true in many situations of interest. We define a new framework, MDPs with unawareness (MDPUs) to deal with the possibilities that a DM may not be aware of all possible actions. We provide a complete characterization of when a DM can learn to play near-optimally in an MDPU, and give an algorithm that learns to play near-optimally when it is possible to do so, as efficiently as possible. In particular, we characterize when a near-optimal solution can be found in polynomial time.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

MDPs with Unawareness does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with MDPs with Unawareness, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and MDPs with Unawareness will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-76967

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.