Restricted Value Iteration: Theory and Algorithms

Computer Science – Artificial Intelligence

Scientific paper


Details


DOI: 10.1613/jair.1379

Value iteration is a popular algorithm for finding near-optimal policies for partially observable Markov decision processes (POMDPs). It is inefficient because it must account for the entire belief space, which requires solving a large number of linear programs. In this paper, we study value iteration restricted to belief subsets. We show that, with properly chosen belief subsets, restricted value iteration yields near-optimal policies, and we give a condition for determining whether a given belief subset will bring savings in space and time. We also apply restricted value iteration to two classes of POMDPs: informative POMDPs and near-discernible POMDPs.
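
To make the notion of a belief subset concrete, the sketch below shows the standard Bayesian belief update for a POMDP and one simple way to collect a finite set of beliefs: those reachable from an initial belief within a few steps. This is only an illustration under stated assumptions, not the construction used in the paper. The function names (`belief_update`, `reachable_beliefs`), the table layout (`T[a][s][s2]` for transition probabilities, `O[a][s2][z]` for observation probabilities), and the two-state example model are all hypothetical.

```python
from itertools import product


def belief_update(b, a, z, T, O):
    """Bayes update: b'(s2) is proportional to O[a][s2][z] * sum_s T[a][s][s2] * b[s]."""
    n = len(b)
    unnorm = [
        O[a][s2][z] * sum(T[a][s][s2] * b[s] for s in range(n))
        for s2 in range(n)
    ]
    total = sum(unnorm)
    if total == 0.0:
        return None  # observation z cannot occur after action a from belief b
    return [x / total for x in unnorm]


def reachable_beliefs(b0, T, O, depth, ndigits=9):
    """Collect beliefs reachable from b0 in at most `depth` update steps.

    Beliefs are rounded to `ndigits` decimals so that numerically
    identical beliefs are stored only once.
    """
    n_actions = len(T)
    n_obs = len(O[0][0])
    seen = {tuple(round(x, ndigits) for x in b0)}
    frontier = [list(b0)]
    for _ in range(depth):
        next_frontier = []
        for b, a, z in product(frontier, range(n_actions), range(n_obs)):
            b2 = belief_update(b, a, z, T, O)
            if b2 is None:
                continue
            key = tuple(round(x, ndigits) for x in b2)
            if key not in seen:
                seen.add(key)
                next_frontier.append(b2)
        frontier = next_frontier
    return sorted(seen)


# Hypothetical model: two states, one action that leaves the state
# unchanged, and an observation that reports the true state with
# probability 0.85.
T = [[[1.0, 0.0], [0.0, 1.0]]]
O = [[[0.85, 0.15], [0.15, 0.85]]]
subset = reachable_beliefs([0.5, 0.5], T, O, depth=1)
```

Running value iteration only over such a subset, rather than over the whole belief simplex, is the general idea the abstract refers to; how the subset must be chosen for the resulting policy to remain near-optimal is the subject of the paper itself.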
