Mean field for Markov Decision Processes: from Discrete to Continuous Optimization

Computer Science – Artificial Intelligence

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Scientific paper

We study the convergence of Markov Decision Processes made of a large number of objects to optimization problems on ordinary differential equations (ODE). We show that the optimal reward of such a Markov Decision Process, satisfying a Bellman equation, converges to the solution of a continuous Hamilton-Jacobi-Bellman (HJB) equation based on the mean field approximation of the Markov Decision Process. We give bounds on the difference of the rewards, and a constructive algorithm for deriving an approximating solution to the Markov Decision Process from a solution of the HJB equations. We illustrate the method on three examples pertaining respectively to investment strategies, population dynamics control and scheduling in queues are developed. They are used to illustrate and justify the construction of the controlled ODE and to show the gain obtained by solving a continuous HJB equation rather than a large discrete Bellman equation.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Mean field for Markov Decision Processes: from Discrete to Continuous Optimization does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Mean field for Markov Decision Processes: from Discrete to Continuous Optimization, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Mean field for Markov Decision Processes: from Discrete to Continuous Optimization will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-525654

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.