Computer Science – Learning
Scientific paper
2009-03-19
Computer Science
Learning
Scientific paper
This paper introduces a new approach to solve sensor management problems. Classically sensor management problems can be well formalized as Partially-Observed Markov Decision Processes (POMPD). The original approach developped here consists in deriving the optimal parameterized policy based on a stochastic gradient estimation. We assume in this work that it is possible to learn the optimal policy off-line (in simulation) using models of the environement and of the sensor(s). The learned policy can then be used to manage the sensor(s). In order to approximate the gradient in a stochastic context, we introduce a new method to approximate the gradient, based on Infinitesimal Perturbation Approximation (IPA). The effectiveness of this general framework is illustrated by the managing of an Electronically Scanned Array Radar. First simulations results are finally proposed.
Bréhard Thomas
Coquelin Pierre-Arnaud
Duflos Emmanuel
Vanheeghe Philippe
No associations
LandOfFree
Optimal Policies Search for Sensor Management does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.
If you have personal experience with Optimal Policies Search for Sensor Management, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Optimal Policies Search for Sensor Management will most certainly appreciate the feedback.
Profile ID: LFWR-SCP-O-648689