Dual Control with Active Learning using Gaussian Process Regression

Mathematics – Optimization and Control

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Scientific paper

In many real world problems, control decisions have to be made with limited information. The controller may have no a priori (or even posteriori) data on the nonlinear system, except from a limited number of points that are obtained over time. This is either due to high cost of observation or the highly non-stationary nature of the system. The resulting conflict between information collection (identification, exploration) and control (optimization, exploitation) necessitates an active learning approach for iteratively selecting the control actions which concurrently provide the data points for system identification. This paper presents a dual control approach where the information acquired at each control step is quantified using the entropy measure from information theory and serves as the training input to a state-of-the-art Gaussian process regression (Bayesian learning) method. The explicit quantification of the information obtained from each data point allows for iterative optimization of both identification and control objectives. The approach developed is illustrated with two examples: control of logistic map as a chaotic system and position control of a cart with inverted pendulum.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Dual Control with Active Learning using Gaussian Process Regression does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Dual Control with Active Learning using Gaussian Process Regression, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Dual Control with Active Learning using Gaussian Process Regression will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-611750

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.