A Finite-Time Analysis of Multi-armed Bandits Problems with Kullback-Leibler Divergences

Mathematics – Statistics Theory

Scientific paper


We consider a Kullback-Leibler-based algorithm for the stochastic multi-armed bandit problem in the case of distributions with finite supports (not necessarily known beforehand), whose asymptotic regret matches the lower bound of \cite{Burnetas96}. Our contribution is a finite-time analysis of this algorithm; we obtain bounds whose main terms are smaller than those of previously known algorithms with finite-time analyses (such as UCB-type algorithms).
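
To make the role of the Kullback-Leibler divergence concrete, here is a minimal Python sketch. It is not the algorithm analysed in the paper. The first function computes the divergence between two distributions on a shared finite support. The second and third illustrate a KL-based upper confidence index in the simpler Bernoulli case, under the assumption of a log(t) exploration budget. All function names and the choice of budget are hypothetical, introduced only for illustration.

```python
import math


def kl_divergence(p, q):
    """KL(p || q) for two distributions given as probability lists
    over the same finite support."""
    total = 0.0
    for pi, qi in zip(p, q):
        if pi == 0.0:
            continue  # convention: 0 * log(0 / q) = 0
        if qi == 0.0:
            return math.inf  # p puts mass where q has none
        total += pi * math.log(pi / qi)
    return total


def bernoulli_kl(p, q, eps=1e-12):
    """KL divergence between Bernoulli(p) and Bernoulli(q),
    with both parameters clipped away from 0 and 1."""
    p = min(max(p, eps), 1.0 - eps)
    q = min(max(q, eps), 1.0 - eps)
    return p * math.log(p / q) + (1.0 - p) * math.log((1.0 - p) / (1.0 - q))


def kl_upper_index(mean, pulls, t, iters=50):
    """Illustrative index (an assumption, not the paper's rule):
    the largest q in [mean, 1] with pulls * KL(mean, q) <= log(t),
    found by bisection since KL(mean, q) increases in q on [mean, 1]."""
    budget = math.log(t) / pulls
    lo, hi = mean, 1.0
    for _ in range(iters):
        mid = (lo + hi) / 2.0
        if bernoulli_kl(mean, mid) <= budget:
            lo = mid
        else:
            hi = mid
    return lo
```

An arm pulled more often gets a tighter index for the same empirical mean, so `kl_upper_index(0.5, 100, 100)` is smaller than `kl_upper_index(0.5, 10, 100)`. A bandit strategy of this family would pull the arm with the largest index at each round.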
