Multi-Armed Bandits in Metric Spaces

Computer Science – Data Structures and Algorithms

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

16 pages, 0 figures

Scientific paper

In a multi-armed bandit problem, an online algorithm chooses from a set of strategies in a sequence of trials so as to maximize the total payoff of the chosen strategies. While the performance of bandit algorithms with a small finite strategy set is quite well understood, bandit problems with large strategy sets are still a topic of very active investigation, motivated by practical applications such as online auctions and web advertisement. The goal of such research is to identify broad and natural classes of strategy sets and payoff functions which enable the design of efficient solutions. In this work we study a very general setting for the multi-armed bandit problem in which the strategies form a metric space, and the payoff function satisfies a Lipschitz condition with respect to the metric. We refer to this problem as the "Lipschitz MAB problem". We present a complete solution for the multi-armed problem in this setting. That is, for every metric space (L,X) we define an isometry invariant which bounds from below the performance of Lipschitz MAB algorithms for X, and we present an algorithm which comes arbitrarily close to meeting this bound. Furthermore, our technique gives even better results for benign payoff functions.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Multi-Armed Bandits in Metric Spaces does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Multi-Armed Bandits in Metric Spaces, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Multi-Armed Bandits in Metric Spaces will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-48746

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.