Online Learning: Stochastic and Constrained Adversaries

Statistics – Machine Learning

Scientific paper


Date: 2011-04-27
arXiv: arxiv.org/abs/1104.5070v1
Category: Statistics
Subcategory: Machine Learning

Type: Scientific paper
Abstract: Learning theory has largely focused on two main learning scenarios. The first is the classical statistical setting, where instances are drawn i.i.d. from a fixed distribution. The second is the completely adversarial online learning setting, where at every time step the adversary picks the worst instance to present to the learner. It can be argued that in the real world neither of these assumptions is reasonable. It is therefore important to study problems with a range of assumptions on the data. Unfortunately, theoretical results in this area are scarce, possibly due to the absence of general tools for analysis. Focusing on the regret formulation, we define the minimax value of a game where the adversary is restricted in his moves. The framework captures stochastic and non-stochastic assumptions on data. Building on the sequential symmetrization approach, we define a notion of distribution-dependent Rademacher complexity for the spectrum of problems ranging from i.i.d. to worst-case. The bounds let us immediately deduce variation-type bounds. We then consider the i.i.d. adversary and show equivalence of online and batch learnability. In the supervised setting, we consider various hybrid assumptions on the way that the x and y variables are chosen. Finally, we consider smoothed learning problems and show that half-spaces are online learnable in the smoothed model. In fact, exponentially small noise added to the adversary's decisions turns this problem with infinite Littlestone dimension into a learnable problem.
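
The last claim of the abstract can be made concrete with a small sketch. The code below is not taken from the paper; it is an illustration, under stated assumptions, of why half-spaces are hard against an unrestricted adversary. It uses one-dimensional half-spaces (thresholds on [0, 1]) and a standard bisection adversary that always queries the midpoint of the interval of still-consistent thresholds and then reveals the label opposite to the learner's prediction. All names (`play_bisection_adversary`, `consistent_learner`, `comparator_mistakes`) are hypothetical and chosen for this example only.

```python
from typing import Callable, List, Tuple

Example = Tuple[float, int]  # (instance x in [0, 1], label y in {0, 1})
Learner = Callable[[List[Example], float], int]


def consistent_learner(history: List[Example], x: float) -> int:
    """Illustrative learner: predict 1 iff x is at least the smallest
    instance labelled 1 so far (predict 0 if no positive label was seen)."""
    positives = [xi for xi, yi in history if yi == 1]
    return 1 if positives and x >= min(positives) else 0


def play_bisection_adversary(learner: Learner, rounds: int):
    """Unrestricted adversary for thresholds h_theta(x) = 1 if x >= theta else 0.

    It keeps an interval (lo, hi] of thresholds consistent with all labels
    revealed so far, queries the midpoint, and reveals the label the learner
    did NOT predict, so the learner errs in every round.
    """
    lo, hi = 0.0, 1.0
    history: List[Example] = []
    mistakes = 0
    for _ in range(rounds):
        x = (lo + hi) / 2.0
        y_hat = learner(history, x)
        y = 1 - y_hat          # force a mistake
        mistakes += 1
        if y == 1:
            hi = x             # label 1 at x means theta <= x
        else:
            lo = x             # label 0 at x means theta > x
        history.append((x, y))
    return history, mistakes, (lo, hi)


def comparator_mistakes(history: List[Example], theta: float) -> int:
    """Number of mistakes the fixed threshold theta makes on the sequence."""
    return sum(int(x >= theta) != y for x, y in history)
```

Calling `play_bisection_adversary(consistent_learner, 30)` returns a sequence on which the learner errs in all 30 rounds, while the threshold `theta = hi` from the returned interval makes no mistakes, so the regret equals the number of rounds. The round count is kept small only because floating-point midpoints eventually stop being strictly inside the interval. In the smoothed model described in the abstract, noise added to the adversary's chosen `x` prevents it from placing queries exactly at the midpoint; the sketch does not implement that model or the paper's analysis.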

Affiliated with

Rakhlin Alexander

Computer Science – Learning

Scientist

Sridharan Karthik

Computer Science – Learning

Scientist

Tewari Ambuj

Computer Science – Learning

Scientist
