Unbiased Offline Evaluation of Contextual-bandit-based News Article Recommendation Algorithms

Computer Science – Learning

Scientific paper

Details

Date: 2010-03-31
Source: arxiv.org/abs/1003.5956v2
Field: Computer Science
Subject: Learning

Comments: 10 pages, 7 figures, revised from the published version at the WSDM 2011 conference
Type: Scientific paper
DOI: 10.1145/1935826.1935878
Abstract: Contextual bandit algorithms have become popular for online recommendation systems such as Digg, Yahoo! Buzz, and news recommendation in general. Offline evaluation of the effectiveness of new algorithms in these applications is critical for protecting online user experiences but very challenging due to their "partial-label" nature. Common practice is to create a simulator that mimics the online environment for the problem at hand and then run an algorithm against this simulator. However, creating the simulator itself is often difficult, and modeling bias is usually unavoidable. In this paper, we introduce a replay methodology for contextual bandit algorithm evaluation. Unlike simulator-based approaches, our method is completely data-driven and very easy to adapt to different applications. More importantly, our method can provide provably unbiased evaluations. Our empirical results on a large-scale news article recommendation dataset collected from Yahoo! Front Page conform well with our theoretical results. Furthermore, comparisons between our offline replay and online bucket evaluation of several contextual bandit algorithms show the accuracy and effectiveness of our offline evaluation method.

Affiliated with

Chu Wei

Computer Science – Learning

Scientist

Langford J. J.

Computer Science – Learning

Scientist

Li Lihong

Computer Science – Learning

Scientist

Wang Xuanhui

Computer Science – Information Retrieval

Scientist
