A Bayesian Model for Supervised Clustering with the Dirichlet Process Prior

Computer Science – Learning

Scientific paper

Date: 2009-07-04
Source: arxiv.org/abs/0907.0808v1
Subject: Computer Science
Subfield: Learning

Type: Scientific paper
Abstract: We develop a Bayesian framework for tackling the supervised clustering problem, the generic problem encountered in tasks such as reference matching, coreference resolution, identity uncertainty and record linkage. Our clustering model is based on the Dirichlet process prior, which enables us to define distributions over the countably infinite sets that naturally arise in this problem. We add supervision to our model by positing the existence of a set of unobserved random variables (we call these "reference types") that are generic across all clusters. Inference in our framework, which requires integrating over infinitely many parameters, is solved using Markov chain Monte Carlo techniques. We present algorithms for both conjugate and non-conjugate priors. We present a simple but general parameterization of our model based on a Gaussian assumption. We evaluate this model on one artificial task and three real-world tasks, comparing it against both unsupervised and state-of-the-art supervised algorithms. Our results show that our model is able to outperform other models across a variety of tasks and performance metrics.
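
The Dirichlet process prior named in the abstract induces a distribution over partitions with an unbounded number of clusters, often described as the Chinese restaurant process. The following is a minimal sketch of drawing one partition from that prior alone; it is not the paper's model, which also includes reference types and MCMC inference over data. The function name `sample_crp_partition` and the concentration parameter name `alpha` are illustrative assumptions, not taken from the paper.

```python
import random
from typing import List, Optional


def sample_crp_partition(
    n_items: int, alpha: float, rng: Optional[random.Random] = None
) -> List[int]:
    """Draw cluster labels for n_items from a Chinese restaurant process.

    Hypothetical helper for illustration only. `alpha` is the
    concentration parameter: larger values favor more clusters.
    """
    if alpha <= 0:
        raise ValueError("alpha must be positive")
    rng = rng or random.Random()

    labels: List[int] = []   # labels[i] is the cluster index of item i
    counts: List[int] = []   # counts[k] is the current size of cluster k

    for _ in range(n_items):
        # An existing cluster k is chosen with weight counts[k];
        # a brand-new cluster is chosen with weight alpha.
        weights = counts + [alpha]
        k = rng.choices(range(len(weights)), weights=weights)[0]
        if k == len(counts):
            counts.append(1)      # open a new cluster
        else:
            counts[k] += 1        # join an existing cluster
        labels.append(k)
    return labels


if __name__ == "__main__":
    # Ten items, moderate concentration; output varies with the seed.
    print(sample_crp_partition(10, alpha=1.0, rng=random.Random(0)))
```

The number of clusters is not fixed in advance: it grows with the number of items at a rate governed by `alpha`, which is the property that makes this prior a fit for problems such as record linkage, where the number of underlying entities is unknown.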

Affiliated with

Hal Daumé III

Daniel Marcu
