Statistics – Machine Learning
Scientific paper
2012-01-08
The hierarchical Dirichlet process (HDP) has become an important Bayesian nonparametric model for grouped data, such as document collections. The HDP is used to construct a flexible mixed-membership model in which the number of components is determined by the data. As with most Bayesian nonparametric models, exact posterior inference is intractable, so practitioners use Markov chain Monte Carlo (MCMC) or variational inference. Inspired by the split-merge MCMC algorithm for the Dirichlet process (DP) mixture model, we describe a novel split-merge MCMC sampling algorithm for posterior inference in the HDP. We study its properties on both synthetic data and text corpora. We find that split-merge MCMC for the HDP can provide significant improvements over traditional Gibbs sampling, and we offer some insight into the data properties that give rise to larger improvements.
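As a rough illustration of what "the number of components is determined by the data" can mean, the sketch below samples a partition from a Chinese restaurant process, the standard sequential view of a single Dirichlet process. It is not the paper's split-merge algorithm and does not implement the HDP; the function name `sample_crp_partition`, the concentration parameter `alpha`, and the `seed` argument are assumptions introduced only for this example.

```python
import random


def sample_crp_partition(n_items, alpha, seed=0):
    """Sample cluster labels for n_items from a Chinese restaurant process.

    alpha is the (assumed, illustrative) concentration parameter; it must be > 0.
    Returns (assignments, table_sizes).
    """
    rng = random.Random(seed)
    table_sizes = []   # table_sizes[k] = number of items currently in cluster k
    assignments = []   # assignments[i] = cluster label of item i
    for _ in range(n_items):
        # An existing cluster k is chosen with weight table_sizes[k];
        # a brand-new cluster is chosen with weight alpha.
        weights = table_sizes + [alpha]
        k = rng.choices(range(len(weights)), weights=weights)[0]
        if k == len(table_sizes):
            table_sizes.append(1)      # open a new cluster
        else:
            table_sizes[k] += 1        # join an existing cluster
        assignments.append(k)
    return assignments, table_sizes


if __name__ == "__main__":
    # The number of clusters is not fixed in advance; it emerges from the draw.
    for n in (10, 100, 1000):
        _, sizes = sample_crp_partition(n, alpha=1.0, seed=1)
        print(n, "items ->", len(sizes), "clusters")
```

Under this prior the number of clusters is random and tends to grow slowly as more items arrive. The HDP shares clusters across groups, which this single-group sketch leaves out.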
Blei David M.
Wang Chong
A Split-Merge MCMC Algorithm for the Hierarchical Dirichlet Process