Statistics – Machine Learning
Scientific paper
2011-06-13
Statistics
Machine Learning
8 pages, to be presented at UAI 2011
Scientific paper
We introduce the Pitman Yor Diffusion Tree (PYDT) for hierarchical clustering, a generalization of the Dirichlet Diffusion Tree (Neal, 2001) which removes the restriction to binary branching structure. The generative process is described and shown to result in an exchangeable distribution over data points. We prove some theoretical properties of the model and then present two inference methods: a collapsed MCMC sampler which allows us to model uncertainty over tree structures, and a computationally efficient greedy Bayesian EM search algorithm. Both algorithms use message passing on the tree structure. The utility of the model and algorithms is demonstrated on synthetic and real world data, both continuous and binary.
Ghahramani Zoubin
Knowles David A.
No associations
LandOfFree
Pitman-Yor Diffusion Trees does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.
If you have personal experience with Pitman-Yor Diffusion Trees, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Pitman-Yor Diffusion Trees will most certainly appreciate the feedback.
Profile ID: LFWR-SCP-O-429779