Statistics – Machine Learning
Scientific paper
2010-03-03
Statistics
Machine Learning
Scientific paper
We introduce supervised latent Dirichlet allocation (sLDA), a statistical model of labelled documents. The model accommodates a variety of response types. We derive an approximate maximum-likelihood procedure for parameter estimation, which relies on variational methods to handle intractable posterior expectations. Prediction problems motivate this research: we use the fitted model to predict response values for new documents. We test sLDA on two real-world problems: movie ratings predicted from reviews, and the political tone of amendments in the U.S. Senate based on the amendment text. We illustrate the benefits of sLDA versus modern regularized regression, as well as versus an unsupervised LDA analysis followed by a separate regression.
Blei David M.
McAuliffe Jon D.
No associations
LandOfFree
Supervised Topic Models does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.
If you have personal experience with Supervised Topic Models, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Supervised Topic Models will most certainly appreciate the feedback.
Profile ID: LFWR-SCP-O-684487