Modeling Online Reviews with Multi-grain Topic Models

Computer Science – Information Retrieval

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Scientific paper

In this paper we present a novel framework for extracting the ratable aspects of objects from online user reviews. Extracting such aspects is an important challenge in automatically mining product opinions from the web and in generating opinion-based summaries of user reviews. Our models are based on extensions to standard topic modeling methods such as LDA and PLSA to induce multi-grain topics. We argue that multi-grain models are more appropriate for our task since standard models tend to produce topics that correspond to global properties of objects (e.g., the brand of a product type) rather than the aspects of an object that tend to be rated by a user. The models we present not only extract ratable aspects, but also cluster them into coherent topics, e.g., `waitress' and `bartender' are part of the same topic `staff' for restaurants. This differentiates it from much of the previous work which extracts aspects through term frequency analysis with minimal clustering. We evaluate the multi-grain models both qualitatively and quantitatively to show that they improve significantly upon standard topic models.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Modeling Online Reviews with Multi-grain Topic Models does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Modeling Online Reviews with Multi-grain Topic Models, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Modeling Online Reviews with Multi-grain Topic Models will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-654867

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.