Training Restricted Boltzmann Machines on Word Observations

Computer Science – Learning

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Scientific paper

The restricted Boltzmann machine (RBM) is a flexible tool for modeling complex data, however there have been significant computational difficulties in using RBMs to model high-dimensional multinomial observations. In natural language processing applications, words are naturally modeled by K-ary discrete distributions, where K is determined by the vocabulary size and can easily be in the hundred thousands. The conventional approach to training RBMs on word observations is limited because it requires sampling the states of K-way softmax visible units during block Gibbs updates, an operation that takes time linear in K. In this work, we address this issue by employing a more general class of Markov chain Monte Carlo operators on the visible units, yielding updates with computational complexity independent of K. We demonstrate the success of our approach by training RBMs on hundreds of millions of word n-grams using larger vocabularies than previously feasible with RBMs and using the learned features to improve performance on chunking and sentiment classification tasks, achieving state-of-the-art results on the latter.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Training Restricted Boltzmann Machines on Word Observations does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Training Restricted Boltzmann Machines on Word Observations, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Training Restricted Boltzmann Machines on Word Observations will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-215452

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.