GraphLab: A Distributed Framework for Machine Learning in the Cloud

Computer Science – Learning

Scientific paper

Details

CMU Tech Report, GraphLab project webpage: http://graphlab.org

Machine learning (ML) techniques are indispensable in a wide range of fields. Unfortunately, the exponential increase in dataset sizes is rapidly extending the runtime of sequential algorithms and threatening to slow future progress in ML. With the promise of affordable large-scale parallel computing, cloud systems offer a viable platform for resolving the computational challenges in ML. However, designing and implementing efficient, provably correct distributed ML algorithms is often prohibitively challenging. To enable ML researchers to use parallel systems easily and efficiently, we introduced the GraphLab abstraction, which is designed to represent the computational patterns in ML algorithms while permitting efficient parallel and distributed implementations. In this paper we provide a formal description of the GraphLab parallel abstraction and present an efficient distributed implementation. We conduct a comprehensive evaluation of GraphLab on three state-of-the-art ML algorithms using real large-scale data and a 64-node EC2 cluster with 512 processors. We find that GraphLab achieves orders-of-magnitude performance gains over Hadoop while performing comparably to or better than hand-tuned MPI implementations.
