Data Mining Using High Performance Data Clouds: Experimental Studies Using Sector and Sphere

Computer Science – Distributed – Parallel – and Cluster Computing

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Scientific paper

We describe the design and implementation of a high performance cloud that we have used to archive, analyze and mine large distributed data sets. By a cloud, we mean an infrastructure that provides resources and/or services over the Internet. A storage cloud provides storage services, while a compute cloud provides compute services. We describe the design of the Sector storage cloud and how it provides the storage services required by the Sphere compute cloud. We also describe the programming paradigm supported by the Sphere compute cloud. Sector and Sphere are designed for analyzing large data sets using computer clusters connected with wide area high performance networks (for example, 10+ Gb/s). We describe a distributed data mining application that we have developed using Sector and Sphere. Finally, we describe some experimental studies comparing Sector/Sphere to Hadoop.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Data Mining Using High Performance Data Clouds: Experimental Studies Using Sector and Sphere does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Data Mining Using High Performance Data Clouds: Experimental Studies Using Sector and Sphere, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Data Mining Using High Performance Data Clouds: Experimental Studies Using Sector and Sphere will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-185364

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.