RIOT: I/O-Efficient Numerical Computing without SQL

Computer Science – Databases

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

CIDR 2009

Scientific paper

R is a numerical computing environment that is widely popular for statistical data analysis. Like many such environments, R performs poorly for large datasets whose sizes exceed that of physical memory. We present our vision of RIOT (R with I/O Transparency), a system that makes R programs I/O-efficient in a way transparent to the users. We describe our experience with RIOT-DB, an initial prototype that uses a relational database system as a backend. Despite the overhead and inadequacy of generic database systems in handling array data and numerical computation, RIOT-DB significantly outperforms R in many large-data scenarios, thanks to a suite of high-level, inter-operation optimizations that integrate seamlessly into R. While many techniques in RIOT are inspired by databases (and, for RIOT-DB, realized by a database system), RIOT users are insulated from anything database related. Compared with previous approaches that require users to learn new languages and rewrite their programs to interface with a database, RIOT will, we believe, be easier to adopt by the majority of the R users.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

RIOT: I/O-Efficient Numerical Computing without SQL does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with RIOT: I/O-Efficient Numerical Computing without SQL, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and RIOT: I/O-Efficient Numerical Computing without SQL will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-386652

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.