Computer Science – Databases
Scientific paper
2009-09-09
CIDR 2009
Workloads that comb through vast amounts of data are gaining importance in the sciences. These workloads consist of "needle in a haystack" queries that are long-running and data-intensive, so that query throughput limits performance. To maximize throughput for data-intensive queries, we put forth LifeRaft: a query processing system that batches queries with overlapping data requirements. Rather than scheduling queries in arrival order, LifeRaft executes queries concurrently against an ordering of the data that maximizes data sharing among queries. This decreases I/O and increases cache utility. However, such batch processing can increase query response time by starving interactive workloads. LifeRaft addresses starvation using techniques inspired by head scheduling in disk drives. Depending on workload saturation and queuing times, the system adaptively and incrementally trades off between processing queries in arrival order and data-driven batch processing. An evaluation of LifeRaft in the SkyQuery federation of astronomy databases shows a twofold improvement in query throughput.
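The trade-off described in the abstract, between data-driven batching and arrival-order processing, can be illustrated with a small sketch. This is not the paper's algorithm or scoring metric: the `Query` class, the `next_bucket` and `run` functions, the blending parameter `alpha`, the notion of fixed-cost "buckets", and the linear score are all assumptions made for illustration. With `alpha = 0.0` the scheduler reads the bucket wanted by the most pending queries; with `alpha = 1.0` it reads the bucket wanted by the longest-waiting query.

```python
from dataclasses import dataclass


@dataclass
class Query:
    qid: str
    arrival: float   # arrival time, arbitrary units (hypothetical field)
    buckets: set     # ids of data buckets the query still needs


def next_bucket(pending, now, alpha):
    """Pick the next bucket to read (illustrative scoring, not from the paper).

    alpha = 0.0 weighs sharing only; alpha = 1.0 weighs waiting time only.
    """
    demand = {}   # bucket -> number of pending queries that need it
    oldest = {}   # bucket -> longest wait among those queries
    for q in pending:
        for b in q.buckets:
            demand[b] = demand.get(b, 0) + 1
            oldest[b] = max(oldest.get(b, 0.0), now - q.arrival)
    if not demand:
        return None
    max_demand = max(demand.values())
    max_wait = max(oldest.values()) or 1.0   # avoid dividing by zero

    def score(b):
        return (1 - alpha) * demand[b] / max_demand + alpha * oldest[b] / max_wait

    # Iterating in sorted order makes ties resolve to the smallest bucket id.
    return max(sorted(demand), key=score)


def run(queries, alpha):
    """Return the bucket read order; one read serves every query needing it."""
    pending = [Query(q.qid, q.arrival, set(q.buckets)) for q in queries]
    now = max((q.arrival for q in pending), default=0.0)
    order = []
    while pending:
        b = next_bucket(pending, now, alpha)
        order.append(b)
        for q in pending:
            q.buckets.discard(b)
        pending = [q for q in pending if q.buckets]
        now += 1.0   # assumption: each bucket read costs one time unit
    return order


if __name__ == "__main__":
    qs = [
        Query("q1", 0.0, {"A"}),
        Query("q2", 5.0, {"B", "C"}),
        Query("q3", 6.0, {"B"}),
        Query("q4", 7.0, {"B", "C"}),
    ]
    print(run(qs, alpha=0.0))   # sharing first: the old query q1 waits longest
    print(run(qs, alpha=1.0))   # age first: q1's bucket is read first
```

In this toy workload both settings read each bucket once, but they differ in which query is served first. An adaptive system in the spirit of the abstract would move `alpha` with load and queuing time rather than fix it.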
Burns Randal
Malik Tanu
Wang Xiaodan
LifeRaft: Data-Driven, Batch Processing for the Exploration of Scientific Databases