Computer Science – Databases
Scientific paper
2007-03-13
Computer Science
Databases
Scientific paper
A data warehouse cannot materialize all possible views, hence we must estimate quickly, accurately, and reliably the size of views to determine the best candidates for materialization. Many available techniques for view-size estimation make particular statistical assumptions and their error can be large. Comparatively, unassuming probabilistic techniques are slower, but they estimate accurately and reliability very large view sizes using little memory. We compare five unassuming hashing-based view-size estimation techniques including Stochastic Probabilistic Counting and LogLog Probabilistic Counting. Our experiments show that only Generalized Counting, Gibbons-Tirthapura, and Adaptive Counting provide universally tight estimates irrespective of the size of the view; of those, only Adaptive Counting remains constantly fast as we increase the memory budget.
Aouiche Kamel
Lemire Daniel
No associations
LandOfFree
A Comparison of Five Probabilistic View-Size Estimation Techniques in OLAP does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.
If you have personal experience with A Comparison of Five Probabilistic View-Size Estimation Techniques in OLAP, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and A Comparison of Five Probabilistic View-Size Estimation Techniques in OLAP will most certainly appreciate the feedback.
Profile ID: LFWR-SCP-O-438765