A role-free approach to indexing large RDF data sets in secondary memory for efficient SPARQL evaluation

Computer Science – Databases

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

12 pages, 5 figures, 2 tables

Scientific paper

Massive RDF data sets are becoming commonplace. RDF data is typically generated in social semantic domains (such as personal information management) wherein a fixed schema is often not available a priori. We propose a simple Three-way Triple Tree (TripleT) secondary-memory indexing technique to facilitate efficient SPARQL query evaluation on such data sets. The novelty of TripleT is that (1) the index is built over the atoms occurring in the data set, rather than at a coarser granularity, such as whole triples occurring in the data set; and (2) the atoms are indexed regardless of the roles (i.e., subjects, predicates, or objects) they play in the triples of the data set. We show through extensive empirical evaluation that TripleT exhibits multiple orders of magnitude improvement over the state of the art on RDF indexing, in terms of both storage and query processing costs.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

A role-free approach to indexing large RDF data sets in secondary memory for efficient SPARQL evaluation does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with A role-free approach to indexing large RDF data sets in secondary memory for efficient SPARQL evaluation, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and A role-free approach to indexing large RDF data sets in secondary memory for efficient SPARQL evaluation will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-228562

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.