Search Driven Analysis of Heterogenous XML Data

Computer Science – Databases

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

CIDR 2009

Scientific paper

Analytical processing on XML repositories is usually enabled by designing complex data transformations that shred the documents into a common data warehousing schema. This can be very time-consuming and costly, especially if the underlying XML data has a lot of variety in structure, and only a subset of attributes constitutes meaningful dimensions and facts. Today, there is no tool to explore an XML data set, discover interesting attributes, dimensions and facts, and rapidly prototype an OLAP solution. In this paper, we propose a system, called SEDA that enables users to start with simple keyword-style querying, and interactively refine the query based on result summaries. SEDA then maps query results onto a set of known, or newly created, facts and dimensions, and derives a star schema and its instantiation to be fed into an off-the-shelf OLAP tool, for further analysis.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Search Driven Analysis of Heterogenous XML Data does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Search Driven Analysis of Heterogenous XML Data, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Search Driven Analysis of Heterogenous XML Data will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-386761

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.