Automatic Wrapper Adaptation by Tree Edit Distance Matching

Computer Science – Artificial Intelligence

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

7 pages, 3 figures, In Proceedings of the 2nd International Workshop on Combinations of Intelligent Methods and Applications (

Scientific paper

Information distributed through the Web keeps growing faster day by day, and for this reason, several techniques for extracting Web data have been suggested during last years. Often, extraction tasks are performed through so called wrappers, procedures extracting information from Web pages, e.g. implementing logic-based techniques. Many fields of application today require a strong degree of robustness of wrappers, in order not to compromise assets of information or reliability of data extracted. Unfortunately, wrappers may fail in the task of extracting data from a Web page, if its structure changes, sometimes even slightly, thus requiring the exploiting of new techniques to be automatically held so as to adapt the wrapper to the new structure of the page, in case of failure. In this work we present a novel approach of automatic wrapper adaptation based on the measurement of similarity of trees through improved tree edit distance matching techniques.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Automatic Wrapper Adaptation by Tree Edit Distance Matching does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Automatic Wrapper Adaptation by Tree Edit Distance Matching, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Automatic Wrapper Adaptation by Tree Edit Distance Matching will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-607020

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.