Joint Structured Models for Extraction from Overlapping Sources

Computer Science – Artificial Intelligence

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Scientific paper

We consider the problem of jointly training structured models for extraction from sources whose instances enjoy partial overlap. This has important applications like user-driven ad-hoc information extraction on the web. Such applications present new challenges in terms of the number of sources and their arbitrary pattern of overlap not seen by earlier collective training schemes applied on two sources. We present an agreement-based learning framework and alternatives within it to trade-off tractability, robustness to noise, and extent of agreement. We provide a principled scheme to discover low-noise agreement sets in unlabeled data across the sources. Through extensive experiments over 58 real datasets, we establish that our method of additively rewarding agreement over maximal segments of text provides the best trade-offs, and also scores over alternatives such as collective inference, staged training, and multi-view learning.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Joint Structured Models for Extraction from Overlapping Sources does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Joint Structured Models for Extraction from Overlapping Sources, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Joint Structured Models for Extraction from Overlapping Sources will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-723750

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.