Combining Linguistic and Spatial Information for Document Analysis

Computer Science – Computation and Language

Scientific paper

Details

Appeared in: J. Mariani and D. Harman (Eds.) Proceedings of RIAO'2000 Content-Based Multimedia Information Access, CID, 2000.

We present a framework for analyzing color documents with complex layouts. No assumption is made about the layout. The framework combines two sources of information, textual and spatial, in a content-driven, bottom-up approach. To analyze the text, we use shallow natural language processing tools such as taggers and partial parsers. To infer relations in the logical layout, we resort to a qualitative spatial calculus closely related to Allen's calculus. We evaluate the system on documents from a color journal and present the results of extracting the reading order from the journal's pages. In this evaluation, the analysis succeeds in extracting the intended reading order from the document.
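As an illustration of the kind of qualitative spatial reasoning the abstract refers to, the sketch below classifies two one-dimensional intervals with the thirteen standard Allen relations and then applies that classification to each axis of two bounding boxes. This is not the authors' calculus, which the abstract describes only as "closely related" to Allen's. The names `allen_relation` and `box_relation`, the `(x_min, y_min, x_max, y_max)` box format, and the downward-growing y axis are all assumptions made for this example.

```python
from typing import Tuple

Interval = Tuple[float, float]           # (start, end), with start < end
Box = Tuple[float, float, float, float]  # assumed format: (x_min, y_min, x_max, y_max)


def allen_relation(a: Interval, b: Interval) -> str:
    """Return the Allen relation that holds between intervals a and b."""
    a1, a2 = a
    b1, b2 = b
    if a1 >= a2 or b1 >= b2:
        raise ValueError("intervals must have positive length")

    # Disjoint or touching intervals.
    if a2 < b1:
        return "before"
    if a2 == b1:
        return "meets"
    if b2 < a1:
        return "after"
    if b2 == a1:
        return "met-by"

    # Intervals sharing at least one endpoint.
    if a1 == b1 and a2 == b2:
        return "equals"
    if a1 == b1:
        return "starts" if a2 < b2 else "started-by"
    if a2 == b2:
        return "finishes" if a1 > b1 else "finished-by"

    # Strict containment.
    if b1 < a1 and a2 < b2:
        return "during"
    if a1 < b1 and b2 < a2:
        return "contains"

    # Only partial overlap remains.
    return "overlaps" if a1 < b1 else "overlapped-by"


def box_relation(a: Box, b: Box) -> Tuple[str, str]:
    """Describe two boxes by one Allen relation per axis (horizontal, vertical)."""
    horizontal = allen_relation((a[0], a[2]), (b[0], b[2]))
    vertical = allen_relation((a[1], a[3]), (b[1], b[3]))
    return horizontal, vertical


# Hypothetical page blocks; y is assumed to grow downward.
title = (0.0, 0.0, 100.0, 10.0)
body = (0.0, 12.0, 100.0, 80.0)
relation = box_relation(title, body)
```

Here `relation` is `("equals", "before")`: the two blocks span the same horizontal extent and the title lies entirely above the body. A layout analyzer could treat such a pair as a cue that the title is read first, although how the paper actually derives reading order from its relations is not stated in the abstract.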
