Reengineering PDF-Based Documents Targeting Complex Software Specifications

Computer Science – Digital Libraries

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

27 pages, 15 figures; International Journal of Knowledge and Web Intelligence (IJKWI), Inderscience Publishers, 2011

Scientific paper

10.1504/IJKWI.2011.045165

This article aims at reengineering of PDF-based complex documents, where specifications of the Object Management Group (OMG) are our initial targets. Our motivation is that such specifications are dense and intricate to use, and tend to have complicated structures. Our objective is therefore to create an approach that allows us to reengineer PDF-based documents, and to illustrate how to make more usable versions of electronic documents (such as specifications, technical books, etc) so that end users to have a better experience with them. The first step was to extract the logical structure of the document in a meaningful XML format for subsequent processing. Our initial assumption was that, many key concepts of a document are expressed in this structure. In the next phase, we created a multilayer hypertext version of the document to facilitate browsing and navigating. Although we initially focused on OMG software specifications, we chose a general approach for different phases of our work including format conversions, logical structure extraction, text extraction, multilayer hypertext generation, and concept exploration. As a consequence, we can process other complex documents to achieve our goals.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Reengineering PDF-Based Documents Targeting Complex Software Specifications does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Reengineering PDF-Based Documents Targeting Complex Software Specifications, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Reengineering PDF-Based Documents Targeting Complex Software Specifications will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-294733

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.