Acronym-Meaning Extraction from Corpora Using Multi-Tape Weighted Finite-State Machines

Computer Science – Computation and Language

Scientific paper

Details

6 pages, LaTeX

The automatic extraction of acronyms and their meanings from corpora is an important sub-task of text mining. It can be seen as a special case of string alignment, where a text chunk is aligned with an acronym. Alternative alignments have different costs, and ideally the least costly one should give the correct meaning of the acronym. We show how this approach can be implemented with a 3-tape weighted finite-state machine (3-WFSM) that reads a text chunk on tape 1 and an acronym on tape 2, and generates all alternative alignments on tape 3. The 3-WFSM can be automatically generated from a simple regular expression. No additional algorithms are required at any stage. Our 3-WFSM has 27 states and 64 transitions, and finds the best analysis of an acronym in a few milliseconds.
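The idea of picking the least costly alignment between a text chunk and an acronym can be illustrated with a minimal sketch. This is not the paper's 3-WFSM: it is a plain dynamic-programming search, and the cost scheme (0 for an acronym letter matched at the start of a word, 1 for a letter matched inside a word) is an assumption chosen only for illustration. The function name `best_alignment` is likewise hypothetical. The returned string plays the role of the alignment output, with matched letters shown in upper case.

```python
from functools import lru_cache

INF = float("inf")

def best_alignment(chunk: str, acronym: str):
    """Return (cost, marked_chunk) for the cheapest alignment, or None.

    Assumed costs (illustrative, not taken from the paper):
    0 for a word-initial match, 1 for a word-internal match.
    """
    text = chunk.lower()
    acro = acronym.lower()

    def is_word_initial(i: int) -> bool:
        # A character starts a word if it is first or follows a non-alphanumeric.
        return i == 0 or not text[i - 1].isalnum()

    @lru_cache(maxsize=None)
    def solve(i: int, j: int):
        # Cheapest alignment of acro[j:] against text[i:] as (cost, positions).
        if j == len(acro):
            return 0, ()
        if i == len(text):
            return INF, ()
        # Option 1: leave text character i unmatched.
        best = solve(i + 1, j)
        # Option 2: match text character i with acronym letter j.
        if text[i] == acro[j]:
            step = 0 if is_word_initial(i) else 1
            cost, positions = solve(i + 1, j + 1)
            if step + cost < best[0]:
                best = (step + cost, (i,) + positions)
        return best

    cost, positions = solve(0, 0)
    if cost == INF:
        return None  # the acronym letters do not occur in order in the chunk
    marked = "".join(
        c.upper() if k in positions else c for k, c in enumerate(text)
    )
    return cost, marked

print(best_alignment("weighted finite-state machine", "WFSM"))
print(best_alignment("extensible markup language", "XML"))
```

Under these assumed costs, an alignment that uses only word-initial letters is always preferred over one that reaches into the middle of a word, which is the kind of preference a weighted alignment is meant to encode.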
