Towards Effective Sentence Simplification for Automatic Processing of Biomedical Text

Computer Science – Computation and Language

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

4 pages, In Proc. of the NAACL-HLT 2009, Boulder, USA, June

Scientific paper

The complexity of sentences characteristic to biomedical articles poses a challenge to natural language parsers, which are typically trained on large-scale corpora of non-technical text. We propose a text simplification process, bioSimplify, that seeks to reduce the complexity of sentences in biomedical abstracts in order to improve the performance of syntactic parsers on the processed sentences. Syntactic parsing is typically one of the first steps in a text mining pipeline. Thus, any improvement in performance would have a ripple effect over all processing steps. We evaluated our method using a corpus of biomedical sentences annotated with syntactic links. Our empirical results show an improvement of 2.90% for the Charniak-McClosky parser and of 4.23% for the Link Grammar parser when processing simplified sentences rather than the original sentences in the corpus.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Towards Effective Sentence Simplification for Automatic Processing of Biomedical Text does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Towards Effective Sentence Simplification for Automatic Processing of Biomedical Text, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Towards Effective Sentence Simplification for Automatic Processing of Biomedical Text will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-129994

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.