A Data Driven Approach to Query Expansion in Question Answering

Computer Science – Computation and Language

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Scientific paper

Automated answering of natural language questions is an interesting and useful problem to solve. Question answering (QA) systems often perform information retrieval at an initial stage. Information retrieval (IR) performance, provided by engines such as Lucene, places a bound on overall system performance. For example, no answer bearing documents are retrieved at low ranks for almost 40% of questions. In this paper, answer texts from previous QA evaluations held as part of the Text REtrieval Conferences (TREC) are paired with queries and analysed in an attempt to identify performance-enhancing words. These words are then used to evaluate the performance of a query expansion method. Data driven extension words were found to help in over 70% of difficult questions. These words can be used to improve and evaluate query expansion methods. Simple blind relevance feedback (RF) was correctly predicted as unlikely to help overall performance, and an possible explanation is provided for its low value in IR for QA.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

A Data Driven Approach to Query Expansion in Question Answering does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with A Data Driven Approach to Query Expansion in Question Answering, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and A Data Driven Approach to Query Expansion in Question Answering will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-382421

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.