Evaluation of text data mining for database curation: lessons learned from the KDD Challenge Cup

Computer Science – Computation and Language

Scientific paper

Details

9 pages. This is close to how it appears on the publisher's website (http://bioinformatics.oupjournals.org/cgi/reprint/19/supp

MOTIVATION: The biological literature is a major repository of knowledge. Many biological databases draw much of their content from careful curation of this literature. However, as the volume of literature increases, so does the burden of curation. Text mining may provide useful tools to assist in the curation process. To date, the lack of standards has made it impossible to determine whether text mining techniques are sufficiently mature to be useful.

RESULTS: We report on a Challenge Evaluation task that we created for the Knowledge Discovery and Data Mining (KDD) Challenge Cup. We provided a training corpus of 862 journal articles curated in FlyBase, along with the associated lists of genes and gene products, as well as the relevant data fields from FlyBase. For the test, we provided a corpus of 213 new ('blind') articles; the 18 participating groups provided systems that flagged articles for curation, based on whether the article contained experimental evidence for gene expression products. We report on the evaluation results and describe the techniques used by the top performing groups.

CONTACT: asy@mitre.org

KEYWORDS: text mining, evaluation, curation, genomics, data management
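
The flagging task described in the abstract amounts to a yes/no decision per article, compared against the curators' decisions. The abstract does not say which scoring measure was used, so the following is only a minimal sketch that assumes precision, recall, and balanced F-measure. The function name `score_flags`, the article identifiers, and the sample decisions are hypothetical and do not come from the challenge data.

```python
from typing import Dict


def score_flags(gold: Dict[str, bool], predicted: Dict[str, bool]) -> Dict[str, float]:
    """Score yes/no curation flags against gold-standard decisions.

    Assumption: precision, recall, and F1 are used as the measures;
    the abstract does not name the actual scoring scheme.
    An article missing from `predicted` is treated as not flagged.
    """
    tp = fp = fn = 0
    for article_id, should_curate in gold.items():
        flagged = predicted.get(article_id, False)
        if flagged and should_curate:
            tp += 1  # correctly flagged for curation
        elif flagged and not should_curate:
            fp += 1  # flagged, but curators found no evidence
        elif should_curate:
            fn += 1  # missed an article that needed curation

    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"precision": precision, "recall": recall, "f1": f1}


# Hypothetical toy data: four articles, made-up identifiers and decisions.
gold = {"a1": True, "a2": False, "a3": True, "a4": False}
predicted = {"a1": True, "a2": True, "a3": False, "a4": False}
scores = score_flags(gold, predicted)
print(scores)
```

In this toy case one article is flagged correctly, one is flagged wrongly, and one is missed, so all three measures come out at 0.5. A real evaluation would run the same comparison over the 213 blind test articles.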
