Computer Science – Digital Libraries
Scientific paper
2004-01-27
Computer Science
Digital Libraries
10 pages, 1 figure; accepted for publication in the proceedings of the 2004 Meeting of the International Federation of Classif
Scientific paper
We describe a system used by the NASA Astrophysics Data System to identify bibliographic references obtained from scanned article pages by OCR methods with records in a bibliographic database. We analyze the process generating the noisy references and conclude that the three-step procedure of correcting the OCR results, parsing the corrected string and matching it against the database provides unsatisfactory results. Instead, we propose a method that allows a controlled merging of correction, parsing and matching, inspired by dependency grammars. We also report on the effectiveness of various heuristics that we have employed to improve recall.
Accomazzi Alberto
Demleitner Markus
Eichhorn Gunther
Grant Carolyn S.
Kurtz Michael
No associations
LandOfFree
Automated Resolution of Noisy Bibliographic References does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.
If you have personal experience with Automated Resolution of Noisy Bibliographic References, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Automated Resolution of Noisy Bibliographic References will most certainly appreciate the feedback.
Profile ID: LFWR-SCP-O-646938