Computer Science – Information Retrieval
Scientific paper
2007-06-19
Dans 7i\`emes Journ\'ees francophones Extraction et Gestion des Connaissances EGC 2007 76300 (23/01/2007) pp. 533-538
Computer Science
Information Retrieval
The bibteX file has been replaced with the correct one
Scientific paper
The goal of our work is to use a set of reports and extract named entities, in our case the names of Industrial or Academic partners. Starting with an initial list of entities, we use a first set of documents to identify syntactic patterns that are then validated in a supervised learning phase on a set of annotated documents. The complete collection is then explored. This approach is similar to the ones used in data extraction from semi-structured documents (wrappers) and do not need any linguistic resources neither a large set for training. As our collection of documents would evolve over years, we hope that the performance of the extraction would improve with the increased size of the training set.
Despeyroux Thierry
Fraschini Eduardo
Vercoustre Anne-Marie
No associations
LandOfFree
Extraction d'entités dans des collections évolutives does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.
If you have personal experience with Extraction d'entités dans des collections évolutives, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Extraction d'entités dans des collections évolutives will most certainly appreciate the feedback.
Profile ID: LFWR-SCP-O-281748