Computer Science – Computation and Language
Scientific paper
2009-08-30
Computer Science
Computation and Language
Scientific paper
This paper presents the system called PATATRAS (PATent and Article Tracking, Retrieval and AnalysiS) realized for the IP track of CLEF 2009. Our approach presents three main characteristics: 1. The usage of multiple retrieval models (KL, Okapi) and term index definitions (lemma, phrase, concept) for the three languages considered in the present track (English, French, German) producing ten different sets of ranked results. 2. The merging of the different results based on multiple regression models using an additional validation set created from the patent collection. 3. The exploitation of patent metadata and of the citation structures for creating restricted initial working sets of patents and for producing a final re-ranking regression model. As we exploit specific metadata of the patent documents and the citation relations only at the creation of initial working sets and during the final post ranking step, our architecture remains generic and easy to extend.
Lopez Patrice
Romary Laurent
No associations
LandOfFree
Multiple Retrieval Models and Regression Models for Prior Art Search does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.
If you have personal experience with Multiple Retrieval Models and Regression Models for Prior Art Search, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Multiple Retrieval Models and Regression Models for Prior Art Search will most certainly appreciate the feedback.
Profile ID: LFWR-SCP-O-622061