Computer Science – Computation and Language
Scientific paper
2002-06-09
Computers and the Humanities, Vol.35, No.4, pp.389-420, 2001
Computer Science
Computation and Language
Scientific paper
Cross-language information retrieval (CLIR), where queries and documents are in different languages, has of late become one of the major topics within the information retrieval community. This paper proposes a Japanese/English CLIR system, where we combine a query translation and retrieval modules. We currently target the retrieval of technical documents, and therefore the performance of our system is highly dependent on the quality of the translation of technical terms. However, the technical term translation is still problematic in that technical terms are often compound words, and thus new terms are progressively created by combining existing base words. In addition, Japanese often represents loanwords based on its special phonogram. Consequently, existing dictionaries find it difficult to achieve sufficient coverage. To counter the first problem, we produce a Japanese/English dictionary for base words, and translate compound words on a word-by-word basis. We also use a probabilistic method to resolve translation ambiguity. For the second problem, we use a transliteration method, which corresponds words unlisted in the base word dictionary to their phonetic equivalents in the target language. We evaluate our system using a test collection for CLIR, and show that both the compound word translation and transliteration methods improve the system performance.
Fujii Atsushi
Ishikawa Tetsuya
No associations
LandOfFree
Japanese/English Cross-Language Information Retrieval: Exploration of Query Translation and Transliteration does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.
If you have personal experience with Japanese/English Cross-Language Information Retrieval: Exploration of Query Translation and Transliteration, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Japanese/English Cross-Language Information Retrieval: Exploration of Query Translation and Transliteration will most certainly appreciate the feedback.
Profile ID: LFWR-SCP-O-186136