Computer Science – Computation and Language
Scientific paper
2008-07-02
Computer Science
Computation and Language
3 pages
Scientific paper
The algorithm of the creation texts parallel corpora was presented. The algorithm is based on the use of "key words" in text documents, and on the means of their automated translation. Key words were singled out by means of using Russian and Ukrainian morphological dictionaries, as well as dictionaries of the translation of nouns for the Russian and Ukrainianlanguages. Besides, to calculate the weights of the terms in the documents, empiric-statistic rules were used. The algorithm under consideration was realized in the form of a program complex, integrated into the content-monitoring InfoStream system. As a result, a parallel bilingual corpora of web-publications containing about 30 thousand documents, was created
Lande D. V.
Zhygalo V. V.
No associations
LandOfFree
About the creation of a parallel bilingual corpora of web-publications does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.
If you have personal experience with About the creation of a parallel bilingual corpora of web-publications, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and About the creation of a parallel bilingual corpora of web-publications will most certainly appreciate the feedback.
Profile ID: LFWR-SCP-O-422031