Estimation of English and non-English Language Use on the WWW

Computer Science – Computation and Language

Scientific paper


Details


The World Wide Web has grown so big, in such an anarchic fashion, that it is difficult to describe. One of the evident intrinsic characteristics of the World Wide Web is its multilinguality. Here, we present a technique for estimating the size of a language-specific corpus given the frequency of commonly occurring words in the corpus. We apply this technique to estimating the number of words available through Web browsers for given languages. Comparing data from 1996 to data from 1999 and 2000, we calculate the growth of a number of European languages on the Web. As expected, non-English languages are growing at a faster pace than English, though the position of English is still dominant.
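The abstract does not spell out the estimator, so the following is only a minimal sketch of one plausible reading of "estimating the size of a corpus given the frequency of commonly occurring words". It assumes that each common word has a known relative frequency in a reference corpus of the same language, and that the count of that word in the target corpus can be observed. The function names `estimate_corpus_size` and `growth_factor`, the use of the median to combine per-word estimates, and all numbers in the usage example are illustrative assumptions, not details taken from the paper.

```python
from statistics import median


def estimate_corpus_size(observed_counts, reference_rel_freq):
    """Estimate the total number of words in a target corpus.

    observed_counts:    word -> occurrences observed in the target corpus
    reference_rel_freq: word -> relative frequency (occurrences per word)
                        measured in a reference corpus of known size

    Each predictor word yields one estimate, count / relative_frequency.
    The median of those estimates is returned (an assumed choice, used
    here to damp the effect of any single atypical word).
    """
    estimates = []
    for word, count in observed_counts.items():
        freq = reference_rel_freq.get(word)
        if freq is None or freq <= 0:
            continue  # skip words with no usable reference frequency
        estimates.append(count / freq)
    if not estimates:
        raise ValueError("no usable predictor words")
    return median(estimates)


def growth_factor(size_early, size_late):
    """Ratio of a later corpus size estimate to an earlier one."""
    if size_early <= 0:
        raise ValueError("earlier size must be positive")
    return size_late / size_early


if __name__ == "__main__":
    # Hypothetical figures, chosen only to show the arithmetic.
    reference = {"the": 0.0625, "and": 0.03125, "of": 0.03125}
    counts_early = {"the": 6250, "and": 3125, "of": 3200}
    counts_late = {"the": 12500, "and": 6250, "of": 6400}

    early = estimate_corpus_size(counts_early, reference)
    late = estimate_corpus_size(counts_late, reference)
    print(early, late, growth_factor(early, late))
```

Comparing estimates made at two dates, as in `growth_factor`, mirrors the kind of 1996 versus 1999 and 2000 comparison the abstract mentions, under the assumption that the same predictor words and reference frequencies are used at both dates.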
