Beyond word frequency: Bursts, lulls, and scaling in the temporal distributions of words

Computer Science – Computation and Language

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Scientific paper

10.1371/journal.pone.0007678

Background: Zipf's discovery that word frequency distributions obey a power law established parallels between biological and physical processes, and language, laying the groundwork for a complex systems perspective on human communication. More recent research has also identified scaling regularities in the dynamics underlying the successive occurrences of events, suggesting the possibility of similar findings for language as well. Methodology/Principal Findings: By considering frequent words in USENET discussion groups and in disparate databases where the language has different levels of formality, here we show that the distributions of distances between successive occurrences of the same word display bursty deviations from a Poisson process and are well characterized by a stretched exponential (Weibull) scaling. The extent of this deviation depends strongly on semantic type -- a measure of the logicality of each word -- and less strongly on frequency. We develop a generative model of this behavior that fully determines the dynamics of word usage. Conclusions/Significance: Recurrence patterns of words are well described by a stretched exponential distribution of recurrence times, an empirical scaling that cannot be anticipated from Zipf's law. Because the use of words provides a uniquely precise and powerful lens on human thought and activity, our findings also have implications for other overt manifestations of collective human dynamics.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Beyond word frequency: Bursts, lulls, and scaling in the temporal distributions of words does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Beyond word frequency: Bursts, lulls, and scaling in the temporal distributions of words, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Beyond word frequency: Bursts, lulls, and scaling in the temporal distributions of words will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-126745

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.