Computer Science – Computation and Language
Scientific paper
1998-08-12
Computer Science
Computation and Language
32 pages, 11 figures, uses epsf.tex
Scientific paper
Recent observations in the theory of verse and empirical metrics have suggested that constructing a verse line involves a pattern-matching search through a source text, and that the number of found elements (complete words totaling a specified number of syllables) is given by dividing the total number of words by the mean number of syllables per word in the source text. This paper makes this latter point explicit mathematically, and in the course of this demonstration shows that the word length frequency totals in English output are distributed geometrically (previous researchers reported an adjusted Poisson distribution), and that the sequential distribution is random at the global level, with significant non-randomness in the fine structure. Data from a corpus of just under two million words, and a syllable-count lexicon of 71,000 word-forms is reported. The pattern-matching theory is shown to be internally coherent, and it is observed that some of the analytic techniques described here form a satisfactory test for regular (isometric) lineation in a text.
Aoyama Hideaki
Constable John
No associations
LandOfFree
Word Length Frequency and Distribution in English: Observations, Theory, and Implications for the Construction of Verse Lines does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.
If you have personal experience with Word Length Frequency and Distribution in English: Observations, Theory, and Implications for the Construction of Verse Lines, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Word Length Frequency and Distribution in English: Observations, Theory, and Implications for the Construction of Verse Lines will most certainly appreciate the feedback.
Profile ID: LFWR-SCP-O-309913