Detecting gross alignment errors in the Spoken British National Corpus

Computer Science – Sound

Scientific paper

Details

Four pages, 3 figures. Presented at "New Tools and Methods for Very-Large-Scale Phonetics Research", University of Pennsylvani

The paper presents methods for evaluating the accuracy of alignments between transcriptions and audio recordings. The methods have been applied to the Spoken British National Corpus, which is an extensive and varied corpus of natural unscripted speech. Early results show good agreement with human ratings of alignment accuracy. The methods also provide an indication of the location of likely alignment problems; this should allow efficient manual examination of large corpora. Automatic checking of such alignments is crucial when analysing any very large corpus, since even the best current speech alignment systems will occasionally make serious errors. The methods described here use a hybrid approach based on statistics of the speech signal itself, statistics of the labels being evaluated, and statistics linking the two.
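The abstract does not spell out the individual statistics, so the following is only a minimal sketch of one kind of label-based check, not the paper's method. It assumes an alignment is available as a list of (label, start, end) tuples in seconds and flags segments whose duration is a robust outlier for their label, using the modified z-score based on the median absolute deviation. The function name `duration_outliers` and the threshold of 3.5 are illustrative choices, not taken from the paper.

```python
from collections import defaultdict
from statistics import median


def duration_outliers(segments, threshold=3.5):
    """Return indices of segments whose duration is an outlier for their label.

    segments: list of (label, start_sec, end_sec) tuples (assumed format).
    threshold: cutoff on the modified z-score (illustrative default).
    """
    # Collect all durations observed for each label.
    by_label = defaultdict(list)
    for label, start, end in segments:
        by_label[label].append(end - start)

    # Robust per-label statistics: median and median absolute deviation (MAD).
    stats = {}
    for label, durations in by_label.items():
        med = median(durations)
        mad = median([abs(d - med) for d in durations])
        stats[label] = (med, mad)

    flagged = []
    for i, (label, start, end) in enumerate(segments):
        med, mad = stats[label]
        if mad == 0:
            # No spread in the data for this label; nothing to compare against.
            continue
        # Modified z-score: 0.6745 rescales MAD to match a standard deviation
        # under a normal distribution.
        score = 0.6745 * abs((end - start) - med) / mad
        if score > threshold:
            flagged.append(i)
    return flagged
```

The returned indices point at positions in the alignment, which fits the abstract's aim of indicating where likely problems are so that manual checking can be targeted. A check like this covers only the label statistics; the signal-based and joint statistics mentioned in the abstract would need the audio itself.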
