Estimating Performance of Pipelined Spoken Language Translation Systems

Computer Science – Computation and Language

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

10 pages, Latex source. To appear in Proc. ICSLP '94

Scientific paper

Most spoken language translation systems developed to date rely on a pipelined architecture, in which the main stages are speech recognition, linguistic analysis, transfer, generation and speech synthesis. When making projections of error rates for systems of this kind, it is natural to assume that the error rates for the individual components are independent, making the system accuracy the product of the component accuracies. The paper reports experiments carried out using the SRI-SICS-Telia Research Spoken Language Translator and a 1000-utterance sample of unseen data. The results suggest that the naive performance model leads to serious overestimates of system error rates, since there are in fact strong dependencies between the components. Predicting the system error rate on the independence assumption by simple multiplication resulted in a 16\% proportional overestimate for all utterances, and a 19\% overestimate when only utterances of length 1-10 words were considered.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Estimating Performance of Pipelined Spoken Language Translation Systems does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Estimating Performance of Pipelined Spoken Language Translation Systems, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Estimating Performance of Pipelined Spoken Language Translation Systems will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-620430

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.