Computer Science – Neural and Evolutionary Computing
Scientific paper
1998-12-05
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (1998) 2:1237-1240. Seattle, Washi
Source link (9812006.tar.gz) contains: 1 PostScript file (4 pages) and 3 WAV audio files. If your system does not support Wind
While neural networks have been employed to handle several different text-to-speech tasks, ours is the first system to use neural networks throughout, for both linguistic and acoustic processing. We divide the text-to-speech task into three subtasks: a linguistic module mapping from text to a linguistic representation, an acoustic module mapping from the linguistic representation to speech, and a video module mapping from the linguistic representation to animated images. The linguistic module employs a letter-to-sound neural network and a postlexical neural network. The acoustic module employs a duration neural network and a phonetic neural network. The video module employs a visual neural network that runs in parallel with the acoustic module to drive a talking head. Because the neural networks can be retrained on the characteristics of different voices and languages, our system offers a degree of adaptability and naturalness heretofore unavailable.
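The data flow described in the abstract can be sketched roughly as follows. This is only an illustration of how the modules connect, not the authors' implementation: every name here (Phone, letter_to_sound, postlexical, predict_durations, phonetic, visual, synthesize) is hypothetical, and each network is replaced by a trivial placeholder function. The fixed 80 ms duration and the 10 ms frame step are assumed values, not figures from the paper.

```python
from dataclasses import dataclass
from typing import List, Tuple

# All names below are hypothetical stand-ins for illustration only.
# Each function is a trivial placeholder for a trained neural network.

@dataclass
class Phone:
    symbol: str
    duration_ms: float = 0.0

def letter_to_sound(text: str) -> List[Phone]:
    """Placeholder letter-to-sound network: one phone per letter."""
    return [Phone(ch) for ch in text.lower() if ch.isalpha()]

def postlexical(phones: List[Phone]) -> List[Phone]:
    """Placeholder postlexical network: passes phones through unchanged."""
    return list(phones)

def predict_durations(phones: List[Phone]) -> List[Phone]:
    """Placeholder duration network: assumed fixed 80 ms per phone."""
    return [Phone(p.symbol, 80.0) for p in phones]

def phonetic(phones: List[Phone]) -> List[float]:
    """Placeholder phonetic network: one dummy frame per assumed 10 ms step."""
    total_ms = sum(p.duration_ms for p in phones)
    return [0.0] * int(total_ms // 10)

def visual(phones: List[Phone]) -> List[str]:
    """Placeholder visual network: one mouth-shape label per phone."""
    return [f"viseme:{p.symbol}" for p in phones]

def synthesize(text: str) -> Tuple[List[float], List[str]]:
    # Linguistic module: text -> linguistic representation.
    linguistic = postlexical(letter_to_sound(text))
    # Acoustic module: linguistic representation -> speech frames.
    acoustic = phonetic(predict_durations(linguistic))
    # Video module: consumes the same linguistic representation in parallel.
    video = visual(linguistic)
    return acoustic, video

if __name__ == "__main__":
    frames, visemes = synthesize("Hi")
    print(len(frames), visemes)
```

The point of the sketch is the branching: the acoustic and video paths both start from the same linguistic representation, so either one could be retrained or replaced without touching the other.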
Corrigan Gerald
Karaali Orhan
Mackie Andrew
Massey Noel
Miller Corey
A High Quality Text-To-Speech System Composed of Multiple Neural Networks