A high speed unsupervised speaker retrieval using vector quantization and second-order statistics

Computer Science – Information Retrieval

Scientific paper

Details

This paper describes an effective unsupervised method for query-by-example speaker retrieval. We assume that each audio file or audio segment contains only one speaker. The audio data are modeled using a common universal codebook based on the bag-of-frames (BOF) approach. Features are extracted from the frames of all audio files and grouped into clusters using the K-means algorithm. Each audio file is then modeled by the normalized distribution of its frames over the cluster bins. At the first level, the k files nearest to the query are retrieved using the vector space representation. At the second level, a second-order statistical measure is applied to these k nearest files to produce the final retrieval result. The method is evaluated on a subset of the Ester corpus of French broadcast news.
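
The two-level scheme described in the abstract can be sketched roughly as follows. This is only an illustration under several assumptions that the abstract does not confirm: cosine similarity stands in for the vector space comparison, the arithmetic-harmonic sphericity (AHS) measure on frame covariance matrices stands in for the second-order statistical measure, and the frame features are synthetic Gaussian vectors rather than real acoustic features. All function names (kmeans, assign, bof_histogram, cosine, ahs, retrieve, fake_file) are hypothetical.

```python
import numpy as np

def assign(frames, centroids):
    # Index of the nearest centroid (squared Euclidean distance) for each frame.
    d2 = ((frames[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    return d2.argmin(axis=1)

def kmeans(frames, n_clusters, n_iter=20, seed=0):
    # Plain Lloyd iterations; the universal codebook is the set of centroids.
    rng = np.random.default_rng(seed)
    centroids = frames[rng.choice(len(frames), n_clusters, replace=False)].copy()
    for _ in range(n_iter):
        labels = assign(frames, centroids)
        for c in range(n_clusters):
            members = frames[labels == c]
            if len(members):  # leave an empty cluster's centroid unchanged
                centroids[c] = members.mean(axis=0)
    return centroids

def bof_histogram(frames, centroids):
    # Normalized count of frames falling into each cluster bin.
    counts = np.bincount(assign(frames, centroids), minlength=len(centroids))
    return counts / counts.sum()

def cosine(a, b):
    # Assumed first-level similarity between two histograms.
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-12))

def ahs(x, y):
    # Assumed second-order measure: arithmetic-harmonic sphericity between
    # the covariance matrices of two frame sets (0 when they are proportional).
    cx = np.cov(x, rowvar=False)
    cy = np.cov(y, rowvar=False)
    p = cx.shape[0]
    t = np.trace(cx @ np.linalg.inv(cy)) * np.trace(cy @ np.linalg.inv(cx))
    return float(np.log(t / p ** 2))

def retrieve(query, files, centroids, k=3):
    # Level 1: shortlist the k files whose histograms are closest to the query.
    q = bof_histogram(query, centroids)
    sims = [cosine(q, bof_histogram(f, centroids)) for f in files]
    shortlist = np.argsort(sims)[::-1][:k]
    # Level 2: re-rank the shortlist with the second-order measure.
    return sorted(shortlist.tolist(), key=lambda i: ahs(query, files[i]))

# Synthetic stand-in data: three "speakers", two files each, 200 frames of 4 features.
rng = np.random.default_rng(1)
means = [0.0, 6.0, 12.0]
stds = [[1, 1, 1, 1], [0.5, 2, 0.5, 2], [3, 0.5, 1, 0.5]]

def fake_file(s):
    return means[s] + rng.standard_normal((200, 4)) * np.array(stds[s])

files = [fake_file(s) for s in (0, 0, 1, 1, 2, 2)]
query = fake_file(0)
centroids = kmeans(np.vstack(files), n_clusters=8)
ranked = retrieve(query, files, centroids, k=3)
print(ranked)  # indices of the shortlisted files, best match first
```

The split into two levels reflects the speed argument in the title: the histogram comparison is cheap and runs over the whole collection, while the costlier covariance-based measure is computed only for the k shortlisted files.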
