Statistics – Applications
Scientific paper
Sep 2004
adsabs.harvard.edu/cgi-bin/nph-data_query?bibcode=2004esasp.553e..15c&link_type=abstract
Proceedings of ESA-EUSC 2004 - Theory and Applications of Knowledge-Driven Image Information Mining with Focus on Earth Observa
Statistics
Applications
Scientific paper
Along with the aggressive growing of the amount of digital data available (text, audio samples, digital photos and digital movies joined all in the multimedia domain) the need for classification, recognition and retrieval of this kind of data became very important. In this paper will be presented a system structure to handle multimedia data based on a recognition perspective. The main processing steps realized for the interesting multimedia objects are: first, the parameterization, by analysis, in order to obtain a description based on features, forming the parameter vector; second, a classification, generally with a hierarchical structure to make the necessary decisions. For audio signals, both speech and music, the derived perceptual features are the melcepstral (MFCC) and the perceptual linear predictive (PLP) coefficients. For images, the derived features are the geometric parameters of the speaker mouth. The hierarchical classifier consists generally in a clustering stage, based on the Kohonnen Self-Organizing Maps (SOM) and a final stage, based on a powerful classification algorithm called Support Vector Machines (SVM). The system, in specific variants, is applied with good results in two tasks: the first, is a bimodal speech recognition which uses features obtained from speech signal fused to features obtained from speaker's image and the second is a music retrieval from large music database.
Costache G. N.
Gavat I.
No associations
LandOfFree
Multimedia Classifier does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.
If you have personal experience with Multimedia Classifier, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Multimedia Classifier will most certainly appreciate the feedback.
Profile ID: LFWR-SCP-O-1064394