conference paper

Conference/ProceedingsEURASIP-EUSIPCO 2004
Start date06.09.2004
End date10.09.2004
AddressVienna, Austria
Author(s)Hyoung-Gook Kim, Thomas Sikora
TitleAudio Spectrum Projection Based on Several Basis Decomposition Algorithms applied to General Sound Recognition and Audio Segmentation
AbstractOur challenge is to analyze/classify video sound track content for indexing purposes. To this end we compare the performance of MPEG-7 Audio Spectrum Projection (ASP)
features based on basis decomposition vs. Mel-scale requency
Cepstrum Coefficients (MFCC). For basis decomposition
in the feature extraction we have three choices: Principal
Component Analysis (PCA), Independent Component Analysis (ICA), and Non-negative Matrix Factorization (NMF). Audio features are computed from these reduced vectors and are fed into hidden Markov model classifier. Experimental results show that the MFCC features yield better performance compared to MPEG-7 ASP in the sound recognition, and audio segmentation.