direkt zum Inhalt springen

direkt zum Hauptnavigationsmenü

Sie sind hier

TU Berlin

Page Content

Scientific Publications

Audio Spectrum Projection Based on Several Basis Decomposition Algorithms applied to General Sound Recognition and Audio Segmentation
Citation key 0776Kim2004
Author Hyoung-Gook Kim and Thomas Sikora
Title of Book EURASIP-EUSIPCO 2004
Year 2004
Address Vienna, Austria
Month sep
Abstract Our challenge is to analyze/classify video sound track content for indexing purposes. To this end we compare the performance of MPEG-7 Audio Spectrum Projection (ASP) features based on basis decomposition vs. Mel-scale requency Cepstrum Coefficients (MFCC). For basis decomposition in the feature extraction we have three choices: Principal Component Analysis (PCA), Independent Component Analysis (ICA), and Non-negative Matrix Factorization (NMF). Audio features are computed from these reduced vectors and are fed into hidden Markov model classifier. Experimental results show that the MFCC features yield better performance compared to MPEG-7 ASP in the sound recognition, and audio segmentation.
Link to publication Download Bibtex entry

Zusatzinformationen / Extras

Quick Access:

Schnellnavigation zur Seite über Nummerneingabe