TU Berlin

Communication Systems GroupScientific Publications

Page Content

to Navigation

Scientific Publications

Incorporating prior knowledge on the digital media creation process into audio classifiers
Citation key 1193Lardeur2009
Author Maxime Lardeur and Slim Essid and Gael Richard and Martin Haller and Thomas Sikora
Title of Book Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2009)
Pages 1653–1656
Year 2009
Address Taipei, Taiwan
Month apr
Note ISBN 978-1-4244-2354-5
Abstract In the process of music content creation, a wide range of typical audio effects such as reverberation, equalization or dynamic compression are very commonly used. Despite the fact that such effects have a clear impact on the audio features, they are rarely taken into account when building an automatic audio classifier. In this paper, it is shown that the incorporation of prior knowledge of the digital media creation chain can clearly improve the robustness of the audio classifiers, which is demonstrated on a task of musical instrument recognition. The proposed system is based on a robust feature selection strategy, on a novel use of the virtual support vector machines technique and a specific equalization used to normalize the signals to be classified. The robustness of the proposed system is experimentally evidenced using a rather large and varied sound database.
Link to publication Download Bibtex entry


Quick Access

Schnellnavigation zur Seite über Nummerneingabe