Note: This bibliographic page is archived and will no longer be updated. For an up-to-date list of publications from the Music Technology Group see the Publications list .

A Comparison of Sound Segregation Techniques for Predominant Instrument Recognition in Musical Audio Signals

Title A Comparison of Sound Segregation Techniques for Predominant Instrument Recognition in Musical Audio Signals
Publication Type Conference Paper
Year of Publication 2012
Conference Name 13th International Society for Music Information Retrieval Conference (ISMIR 2012)
Authors Bosch, J. , Janer J. , Fuhrmann F. , & Herrera P.
Pagination 559-564
Conference Start Date 08/10/2012
Conference Location Porto, Portugal
Abstract The authors address the identification of predominant music instruments in polytimbral audio by previously dividing the original signal into several streams. Several strategies are evaluated, ranging from low to high complexity with respect to the segregation algorithm and models used for classification. The dataset of interest is built from professionally produced recordings, which typically pose problems to state-of-art source separation algorithms. The recognition results are improved a 19% with a simple sound segregation pre-step using only panning information, in comparison to the original algorithm. In order to further improve the results, we evaluated the use of a complex source separation as a pre-step. The results showed that the performance was only enhanced if the recognition models are trained with the features extracted from the separated audio streams. In this way, the typical errors of state-of-art separation algorithms are acknowledged, and the performance of the original instrument recognition algorithm is improved in up to 32%.
preprint/postprint document http://mtg.upf.edu/system/files/publications/Bosch-ISMIR2012.pdf