A Comparison of Sound Segregation Techniques for Predominant Instrument Recognition in Musical Audio Signals

TitleA Comparison of Sound Segregation Techniques for Predominant Instrument Recognition in Musical Audio Signals
Publication TypeConference Paper
Year of Publication2012
Conference Name13th International Society for Music Information Retrieval Conference (ISMIR 2012)
AuthorsBosch, J., Janer J., Fuhrmann F., & Herrera P.
Pagination559-564
Conference Start Date08/10/2012
Conference LocationPorto, Portugal
AbstractThe authors address the identification of predominant music instruments in polytimbral audio by previously dividing the original signal into several streams. Several strategies are evaluated, ranging from low to high complexity with respect to the segregation algorithm and models used for classification. The dataset of interest is built from professionally produced recordings, which typically pose problems to state-of-art source separation algorithms. The recognition results are improved a 19% with a simple sound segregation pre-step using only panning information, in comparison to the original algorithm. In order to further improve the results, we evaluated the use of a complex source separation as a pre-step. The results showed that the performance was only enhanced if the recognition models are trained with the features extracted from the separated audio streams. In this way, the typical errors of state-of-art separation algorithms are acknowledged, and the performance of the original instrument recognition algorithm is improved in up to 32%.
preprint/postprint documenthttp://mtg.upf.edu/system/files/publications/Bosch-ISMIR2012.pdf
intranet