Note: This bibliographic page is archived and will no longer be updated. For an up-to-date list of publications from the Music Technology Group see the Publications list .

Shape-based spectral contrast descriptor

Title Shape-based spectral contrast descriptor
Publication Type Conference Paper
Year of Publication 2009
Conference Name Sound and Music Computing Conference
Authors Akkermans, V. , Serrà J. , & Herrera P.
Pagination 143-148
Conference Start Date 25/07/2009
Conference Location Porto, Portugal.
Abstract Mel-frequency cepstral coefficients are used as an abstract representation of the spectral envelope of a given signal. Although they have been shown to be a powerful descriptor for speech and music signals, more accurate and easily interpretable options can be devised. In this study, we present and evaluate the shape-based spectral contrast descriptor, which is build up from the previously proposed octave-based spectral contrast descriptor. We compare the three aforementioned descriptors with regard to their discriminative power and MP3 compression robustness. Discriminative power is evaluated within a prototypical genre classification task. MP3 compression robustness is measured by determining the descriptor values' change between different encodings. We show that the proposed shape-based spectral contrast descriptor yields a significant increase in accuracy, robustness, and applicability over the octave-based spectral contrast descriptor. Our results also corroborate initial findings regarding the accuracy improvement of the octave-based spectral contrast descriptor over Mel-frequency cepstral coefficients for the genre classification task.
preprint/postprint document files/publications/Akkermans-Serra-Herrera-smc09.pdf