Detecting Solo Phrases in Music using Spectral and Pitch-related Descriptors

TitleDetecting Solo Phrases in Music using Spectral and Pitch-related Descriptors
Publication TypeJournal Article
Year of Publication2009
AuthorsFuhrmann, F., Herrera P., & Serra X.
Journal TitleJournal of New Music Research
Pages343 - 356
Journal Date12/2009
AbstractIn this paper we present an algorithm for segmenting musical audio data. Our aim is to identify solo instrument phrases in polyphonic music. We extract relevant features from the audio to be input into our algorithm. A large corpus of audio descriptors was tested for its ability to discriminate between solo and non-solo sections, which resulted in a subset of five best features. We derived a two-stage algorithm that first creates a set of boundary candidates from local changes of these features and then classifies fixed-length segments according to the desired target classes. The output of the two stages is combined to derive the final segmentation and segment labels. Our system was trained and tested with excerpts from classical pieces and evaluated using full-length recordings, all taken from commercially available audio. We evaluated our algorithm by using precision and recall measurements for the boundary estimation and introduced new evaluation metrics from image processing for the final segmentation. Along with a resulting accuracy of 77%, we demonstrate that the selected features are discriminative for this specific task and achieve reasonable results for the segmentation problem.
preprint/postprint document
Final publication