A Vocoder Based Method For Singing Voice Extraction

Pritish Chandna; Merlijn Blaauw; Jordi Bonada; Emilia Gomez

Note: This bibliographic page is archived and will no longer be updated. For an up-to-date list of publications from the Music Technology Group see the Publications list .

A Vocoder Based Method For Singing Voice Extraction

Title	A Vocoder Based Method For Singing Voice Extraction
Publication Type	Conference Paper
Year of Publication	2019
Conference Name	44th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2019)
Authors	Chandna, P. , Blaauw M. , Bonada J. , & Gomez E.
Conference Start Date	12/05/2019
Publisher	IEEE
Conference Location	Brighton, UK
Abstract	This paper presents a novel method for extracting the vocal track from a musical mixture. The musical mixture consists of a singing voice and a backing track which may comprise of various instruments. We use a convolutional network with skip and residual connections as well as dilated convolutions to estimate vocoder parameters, given the spectrogram of an input mixture. The estimated parameters are then used to synthesize the vocal track, without any interference from the backing track. We evaluate our system, through objective metrics pertinent to audio quality and interference from background sources, and via a comparative subjective evaluation. We use open-source source separation systems based on Non-negative Matrix Factorization (NMFs) and Deep Learning methods as benchmarks for our system and discuss future applications for this particular algorithm.
preprint/postprint document	https://arxiv.org/abs/1903.07554