Systematic Database Creation for Expressive Singing Voice Synthesis Control

Umbert, M.; Bonada, J.; Merlijn Blaauw

Note: This bibliographic page is archived and will no longer be updated. For an up-to-date list of publications from the Music Technology Group see the Publications list .

Systematic Database Creation for Expressive Singing Voice Synthesis Control

Title	Systematic Database Creation for Expressive Singing Voice Synthesis Control
Publication Type	Conference Paper
Year of Publication	2013
Conference Name	8th ISCA Speech Synthesis Workshop (SSW8)
Authors	Umbert, M. , Bonada J. , & Blaauw M.
Pagination	213-216
Conference Start Date	31/09/2013
Conference Location	Barcelona
Abstract	In the context of singing voice synthesis, the generation of the synthesizer controls is a key aspect to obtain expressive performances. In our case, we use a system that selects, transforms and concatenates units of short melodic contours from a recorded database. This paper proposes a systematic procedure for the creation of such database. The aim is to cover relevant style-dependent combinations of features such as note duration, pitch interval and note strength. The higher the percentage of covered combinations is, the less transformed the units will be in order to match a target score. At the same time, it is also important that units are musically meaningful according to the target style. In order to create a style-dependent database, the melodic combinations of features to cover are identified, statistically modeled and grouped by similarity. Then, short melodic exercises of four measures are created following a dynamic programming algorithm. The Viterbi cost functions deal with the statistically observed context transitions, harmony, position within the measure and readability. The final systematic score database

UmbertBonadaBlaauwSSW82013.pdf