Systematic Database Creation for Expressive Singing Voice Synthesis Control

TitleSystematic Database Creation for Expressive Singing Voice Synthesis Control
Publication TypeConference Paper
Year of Publication2013
Conference Name8th ISCA Speech Synthesis Workshop (SSW8)
AuthorsUmbert, M., Bonada J., & Blaauw M.
Conference Start Date31/09/2013
Conference LocationBarcelona
Keywordsdatabase creation, expressive singing voice synthesis, unit selection
AbstractIn the context of singing voice synthesis, the generation of the synthesizer controls is a key aspect to obtain expressive performances. In our case, we use a system that selects, transforms and concatenates units of short melodic contours from a recorded database. This paper proposes a systematic procedure for the creation of such database. The aim is to cover relevant style-dependent combinations of features such as note duration, pitch interval and note strength. The higher the percentage of covered combinations is, the less transformed the units will be in order to match a target score. At the same time, it is also important that units are musically meaningful according to the target style. In order to create a style-dependent database, the melodic combinations of features to cover are identified, statistically modeled and grouped by similarity. Then, short melodic exercises of four measures are created following a dynamic programming algorithm. The Viterbi cost functions deal with the statistically observed context transitions, harmony, position within the measure and readability. The final systematic score database