TY - GEN
T1 - Adding personality to neutral speech synthesis voices
AU - Buchanan, Christopher G.
AU - Aylett, Matthew P.
AU - Braude, David A.
PY - 2018/8/25
Y1 - 2018/8/25
N2 - A synthetic voice personifies the system using it. Previous work has shown that using sub-corpora with different voice qualities (e.g. tense and lax) can be used to modify the perceived personality of a voice as well as adding expressive and emotional functionality. In this work we explore the use of LPC source/filter decomposition together with modification of the residual to artificially add voice quality sub-corpora to a voice without recording bespoke data. We evaluate this artificially enhanced voice against a baseline unit selection voice with pre-recorded sub-corpora. Although artificial modification impacts naturalness, it has the advantage of adding emotional range to voices where none was recorded in the source data, deals with data sparsity issues caused by sub-corpora, and results in significant effects in terms of perceived emotion.
DO - 10.1007/978-3-319-99579-3_6
M3 - Conference contribution
SN - 9783319995786
T3 - Lecture Notes in Computer Science
SP - 49
EP - 57
BT - Speech and Computer. SPECOM 2018
PB - Springer
T2 - 20th International Conference on Speech and Computer 2018
Y2 - 18 September 2018 through 22 September 2018
ER -