Abstract
State-of-the-art speech synthesis owes much to modern AI machine learning, with recurrent neural networks becoming the new standard. However, how you say something is just as important as what you say. If we draw inspiration from human dramatic performance, ideas such as artistic direction can help us design interactive speech synthesis systems which can be finely controlled by a human voice. This "voice puppetry" has many possible applications from film dubbing to the pre-creation of prompts for a conversational agent. Previous work in voice puppetry has raised the question of how such a system should work and how we might interact with it. Here, we share the results of a focus group discussing voice puppetry and responding to a voice puppetry demo. Results highlight a main challenge in user-centred AI: where is the trade-off between control and automation? and how may users control this trade-off?
Original language | English |
---|---|
Title of host publication | Companion Proceedings of the 25th International Conference on Intelligent User Interfaces |
Publisher | Association for Computing Machinery |
Pages | 108-109 |
Number of pages | 2 |
ISBN (Print) | 9781450375139 |
DOIs | |
Publication status | Published - 17 Mar 2020 |