Abstract
The ability to be interrupted and react in a realistic manner is a key requirement for interactive speech interfaces. While previous systems have long implemented techniques such as ‘barge in’ where speech output can be halted at word or phrase boundaries, less work has explored how to mimic human speech output responses to real-time events like interruptions which require a reaction from the system. Unlike previous work which has focused on incremental production, here we explore a novel re-planning approach. The proposed system is versatile and offers a large range of possible ways to react. A focus group was used to evaluate the approach, where participants interacted with a system reading out a text. The system would react to audio interruptions, either with no reactions, passive reactions, or active negative reactions (i.e. getting increasingly irritated). Participants preferred a reactive system.
Original language | English |
---|---|
Title of host publication | Proceedings of the 18th Annual Conference of the International Speech Communication Association 2017 |
Place of Publication | Stockholm, Sweden |
Publisher | ISCA |
Pages | 3996-4000 |
Number of pages | 5 |
ISBN (Print) | 9781510848764 |
DOIs | |
Publication status | Published - Feb 2018 |
Event | 18th Annual Conference of the International Speech Communication Association 2017 - Stockholm , Sweden Duration: 20 Aug 2017 → 24 Aug 2017 https://www.isca-archive.org/interspeech_2017/index.html |
Publication series
Name | 18th Annual Conference of the International Speech Communication Association 2017 |
---|---|
Publisher | International Speech Communication Association (ISCA) |
ISSN (Print) | 2308-457X |
Conference
Conference | 18th Annual Conference of the International Speech Communication Association 2017 |
---|---|
Abbreviated title | INTERSPEECH 2017 |
Country/Territory | Sweden |
City | Stockholm |
Period | 20/08/17 → 24/08/17 |
Internet address |