Real-Time Reactive Speech Synthesis: Incorporating Interruptions

Mirjam Wester, David A. Braude, Blaise Potard, Matthew Aylett, Francesca Shaw

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution

5 Citations (Scopus)

Abstract

The ability to be interrupted and react in a realistic manner is a key requirement for interactive speech interfaces. While previous systems have long implemented techniques such as ‘barge in’, where speech output can be halted at word or phrase boundaries, less work has explored how to mimic human speech output responses to real-time events, such as interruptions, that require a reaction from the system. Unlike previous work, which has focused on incremental production, here we explore a novel re-planning approach. The proposed system is versatile and offers a large range of possible ways to react. The approach was evaluated with a focus group in which participants interacted with a system reading out a text. The system responded to audio interruptions with no reaction, a passive reaction, or an active negative reaction (i.e. getting increasingly irritated). Participants preferred a reactive system.
Original language: English
Title of host publication: Proceedings of the 18th Annual Conference of the International Speech Communication Association 2017
Place of Publication: Stockholm, Sweden
Publisher: ISCA
Pages: 3996-4000
Number of pages: 5
ISBN (Print): 9781510848764
Publication status: Published - Feb 2018
Event: 18th Annual Conference of the International Speech Communication Association 2017 - Stockholm, Sweden
Duration: 20 Aug 2017 to 24 Aug 2017
https://www.isca-archive.org/interspeech_2017/index.html

Publication series

Name: 18th Annual Conference of the International Speech Communication Association 2017
Publisher: International Speech Communication Association (ISCA)
ISSN (Print): 2308-457X

Conference

Conference: 18th Annual Conference of the International Speech Communication Association 2017
Abbreviated title: INTERSPEECH 2017
Country/Territory: Sweden
City: Stockholm
Period: 20/08/17 to 24/08/17