Speech as Interactive Design Material (SIDM): How to design and evaluate task-tailored synthetic voices?

Mateusz Dubiel, Matthew Peter Aylett, Anuschka Schmitt, Zilin Ma, Gary Hsieh, Thiemo Wambsganss

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The aim of this workshop is two-fold. First, it aims to establish a research community focused on design and evaluation of synthetic speech (TTS) interfaces that are tailored not only to goal oriented tasks (e.g., food ordering, online shopping) but also personal growth and resilience promoting applications (e.g., coaching, mindful reflection, and tutoring). Second, through discussion and collaborative efforts, to establish a set of practices and standards that will help to improve ecological validity of TTS evaluation. In particular, the workshop will explore the topics such as: interaction design of voice-based conversational interfaces; the interplay between prosodic aspects (e.g., pitch variance, loudness, jitter) of TTS and its impact on voice perception. This workshop will serve as a platform on which to build a community that is better equipped to tackle the dynamic field of interactive TTS interfaces, which remains understudied, yet increasingly pertinent to everyday lives of users.
Original languageEnglish
Title of host publicationIUI '24 Companion: Companion Proceedings of the 29th International Conference on Intelligent User Interfaces
PublisherAssociation for Computing Machinery
Pages131-133
Number of pages3
ISBN (Print)9798400705090
DOIs
Publication statusPublished - 5 Apr 2024
Event29th International Conference on Intelligent User Interfaces 2024 - Greenville, United States
Duration: 18 Mar 202421 Mar 2024

Conference

Conference29th International Conference on Intelligent User Interfaces 2024
Abbreviated titleIUI '24
Country/TerritoryUnited States
CityGreenville
Period18/03/2421/03/24

Fingerprint

Dive into the research topics of 'Speech as Interactive Design Material (SIDM): How to design and evaluate task-tailored synthetic voices?'. Together they form a unique fingerprint.

Cite this