All Together Now: The Living Audio Dataset

David A. Braude, Matthew P. Aylett, Caoimhin Laoide-Kemp, Simone Ashby, Kristen M. Scott, Brian O. Raghallaigh, Anna Braudo, Alex Brouwer, Adriana Stan

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Citations (Scopus)

Abstract

The ongoing focus in speech technology research on machine learning based approaches leaves the community hungry for data. However, datasets tend to be recorded once and then released, sometimes behind registration requirements or paywalls. In this paper we describe our Living Audio Dataset. The aim is to provide audio data that is in the public domain, multilingual, and expandable by communities. We discuss the role of linguistic resources, given the success of systems such as Tacotron which use direct text-to-speech mappings, and consider how data provenance could be built into such resources. So far the data has been collected for TTS purposes, however, it is also suitable for ASR. At the time of publication audio resources already exist for Dutch, R.P. English, Irish, and Russian.
Original languageEnglish
Title of host publicationProceedings of Interspeech 2019
PublisherISCA
Pages1521-1525
Number of pages5
DOIs
Publication statusPublished - 2019

Fingerprint

Dive into the research topics of 'All Together Now: The Living Audio Dataset'. Together they form a unique fingerprint.

Cite this