Evaluating speech synthesis in a mobile context: Audio presentation of Facebook, Twitter and RSS

Mathew Aylett, Yolanda Vazquez-Alvarez, Lynne Baillie

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

This paper presents an evaluation of a podcast service that aggregates data from Facebook, Twitter and RSS feeds, using speech synthesis. The service uses a novel approach to speech synthesis generation, where XML markup is used to control both the speech synthesis and the sound design of a resulting podcast. A two-phase evaluation was carried out: 1) participants listening to the podcasts on desktop computers, 2) participants listening to the podcasts while walking. Our findings show that participants preferred shorter podcasts with sound effects and background music, and were affected by the surrounding environmental noise. However, audio advertising which is part of the service did not have a significant negative effect. Another finding was that the advantage of using multiple voices for content segmentation may have been undermined by difficulties in listener adaptation. The work is part of a new approach to speech synthesis provision, where its style of rendition forms a part of the application design and it is evaluated within an application context.

Original languageEnglish
Title of host publicationProceedings of the ITI 2013 35th International Conference on Information Technology Interfaces
PublisherIEEE
Pages167-172
Number of pages6
ISBN (Electronic)9789537138325
ISBN (Print)9789537138301
DOIs
Publication statusPublished - 31 Oct 2013
Event35th International Conference on Information Technology Interfaces 2013 - Cavtat / Dubrovnik, Croatia
Duration: 24 Jun 201327 Jun 2013

Conference

Conference35th International Conference on Information Technology Interfaces 2013
Abbreviated titleITI 2013
CountryCroatia
CityCavtat / Dubrovnik
Period24/06/1327/06/13

Fingerprint

RSS
Speech synthesis
Acoustic waves
XML
Personal computers
Marketing

Keywords

  • auditory interfaces
  • evaluation
  • mobile systems
  • Speech synthesis

ASJC Scopus subject areas

  • Computer Science(all)

Cite this

Aylett, M., Vazquez-Alvarez, Y., & Baillie, L. (2013). Evaluating speech synthesis in a mobile context: Audio presentation of Facebook, Twitter and RSS. In Proceedings of the ITI 2013 35th International Conference on Information Technology Interfaces (pp. 167-172). IEEE. https://doi.org/10.2498/iti.2013.0537
Aylett, Mathew ; Vazquez-Alvarez, Yolanda ; Baillie, Lynne. / Evaluating speech synthesis in a mobile context : Audio presentation of Facebook, Twitter and RSS. Proceedings of the ITI 2013 35th International Conference on Information Technology Interfaces. IEEE, 2013. pp. 167-172
@inproceedings{2b85f151b0144ba8bae500a34026f94c,
title = "Evaluating speech synthesis in a mobile context: Audio presentation of Facebook, Twitter and RSS",
abstract = "This paper presents an evaluation of a podcast service that aggregates data from Facebook, Twitter and RSS feeds, using speech synthesis. The service uses a novel approach to speech synthesis generation, where XML markup is used to control both the speech synthesis and the sound design of a resulting podcast. A two-phase evaluation was carried out: 1) participants listening to the podcasts on desktop computers, 2) participants listening to the podcasts while walking. Our findings show that participants preferred shorter podcasts with sound effects and background music, and were affected by the surrounding environmental noise. However, audio advertising which is part of the service did not have a significant negative effect. Another finding was that the advantage of using multiple voices for content segmentation may have been undermined by difficulties in listener adaptation. The work is part of a new approach to speech synthesis provision, where its style of rendition forms a part of the application design and it is evaluated within an application context.",
keywords = "auditory interfaces, evaluation, mobile systems, Speech synthesis",
author = "Mathew Aylett and Yolanda Vazquez-Alvarez and Lynne Baillie",
year = "2013",
month = "10",
day = "31",
doi = "10.2498/iti.2013.0537",
language = "English",
isbn = "9789537138301",
pages = "167--172",
booktitle = "Proceedings of the ITI 2013 35th International Conference on Information Technology Interfaces",
publisher = "IEEE",
address = "United States",

}

Aylett, M, Vazquez-Alvarez, Y & Baillie, L 2013, Evaluating speech synthesis in a mobile context: Audio presentation of Facebook, Twitter and RSS. in Proceedings of the ITI 2013 35th International Conference on Information Technology Interfaces. IEEE, pp. 167-172, 35th International Conference on Information Technology Interfaces 2013, Cavtat / Dubrovnik, Croatia, 24/06/13. https://doi.org/10.2498/iti.2013.0537

Evaluating speech synthesis in a mobile context : Audio presentation of Facebook, Twitter and RSS. / Aylett, Mathew; Vazquez-Alvarez, Yolanda; Baillie, Lynne.

Proceedings of the ITI 2013 35th International Conference on Information Technology Interfaces. IEEE, 2013. p. 167-172.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - Evaluating speech synthesis in a mobile context

T2 - Audio presentation of Facebook, Twitter and RSS

AU - Aylett, Mathew

AU - Vazquez-Alvarez, Yolanda

AU - Baillie, Lynne

PY - 2013/10/31

Y1 - 2013/10/31

N2 - This paper presents an evaluation of a podcast service that aggregates data from Facebook, Twitter and RSS feeds, using speech synthesis. The service uses a novel approach to speech synthesis generation, where XML markup is used to control both the speech synthesis and the sound design of a resulting podcast. A two-phase evaluation was carried out: 1) participants listening to the podcasts on desktop computers, 2) participants listening to the podcasts while walking. Our findings show that participants preferred shorter podcasts with sound effects and background music, and were affected by the surrounding environmental noise. However, audio advertising which is part of the service did not have a significant negative effect. Another finding was that the advantage of using multiple voices for content segmentation may have been undermined by difficulties in listener adaptation. The work is part of a new approach to speech synthesis provision, where its style of rendition forms a part of the application design and it is evaluated within an application context.

AB - This paper presents an evaluation of a podcast service that aggregates data from Facebook, Twitter and RSS feeds, using speech synthesis. The service uses a novel approach to speech synthesis generation, where XML markup is used to control both the speech synthesis and the sound design of a resulting podcast. A two-phase evaluation was carried out: 1) participants listening to the podcasts on desktop computers, 2) participants listening to the podcasts while walking. Our findings show that participants preferred shorter podcasts with sound effects and background music, and were affected by the surrounding environmental noise. However, audio advertising which is part of the service did not have a significant negative effect. Another finding was that the advantage of using multiple voices for content segmentation may have been undermined by difficulties in listener adaptation. The work is part of a new approach to speech synthesis provision, where its style of rendition forms a part of the application design and it is evaluated within an application context.

KW - auditory interfaces

KW - evaluation

KW - mobile systems

KW - Speech synthesis

UR - http://www.scopus.com/inward/record.url?scp=84891274063&partnerID=8YFLogxK

U2 - 10.2498/iti.2013.0537

DO - 10.2498/iti.2013.0537

M3 - Conference contribution

AN - SCOPUS:84891274063

SN - 9789537138301

SP - 167

EP - 172

BT - Proceedings of the ITI 2013 35th International Conference on Information Technology Interfaces

PB - IEEE

ER -

Aylett M, Vazquez-Alvarez Y, Baillie L. Evaluating speech synthesis in a mobile context: Audio presentation of Facebook, Twitter and RSS. In Proceedings of the ITI 2013 35th International Conference on Information Technology Interfaces. IEEE. 2013. p. 167-172 https://doi.org/10.2498/iti.2013.0537