The Dangers of trusting Stochastic Parrots: Faithfulness and Trust in Open-domain Conversational Question Answering

  • Sabrina Chiesurin
  • , Dimitris Dimakopoulos
  • , Marco Antonio Sobrevilla Cabezudo
  • , Arash Eshghi
  • , Ioannis Papaioannou
  • , Verena Rieser
  • , Ioannis Konstas

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Downloads (Pure)

Abstract

Large language models are known to produce output which sounds fluent and convincing, but is also often wrong, e.g. “unfaithful" with respect to a rationale as retrieved from a knowledge base. In this paper, we show that task-based systems which exhibit certain advanced linguistic dialog behaviors, such as lexical alignment (repeating what the user said), are in fact preferred and trusted more, whereas other phenomena, such as pronouns and ellipsis are dis-preferred. We use open-domain question answering systems as our test-bed for task based dialog generation and compare several open- and closed-book models. Our results highlight the danger of systems that appear to be trustworthy by parroting user input while providing an unfaithful response.

Original languageEnglish
Title of host publicationFindings of the Association for Computational Linguistics: ACL 2023
PublisherAssociation for Computational Linguistics
Pages947-959
Number of pages13
ISBN (Electronic)9781959429623
DOIs
Publication statusPublished - 9 Jul 2023
Event61st Annual Meeting of the Association for Computational Linguistics 2023 - Toronto, Canada
Duration: 9 Jul 202314 Jul 2023

Conference

Conference61st Annual Meeting of the Association for Computational Linguistics 2023
Abbreviated titleACL 2023
Country/TerritoryCanada
CityToronto
Period9/07/2314/07/23

ASJC Scopus subject areas

  • Computer Science Applications
  • Linguistics and Language
  • Language and Linguistics

Fingerprint

Dive into the research topics of 'The Dangers of trusting Stochastic Parrots: Faithfulness and Trust in Open-domain Conversational Question Answering'. Together they form a unique fingerprint.

Cite this