EPIC: Multi-Perspective Annotation of a Corpus of Irony

  • Simona Frenda*
  • , Alessandro Pedrani
  • , Valerio Basile*
  • , Soda Marem Lo*
  • , Alessandra Teresa Cignarella*
  • , Raffaella Panizzon
  • , Cristina Marco
  • , Bianca Scarlini
  • , Viviana Patti*
  • , Cristina Bosco*
  • , Davide Bernardi
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

28 Citations (Scopus)

Abstract

We present EPIC (English Perspectivist Irony Corpus), the first annotated corpus for irony analysis based on the principles of data perspectivism. The corpus contains short conversations from social media in five regional varieties of English, and it is annotated by contributors from five countries corresponding to those varieties. We analyse the resource along the perspectives induced by the diversity of the annotators, in terms of origin, age, and gender, and the relationship between these dimensions, irony, and the topics of conversation. We validate EPIC by creating perspective-aware models that encode the perspectives of annotators grouped according to their demographic characteristics. Firstly, the performance of perspectivist models confirms that different annotators induce very different models. Secondly, in the classification of ironic and non-ironic texts, perspectivist models prove to be generally more confident than the non-perspectivist ones. Furthermore, comparing the performance on a perspective-based test set with those achieved on a gold standard test set, we can observe how perspectivist models tend to detect more precisely the positive class, showing their ability to capture the different perceptions of irony. Thanks to these models, we are moreover able to show interesting insights about the variation in the perception of irony by the different groups of annotators, such as among different generations and nationalities.

Original languageEnglish
Title of host publicationProceedings of the 61st Annual Meeting of the Association for Computational Linguistics
Subtitle of host publicationLong Papers
PublisherAssociation for Computational Linguistics
Pages13844-13857
Number of pages14
Volume1
ISBN (Electronic)9781959429722
DOIs
Publication statusPublished - 9 Jul 2023
Event61st Annual Meeting of the Association for Computational Linguistics 2023 - Toronto, Canada
Duration: 9 Jul 202314 Jul 2023

Conference

Conference61st Annual Meeting of the Association for Computational Linguistics 2023
Abbreviated titleACL 2023
Country/TerritoryCanada
CityToronto
Period9/07/2314/07/23

ASJC Scopus subject areas

  • Computer Science Applications
  • Linguistics and Language
  • Language and Linguistics

Fingerprint

Dive into the research topics of 'EPIC: Multi-Perspective Annotation of a Corpus of Irony'. Together they form a unique fingerprint.

Cite this