Abstract
Increasing hateful conduct online demands effective counterspeech strategies to mitigate its impact. We introduce a novel dataset annotated with such strategies, aimed at facilitating the generation of targeted responses to hateful language. We labelled 1000 hate speech/counterspeech pairs from an existing dataset with strategies established in the social sciences. We find that a one-shot prompted classification model achieves promising accuracy in classifying the strategies according to the manual labels, demonstrating the potential of generative Large Language Models (LLMs) to distinguish between counterspeech strategies.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 8th Workshop on Online Abuse and Harms (WOAH 2024) |
| Editors | Yi-Ling Chung, Zeerak Talat, Debora Nozza, Flor Miriam Plaza-del-Arco, Paul Rottger, Aida Mostafazadeh Davani, Agostina Calabrese |
| Publisher | Association for Computational Linguistics |
| Pages | 256-265 |
| Number of pages | 10 |
| ISBN (Electronic) | 9798891761056 |
| DOIs | |
| Publication status | Published - 20 Jun 2024 |
| Event | 8th Workshop on Online Abuse and Harms 2024 - Mexico City, Mexico Duration: 20 Jun 2024 → 20 Jun 2024 |
Conference
| Conference | 8th Workshop on Online Abuse and Harms 2024 |
|---|---|
| Abbreviated title | WOAH 2024 |
| Country/Territory | Mexico |
| City | Mexico City |
| Period | 20/06/24 → 20/06/24 |
ASJC Scopus subject areas
- Language and Linguistics
- Computational Theory and Mathematics
- Software
- Linguistics and Language