Abstract
The presence of social stereotypes in NLP resources is an emerging topic that challenges the approaches traditionally used to create corpora and resources. A growing number of scholars have proposed strategies that account for annotators’ subjectivity in order to reduce such bias in both computational resources and NLP models. In this paper, we present Open-Stereotype, an annotated corpus of Italian tweets and news headlines about immigration in Italy. The corpus was developed through an experimental procedure for annotating stereotypes, designed to investigate how differently they are interpreted. The annotation is the result of a six-step process in which annotators identify text spans expressing stereotypes, generate rationales about these spans, and group them into a more comprehensive set of labels. Results show that humans exhibit high subjectivity in conceptualizing this phenomenon, and that the prior knowledge of an Italian LLM leads to more consistent classifications of specific labels that do not depend on annotators’ background.
| Original language | English |
|---|---|
| Pages (from-to) | 603-612 |
| Number of pages | 10 |
| Journal | CEUR Workshop Proceedings |
| Volume | 4112 |
| Publication status | Published - 30 Nov 2025 |
| Event | 11th Italian Conference on Computational Linguistics 2025, Cagliari, Italy (24 Sept 2025 to 26 Sept 2025) |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
- SDG 10 Reduced Inequalities
Keywords
- Annotation
- Italian
- Social Bias
- Stereotypes
- Subjectivity
ASJC Scopus subject areas
- General Computer Science
Fingerprint
Dive into the research topics of 'Subjectivity in Stereotypes Against Migrants in Italian: An Experimental Annotation Procedure'. Together they form a unique fingerprint.