Abstract
Large Language Models (LLMs), the bedrock of many “artificial intelligence” (AI) applications, are known to reproduce social biases present in their training data. Yet resources to measure and control this issue are limited. Research identifying and mitigating stereotype biases have primarily been concentrated around English, lagging the rapid advancement of LLMs in multilingual settings. To help further advance the ability to address stereotype bias in AI systems, we introduce a new multilingual dataset: SHADES. Designed for examining culturally-specific stereotypes that may be learned by LLMs, SHADES includes over 300 stereotypes from 37 regions, translated across 16 languages and annotated with multiple features to aid multilingual stereotype analysis. All statements in all languages are paired with templates, to serve as a resource for unlimited generation of new evaluation data. We demonstrate the utility of the dataset in a series of exploratory evaluations that reveal significant differences in how stereotypes are recognized and reflected across models and languages.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics |
| Subtitle of host publication | Human Language Technologies |
| Editors | Luis Chiruzzo, Alan Ritter, Lu Wang |
| Publisher | Association for Computational Linguistics |
| Pages | 11995-12041 |
| Number of pages | 47 |
| Volume | 1 |
| ISBN (Electronic) | 9798891761896 |
| DOIs | |
| Publication status | Published - Apr 2025 |
| Event | 2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, - Hybrid, Albuquerque, United States Duration: 29 Apr 2025 → 4 May 2025 |
Conference
| Conference | 2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics |
|---|---|
| Abbreviated title | NAACL-HLT 2025 |
| Country/Territory | United States |
| City | Hybrid, Albuquerque |
| Period | 29/04/25 → 4/05/25 |
ASJC Scopus subject areas
- Computer Networks and Communications
- Hardware and Architecture
- Information Systems
- Software
Fingerprint
Dive into the research topics of 'SHADES: Towards a Multilingual Assessment of Stereotypes in Large Language Models'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver