Skip to main navigation Skip to search Skip to main content

O-Dang! The Ontology of Dangerous Speech Messages

  • Marco Antonio Stranisci*
  • , Simona Frenda*
  • , Mirko Lai*
  • , Oscar Araque*
  • , Alessandra Teresa Cignarella*
  • , Valerio Basile*
  • , Viviana Patti*
  • , Cristina Bosco*
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Inside the NLP community there is a considerable amount of language resources created, annotated and released every day with the aim of studying specific linguistic phenomena. Despite a variety of attempts in order to organize such resources has been carried on, a lack of systematic methods and of possible interoperability between resources are still present. Furthermore, when storing linguistic information, still nowadays, the most common practice is the concept of “gold standard”, which is in contrast with recent trends in NLP that aim at stressing the importance of different subjectivities and points of view when training machine learning and deep learning methods. In this paper we present O-Dang!: The Ontology of Dangerous Speech Messages, a systematic and interoperable Knowledge Graph (KG) for the collection of linguistic annotated data. O-Dang! is designed to gather and organize Italian datasets into a structured KG, according to the principles shared within the Linguistic Linked Open Data community. The ontology has also been designed to account a perspectivist approach, since it provides a model for encoding both gold standard and single-annotator labels in the KG. The paper is structured as follows. In Section 1. the motivations of our work are outlined. Section 2. describes the O-Dang! Ontology, that provides a common semantic model for the integration of datasets in the KG. The Ontology Population stage with information about corpora, users, and annotations is presented in Section 3.. Finally, in Section 4. an analysis of offensiveness across corpora is provided as a first case study for the resource.
Original languageEnglish
Title of host publicationProceedings of the 2nd Workshop on Sentiment Analysis and Linguistic Linked Data
EditorsIlan Kernerman, Sara Carvalho, Carlos A. Iglesias, Rachele Sprugnoli
PublisherEuropean Language Resources Association
Pages2-8
Number of pages7
ISBN (Electronic)9791095546764
Publication statusPublished - 20 Jun 2022
Event2nd Workshop on Sentiment Analysis and Linguistic Linked Data 2022 - Marseille, France
Duration: 24 Jun 202224 Jun 2022

Conference

Conference2nd Workshop on Sentiment Analysis and Linguistic Linked Data 2022
Abbreviated titleLREC 2022 SALLD-2
Country/TerritoryFrance
CityMarseille
Period24/06/2224/06/22

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 5 - Gender Equality
    SDG 5 Gender Equality

Keywords

  • Annotations
  • Hate Speech
  • Irony
  • Knowledge Graph
  • LLOD
  • Misogyny
  • NLP
  • Perspectivism
  • Sarcasm
  • Subjectivity

ASJC Scopus subject areas

  • Language and Linguistics
  • Education
  • Library and Information Sciences
  • Linguistics and Language

Fingerprint

Dive into the research topics of 'O-Dang! The Ontology of Dangerous Speech Messages'. Together they form a unique fingerprint.

Cite this