Data Quality Issues in Current Nanopublications

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Nanopublications are a granular way of publishing scientific claims together with their associated provenance and publication information. More than 10 million nanopublications have been published by a handful of researchers covering a wide range of topics within the life sciences. We were motivated to replicate an existing analysis of these nanopublications, but then went deeper into the structure of the existing nanopublications. In this paper, we analyse the usage of nanopublications by investigating the distribution of triples in each part and discuss the data quality issues that were subsequently revealed. We argue that there is a need for the community to develop a set of guidelines for the modelling of nanopublications.
Original language: English
Title of host publication: 2019 IEEE 15th International Conference on e-Science (e-Science)
Publisher: IEEE
Publication status: Accepted/In press - 29 Jul 2019

Keywords

  • Semantic Publication
  • Nanopublication
  • Reproducibility
  • Provenance
  • Linked Data
  • Data Quality
