Generating corrupted data sources for the evaluation of matching systems

Fiona McNeill, Diana Bental, Alasdair J. G. Gray, Sabina Jedrzejczyk, Ahmad Alsadeeqi

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution



One of the most difficult aspects of developing matching systems – whether for matching ontologies or for other types of mismatched data – is evaluation. The accuracy of ontology matchers is usually evaluated by measuring the results produced by the systems against reference ontologies, but reference ontologies are expensive and difficult to create. In this paper we discuss the use of crptr, a system that corrupts data of different sorts in order to mimic the kind of differences one might expect to find between different data sources on related topics. The corruption process automatically creates a map between the original and the corrupted data source, and matching systems can be evaluated by comparing their output to this map. We describe the extension of crptr to ontology-based data and query mismatch, and then discuss how it could be extended to other kinds of matching, including ontology matching.
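
The evaluation idea described in the abstract can be illustrated with a minimal sketch. It is not crptr's actual interface or corruption model: the function names (`corrupt_value`, `corrupt_source`, `evaluate`), the single adjacent-character-swap corruption, the `c_` identifier prefix, and the sample records are all assumptions made for illustration. The sketch corrupts a small data source, records the map from original to corrupted entries as it goes, and scores a matcher's output against that map.

```python
import random


def corrupt_value(value, rng):
    """Swap two adjacent characters to mimic a typo (one hypothetical corruption)."""
    if len(value) < 2:
        return value
    i = rng.randrange(len(value) - 1)
    return value[:i] + value[i + 1] + value[i] + value[i + 2:]


def corrupt_source(records, rng):
    """Return the corrupted records and the reference map (original id -> corrupted id)."""
    corrupted = {}
    reference_map = {}
    for orig_id, value in records.items():
        new_id = f"c_{orig_id}"  # hypothetical naming scheme for corrupted entries
        corrupted[new_id] = corrupt_value(value, rng)
        reference_map[orig_id] = new_id
    return corrupted, reference_map


def evaluate(predicted_pairs, reference_map):
    """Precision and recall of predicted (original, corrupted) pairs against the map."""
    reference = set(reference_map.items())
    predicted = set(predicted_pairs)
    true_positives = len(predicted & reference)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(reference) if reference else 0.0
    return precision, recall


# Illustrative data; a real matcher would produce `predicted` from the two sources.
rng = random.Random(0)
records = {"r1": "Edinburgh", "r2": "Auckland", "r3": "Glasgow"}
corrupted, reference_map = corrupt_source(records, rng)
predicted = [("r1", "c_r1"), ("r2", "c_r3")]
precision, recall = evaluate(predicted, reference_map)
```

Because the map is produced as a side effect of corruption, no hand-built reference is needed; the trade-off is that the evaluation is only as realistic as the corruptions applied.
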
Original language: English
Title of host publication: The Fourteenth International Workshop on Ontology Matching
Publication status: Accepted/In press - 25 Jul 2019
Event: Fourteenth International Workshop on Ontology Matching - Auckland, New Zealand
Duration: 26 Oct 2019 to 26 Oct 2019


Workshop: Fourteenth International Workshop on Ontology Matching
Abbreviated title: OM-2019
Country: New Zealand

