Analysing and Improving Embedded Markup of Learning Resources on the Web

Stefan Dietze, Davide Taibi, Ran Yu, Phil Barker, Mathieu d'Anquin

Research output: Chapter in Book/Report/Conference proceedingConference contribution

16 Citations (Scopus)
148 Downloads (Pure)

Abstract

Web-scale reuse and interoperability of learning resources have been major concerns for the technology-enhanced learning community. While work in this area traditionally focused on learning resource metadata, provided through learning resource repositories, the recent emergence of structured entity markup on the Web through standards such as RDFa and Microdata and initiatives such as schema.org, has provided new forms of entity-centric knowledge, which is so far under-investigated and hardly exploited. The Learning Resource Metadata Initiative (LRMI) provides a vocabulary for annotating learning resources through schema.org terms. Although recent studies have shown markup adoption by approximately 30% of all Web pages, understanding of the scope, distribution and quality of learning resources markup is limited. We provide the first public corpus of LRMI extracted from a representative Web crawl together with an analysis of LRMI adoption on the Web, with the goal to inform data consumers as well as future vocabulary refinements through a thorough understanding of the use as well as misuse of LRMI vocabulary terms. While errors and schema misuse are frequent, we also discuss a set of simple heuristics which significantly improve the accuracy of markup, a prerequisite for reusing learning resource metadata sourced from markup.
Original languageEnglish
Title of host publicationProceedings of the 26th International Conference on World Wide Web Companion
PublisherAssociation for Computing Machinery
Pages283-292
Number of pages10
ISBN (Electronic)9781450349147
DOIs
Publication statusPublished - 3 Apr 2017
Event26th International World Wide Web Conference 2017 - Perth, Australia
Duration: 3 Apr 20177 Apr 2017

Conference

Conference26th International World Wide Web Conference 2017
Country/TerritoryAustralia
CityPerth
Period3/04/177/04/17

Keywords

  • Learning resources
  • lrmi
  • schema.org
  • web markup

Fingerprint

Dive into the research topics of 'Analysing and Improving Embedded Markup of Learning Resources on the Web'. Together they form a unique fingerprint.

Cite this