Administrative Social Science Data: The challenge of reproducible research

Christopher Playford, Vernon Gayle, Roxanne Connelly, Alasdair J G Gray

Research output: Contribution to journalArticlepeer-review

23 Citations (Scopus)
54 Downloads (Pure)


Powerful new social science data resources are emerging. One particularly important source is administrative data, which were originally collected for organisational purposes but often contain information that is suitable for social science research. In this paper we outline the concept of reproducible research in relation to micro-level administrative social science data. Our central claim is that a planned and organised workflow is essential for high quality research using micro-level administrative social science data.

We argue that it is essential for researchers to share research code, because code sharing enables the elements of reproducible research. First, it enables results to be duplicated and therefore allows the accuracy and validity of analyses to be evaluated. Second, it facilitates further tests of the robustness of the original piece of research. Drawing on insights from computer science and other disciplines that have been engaged in e-Research we discuss and advocate the use of Git repositories to provide a useable and effective solution to research code sharing and rendering social science research using micro-level administrative data reproducible.
Original languageEnglish
JournalBig Data & Society
Issue number2
Publication statusPublished - 1 Dec 2016


  • Big Data
  • Administrative Data
  • Reproducibility
  • Replication
  • Workflow
  • Git


Dive into the research topics of 'Administrative Social Science Data: The challenge of reproducible research'. Together they form a unique fingerprint.

Cite this