A Sentiment-labelled Corpus of Hansard Parliamentary Debate Speeches

Gavin Abercrombie, Riza Theresa Batista-Navarro

Research output: Chapter in Book/Report/Conference proceedingConference contribution

59 Downloads (Pure)

Abstract

Hansard transcripts provide access to Members of Parliament’s opinions on many important issues, but are difficult for people to process. Existing corpora for sentiment analysis in Hansard debates rely on speakers’ votes as sentiment labels, but these votes are known to be constrained by speakers’ party affiliations. We develop an annotation scheme and create a novel corpus designed for use in the evaluation of sentiment analysis systems using automatically and manually applied speech labels. Observing the effects on speech sentiment of differing sentiment polarities in debate motions (proposals), we also apply sentiment labels to these motions. We find that humans are able to reach high agreement in identifying sentiment polarity in these debates, and that manually applied and automatically retrieved class labels differ somewhat, suggesting that speech content does not always reflect the voting behaviour of Members of Parliament.
Original languageEnglish
Title of host publicationParlaCLARIN
Subtitle of host publicationCreating and Using Parliamentary Corpora
PublisherClarin
Pages43-47
Number of pages5
ISBN (Electronic)4003994155486
ISBN (Print)9780306406157
Publication statusPublished - 7 May 2018
EventInternational Language Resource and Evaluation Conference 2018: ParlaCLARIN Workshop - Miyazaki, Japan
Duration: 7 May 20187 May 2018

Conference

ConferenceInternational Language Resource and Evaluation Conference 2018
Abbreviated titleLREC 2018
Country/TerritoryJapan
CityMiyazaki
Period7/05/187/05/18

Keywords

  • Hansard
  • UK Parliament
  • Sentiment Analysis

Fingerprint

Dive into the research topics of 'A Sentiment-labelled Corpus of Hansard Parliamentary Debate Speeches'. Together they form a unique fingerprint.

Cite this