Abstract
Hansard transcripts provide access to Members of Parliament’s opinions on many important issues, but are difficult for people to process. Existing corpora for sentiment analysis in Hansard debates rely on speakers’ votes as sentiment labels, but these votes are known to be constrained by speakers’ party affiliations. We develop an annotation scheme and create a novel corpus designed for use in the evaluation of sentiment analysis systems using automatically and manually applied speech labels. Observing the effects on speech sentiment of differing sentiment polarities in debate motions (proposals), we also apply sentiment labels to these motions. We find that humans are able to reach high agreement in identifying sentiment polarity in these debates, and that manually applied and automatically retrieved class labels differ somewhat, suggesting that speech content does not always reflect the voting behaviour of Members of Parliament.
Original language | English |
---|---|
Title of host publication | ParlaCLARIN |
Subtitle of host publication | Creating and Using Parliamentary Corpora |
Publisher | Clarin |
Pages | 43-47 |
Number of pages | 5 |
ISBN (Electronic) | 4003994155486 |
ISBN (Print) | 9780306406157 |
Publication status | Published - 7 May 2018 |
Event | International Language Resource and Evaluation Conference 2018: ParlaCLARIN Workshop - Miyazaki, Japan Duration: 7 May 2018 → 7 May 2018 |
Conference
Conference | International Language Resource and Evaluation Conference 2018 |
---|---|
Abbreviated title | LREC 2018 |
Country/Territory | Japan |
City | Miyazaki |
Period | 7/05/18 → 7/05/18 |
Keywords
- Hansard
- UK Parliament
- Sentiment Analysis