Abstract
Direct quotations are used for opinion mining and information extraction as they have an easy to extract span and they can be attributed to a speaker with high accuracy. However, simply focusing on direct quotations ignores around half of all reported speech, which is in the form of indirect or mixed speech. This work presents the first large-scale experiments in indirect and mixed quotation extraction and attribution. We propose two methods of extracting all quote types from news articles and evaluate them on two large annotated corpora, one of which is a contribution of this work. We further show that direct quotation attribution methods can be successfully applied to indirect and mixed quotation attribution.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing |
| Publisher | Association for Computational Linguistics |
| Pages | 989-999 |
| Number of pages | 11 |
| ISBN (Electronic) | 9781937284978 |
| Publication status | Published - Oct 2013 |
| Event | 2013 Conference on Empirical Methods in Natural Language Processing - Seattle, United States Duration: 18 Oct 2013 → 21 Oct 2013 |
Conference
| Conference | 2013 Conference on Empirical Methods in Natural Language Processing |
|---|---|
| Abbreviated title | EMNLP 2013 |
| Country/Territory | United States |
| City | Seattle |
| Period | 18/10/13 → 21/10/13 |
ASJC Scopus subject areas
- Computational Theory and Mathematics
- Information Systems
- Computer Vision and Pattern Recognition