Concept-to-text generation via discriminative reranking

Ioannis Konstas, Mirella Lapata

Research output: Chapter in Book/Report/Conference proceedingConference contribution

51 Citations (Scopus)

Abstract

This paper proposes a data-driven method for concept-to-text generation, the task of automatically producing textual output from non-linguistic input. A key insight in our approach is to reduce the tasks of content selection ("what to say") and surface realization ("how to say") into a common parsing problem. We define a probabilistic context-free grammar that describes the structure of the input (a corpus of database records and text describing some of them) and represent it compactly as a weighted hypergraph. The hypergraph structure encodes exponentially many derivations, which we rerank discriminatively using local and global features. We propose a novel decoding algorithm for finding the best scoring derivation and generating in this setting. Experimental evaluation on the ATIS domain shows that our model outperforms a competitive discriminative system both using BLEU and in a judgment elicitation study.

Original languageEnglish
Title of host publicationProceedings of the 50th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
PublisherAssociation for Computational Linguistics
Pages369-378
Number of pages10
ISBN (Print)9781937284244
Publication statusPublished - Jul 2012
Event50th Annual Meeting of the Association for Computational Linguistics 2012 - Jeju Island, Korea, Republic of
Duration: 8 Jul 201214 Jul 2012

Conference

Conference50th Annual Meeting of the Association for Computational Linguistics 2012
Abbreviated titleACL 2012
Country/TerritoryKorea, Republic of
CityJeju Island
Period8/07/1214/07/12

ASJC Scopus subject areas

  • Computational Theory and Mathematics
  • Software

Fingerprint

Dive into the research topics of 'Concept-to-text generation via discriminative reranking'. Together they form a unique fingerprint.

Cite this