Abstract
We present a multi-modal dialogue system for interactive learning of perceptually grounded word meanings from a human tutor. The system integrates an incremental, semantic parsing/generation framework - Dynamic Syntax and Type Theory with Records (DS-TTR) - with a set of visual classifiers that are learned throughout the interaction and which ground the meaning representations that it produces. We use this system in interaction with a simulated human tutor to study the effects of different dialogue policies and capabilities on accuracy of learned meanings, learning rates, and efforts/costs to the tutor. We show that the overall performance of the learning agent is affected by (1) who takes initiative in the dialogues; (2) the ability to express/use their confidence level about visual attributes; and (3) the ability to process elliptical and incrementally constructed dialogue turns. Ultimately, we train an adaptive dialogue policy which optimises the trade-off between classifier accuracy and tutoring costs.
Original language | English |
---|---|
Title of host publication | Proceedings of the SIGDIAL 2016 Conference |
Publisher | Association for Computational Linguistics |
Pages | 339-349 |
Number of pages | 11 |
ISBN (Print) | 9781945626234 |
DOIs | |
Publication status | Published - 15 Sept 2016 |
Event | The 17th Annual SIGdial Meeting on Discourse and Dialogue - Institute for Creative Technologies, Los Angeles, United States Duration: 13 Sept 2016 → 15 Sept 2016 http://www.sigdial.org/workshops/conference17/ |
Conference
Conference | The 17th Annual SIGdial Meeting on Discourse and Dialogue |
---|---|
Abbreviated title | SIGDIAL |
Country/Territory | United States |
City | Los Angeles |
Period | 13/09/16 → 15/09/16 |
Internet address |
Keywords
- Natural language processing
- Robotics, Development, Language action, Social interaction, Learning
- Artificial intelligence
Fingerprint
Dive into the research topics of 'Training an adaptive dialogue policy for interactive learning of visually grounded word meanings'. Together they form a unique fingerprint.Datasets
Profiles
-
Arash Eshghi
- School of Mathematical & Computer Sciences - Assistant Professor
- School of Mathematical & Computer Sciences, Computer Science - Assistant Professor
Person: Academic (Teaching)