Abstract
The soundness and optimality of a plan depends on the correctness of the domain model. Specifying complete domain models can be difficult when interactions between an agent and its environment are complex. We propose a model-based reinforcement learning (MBRL) approach to solve planning problems with unknown models. The model is learned incrementally over episodes using only experiences from the current episode which suits non-stationary environments. We introduce the novel concept of reliability as an intrinsic motivation for MBRL, and a method to learn from failure to prevent repeated instances of similar failures. Our motivation is to improve the learning efficiency and goal-directedness of MBRL. We evaluate our work with experimental results for three planning domains.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence |
| Editors | Sarit Kraus |
| Pages | 3195-3201 |
| Number of pages | 7 |
| ISBN (Electronic) | 9780999241141 |
| DOIs | |
| Publication status | Published - Aug 2019 |
| Event | 28th International Joint Conference on Artificial Intelligence 2019 - Macao, China Duration: 10 Aug 2019 → 16 Aug 2019 https://ijcai19.org/ |
Conference
| Conference | 28th International Joint Conference on Artificial Intelligence 2019 |
|---|---|
| Abbreviated title | IJCAI 2019 |
| Country/Territory | China |
| City | Macao |
| Period | 10/08/19 → 16/08/19 |
| Internet address |
ASJC Scopus subject areas
- Artificial Intelligence