Abstract
Reinforcement learning problems are often discretised, use linear function approximation, or perform batch updates. However, many applications that can benefit from reinforcement learning contain continuous variables and are inherently non-linear, for example, the control of aerospace or maritime robotic vehicles. Recent work has brought focus onto online temporal difference methods, specifically for using non-linear function approximation. In this paper, we evaluate the Forward Actor-Critic against the regular Actor-Critic, and Continuous Actor-Critic Learning Automation. We also propose and evaluate a new algorithm called True Online Continuous Learning Automation (TOCLA) which combines these two approaches. The chosen benchmark problem was the Mountain Car Continuous-vO environment from OpenAI Gym, which represents a further step in complexity over the benchmark used to test the Forward Actor Critic in previous works. Our results demonstrate the superiority of TOCLA in terms of its sensitivity to hyper-parameter selection compared with the Forward Actor Critic, Continuous Actor Critic Learning Automation, and Actor Critic algorithms.
Original language | English |
---|---|
Title of host publication | 2020 IEEE Symposium Series on Computational Intelligence (SSCI) |
Publisher | IEEE |
Pages | 266-275 |
Number of pages | 10 |
ISBN (Electronic) | 9781728125473 |
DOIs | |
Publication status | Published - 5 Jan 2021 |
Event | 2020 IEEE Symposium Series on Computational Intelligence - Camberra, Australia Duration: 1 Dec 2020 → 4 Dec 2020 Conference number: 46 http://www.ieeessci2020.org/ |
Conference
Conference | 2020 IEEE Symposium Series on Computational Intelligence |
---|---|
Abbreviated title | SSCI 2020 |
Country/Territory | Australia |
City | Camberra |
Period | 1/12/20 → 4/12/20 |
Internet address |
Keywords
- Actor-Critic
- CACLA
- Forward Actor-Critic
- Nonlinear Function Approximation
- Reinforcement Learning
- TOCLA
ASJC Scopus subject areas
- Artificial Intelligence
- Computer Science Applications
- Decision Sciences (miscellaneous)