Abstract
We present a novel path planner for adaptive behaviour of an Autonomous Underwater Vehicle (AUV). A behaviour-based architecture forms the foundation of the system with an extra layer which uses experience to learn a policy for modulating the behaviours' weights. In effect, this creates an abstract environment for the Reinforement Learning (RL) agent's state and action space. Subsequently, it simplifies the problem the RL agent is addressing, creating a more stable system. The Episodic Natural Actor Critic (ENAC) RL algorithm is used due to the continuous input and output domains and for the natural actor critic's convergence properties. Adaptiveness of the system is presented in a thruster failure scenario. RL is used in this failure scenario to learn an appropriate policy for the behaviours' weights under the new vehicle dynamics. We apply this control architecture to the domain of marine archaeology which has an inherent problem of navigation in unknown, potentially complex and dangerous environments. Simulated results of the proposed control architecture demonstrate its feasibility and performance.
Original language | English |
---|---|
Title of host publication | OCEANS 2015 - Genova |
Subtitle of host publication | Discovering Sustainable Ocean Energy for a New World |
Publisher | IEEE |
ISBN (Print) | 9781479987368 |
DOIs | |
Publication status | Published - 2015 |
Event | MTS/IEEE OCEANS 2015 - Genova, Italy Duration: 18 May 2015 → 21 May 2015 |
Conference
Conference | MTS/IEEE OCEANS 2015 |
---|---|
Abbreviated title | OCEANS 2015 |
Country/Territory | Italy |
City | Genova |
Period | 18/05/15 → 21/05/15 |
ASJC Scopus subject areas
- Renewable Energy, Sustainability and the Environment
- Oceanography