A General Learning Framework for Open Ad Hoc Teamwork Using Graph-based Policy Learning

Arrasy Rahman, Ignacio Carlucho, Niklas Hopner, Stefano V. Albrecht

Research output: Contribution to journalArticlepeer-review

5 Citations (Scopus)
10 Downloads (Pure)

Abstract

Open ad hoc teamwork is the problem of training a single agent to efficiently collaborate with an unknown group of teammates whose composition may change over time. A variable team composition creates challenges for the agent, such as the requirement to adapt to new team dynamics and dealing with changing state vector sizes. These challenges are aggravated in real-world applications in which the controlled agent only has a partial view of the environment. In this work, we develop a class of solutions for open ad hoc teamwork under full and partial observability. We start by developing a solution for the fully observable case that leverages graph neural network architectures to obtain an optimal policy based on reinforcement learning. We then extend this solution to partially observable scenarios by proposing different methodologies that maintain belief estimates over the latent environment states and team composition. These belief estimates are combined with our solution for the fully observable case to compute an agent's optimal policy under partial observability in open ad hoc teamwork. Empirical results demonstrate that our solution can learn efficient policies in open ad hoc teamwork in fully and partially observable cases. Further analysis demonstrates that our methods' success is a result of effectively learning the effects of teammates' actions while also inferring the inherent state of the environment under partial observability.

Original languageEnglish
Pages (from-to)14150-14223
Number of pages74
JournalJournal of Machine Learning Research
Volume24
Issue number1
Publication statusPublished - 2023

Keywords

  • ad hoc teamwork
  • graph neural networks
  • partial observability
  • particle filter
  • reinforcement learning

ASJC Scopus subject areas

  • Control and Systems Engineering
  • Software
  • Statistics and Probability
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'A General Learning Framework for Open Ad Hoc Teamwork Using Graph-based Policy Learning'. Together they form a unique fingerprint.

Cite this