TY - UNPB
T1 - Improving Cooperation in Collaborative Embodied AI
AU - Suprabha, Hima Jacob Leven
AU - Nagesh, Laxmi Nag Laxminarayan
AU - Nair, Ajith
AU - Selvaster, Alvin Reuben Amal
AU - Khan, Ayan
AU - Damarla, Raghuram
AU - Samuel, Sanju Hannah
AU - Perumal, Sreenithi Saravana
AU - Puech, Titouan
AU - Marella, Venkataramireddy
AU - Sonar, Vishal
AU - Suglia, Alessandro
AU - Lemon, Oliver
N1 - In proceedings of UKCI 2025
PY - 2025/10/3
Y1 - 2025/10/3
N2 - The integration of Large Language Models (LLMs) into multiagent systems has opened new possibilities for collaborative reasoning and cooperation with AI agents. This paper explores different prompting methods and evaluates their effectiveness in enhancing agent collaborative behaviour and decision-making. We enhance CoELA, a framework designed for building Collaborative Embodied Agents that leverage LLMs for multi-agent communication, reasoning, and task coordination in shared virtual spaces. Through systematic experimentation, we examine different LLMs and prompt engineering strategies to identify optimised combinations that maximise collaboration performance. Furthermore, we extend our research by integrating speech capabilities, enabling seamless collaborative voice-based interactions. Our findings highlight the effectiveness of prompt optimisation in enhancing collaborative agent performance; for example, our best combination improved the efficiency of the system running with Gemma3 by 22% compared to the original CoELA system. In addition, the speech integration provides a more engaging user interface for iterative system development and demonstrations.
AB - The integration of Large Language Models (LLMs) into multiagent systems has opened new possibilities for collaborative reasoning and cooperation with AI agents. This paper explores different prompting methods and evaluates their effectiveness in enhancing agent collaborative behaviour and decision-making. We enhance CoELA, a framework designed for building Collaborative Embodied Agents that leverage LLMs for multi-agent communication, reasoning, and task coordination in shared virtual spaces. Through systematic experimentation, we examine different LLMs and prompt engineering strategies to identify optimised combinations that maximise collaboration performance. Furthermore, we extend our research by integrating speech capabilities, enabling seamless collaborative voice-based interactions. Our findings highlight the effectiveness of prompt optimisation in enhancing collaborative agent performance; for example, our best combination improved the efficiency of the system running with Gemma3 by 22% compared to the original CoELA system. In addition, the speech integration provides a more engaging user interface for iterative system development and demonstrations.
KW - cs.AI
KW - cs.MA
KW - cs.RO
U2 - 10.48550/arXiv.2510.03153
DO - 10.48550/arXiv.2510.03153
M3 - Preprint
BT - Improving Cooperation in Collaborative Embodied AI
PB - arXiv
ER -