Loughborough University
Browse

Investigating transferability in multi-agent reinforcement learning

Download (775.33 kB)
conference contribution
posted on 2025-09-04, 11:54 authored by Corentin Artaud, Rafael PinaRafael Pina, Varuna De-SilvaVaruna De-Silva, Xiyu ShiXiyu Shi
<p dir="ltr">Effectively transferring knowledge from one task to another in cooperative Multi-Agent Reinforcement Learning (MARL) can be key to accelerate learning in complex scenarios. For instance, in sports it is common to train in simpler situations and then apply what was learned in the real game. The same logic applies to other scenarios that involve increasing levels of difficulty, such as robotics tasks, or healthcare applications. In the realm of MARL, transferring knowledge can become extremely challenging due to factors such as changes in the observation and action spaces, or the underlying dynamics of the environment. Consequently, most current methods still opt to train each new task from scratch. However, as tasks become increasingly complex, there is a growing interest in leveraging behavioural similarities that emerge among semantically similar tasks. In this context, we explore techniques to facilitate the rapid transfer of knowledge from one policy network to another within off-policy value-based methods. Additionally, we introduce a special case that enables function-preserving transfers of centralised functions between tasks. Our work offers a promising strategy to reduce training time, enabling zero-shot task transfers.</p>

Funding

ATRACT: A Trustworthy Robotic Autonomous system to support Casualty Triage

Engineering and Physical Sciences Research Council

Find out more...

History

School

  • Loughborough University, London

Published in

2025 IEEE 49th Annual Computers, Software, and Applications Conference (COMPSAC)

Pages

1583 - 1588

Source

2025 IEEE 49th Annual Computers, Software, and Applications Conference (COMPSAC)

Publisher

IEEE

Version

  • AM (Accepted Manuscript)

Rights holder

© IEEE

Publisher statement

This accepted manuscript has been made available under the Creative Commons Attribution licence (CC BY) under the IEEE JISC UK green open access agreement.

Publication date

2025-07-08

Copyright date

2025

ISBN

9798331574345

ISSN

2836-3787

eISSN

2836-3795

Language

  • en

Location

Toronto, ON, Canada

Event dates

8th July 2025 - 11th July 2025

Depositor

Dr Xiyu Shi. Deposit date: 4 September 2025

Usage metrics

    Loughborough Publications

    Categories

    No categories selected

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC