Context meta-reinforcement learning via neuromodulation
Meta-reinforcement learning (meta-RL) algorithms enable agents to adapt quickly to tasks in dynamic environments from only a few samples. Such a feat is achieved through dynamic representations in an agent’s policy network (obtained by reasoning about task context, by model parameter updates, or both). However, obtaining rich dynamic representations for fast adaptation beyond simple benchmark problems is challenging because of the burden placed on a single policy network to accommodate different policies. This paper addresses the challenge by introducing neuromodulation as a modular component that augments a standard policy network and regulates neuronal activities to produce efficient dynamic representations for task adaptation. The proposed extension to the policy network is evaluated across multiple discrete and continuous control environments of increasing complexity. To demonstrate the generality and benefits of the extension in meta-RL, the neuromodulated network was applied to two state-of-the-art meta-RL algorithms (CAVIA and PEARL). The results show that meta-RL augmented with neuromodulation produces significantly better results and richer dynamic representations than the baselines.
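The neuromodulation described in the abstract can be pictured as a standard feed-forward policy-network layer whose per-neuron activity is scaled by a gating signal derived from the task context. The sketch below illustrates that general idea only; the class name, layer sizes, and sigmoid gating are illustrative assumptions rather than the paper’s exact architecture.

```python
# Minimal sketch (not the authors' exact architecture): a policy-network layer
# whose activations are gated by a neuromodulatory signal computed from a task
# context vector. All names and dimensions here are illustrative assumptions.
import torch
import torch.nn as nn


class NeuromodulatedLayer(nn.Module):
    def __init__(self, in_dim, out_dim, context_dim):
        super().__init__()
        self.fc = nn.Linear(in_dim, out_dim)        # standard policy-network layer
        self.mod = nn.Linear(context_dim, out_dim)  # neuromodulatory pathway

    def forward(self, x, context):
        h = torch.relu(self.fc(x))                  # ordinary neuronal activity
        gate = torch.sigmoid(self.mod(context))     # per-neuron modulation in (0, 1)
        return h * gate                             # regulated (gated) activity


# Usage: observation of size 8, task-context vector of size 4, 32 hidden units.
layer = NeuromodulatedLayer(in_dim=8, out_dim=32, context_dim=4)
obs = torch.randn(2, 8)
ctx = torch.randn(2, 4)
out = layer(obs, ctx)  # shape (2, 32)
```

Stacking such layers and conditioning the gates on a learned task representation (for example, CAVIA’s context parameters or PEARL’s latent task variable) would give the kind of task-conditioned dynamic representations the abstract describes.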
Funding
United States Air Force Research Laboratory (AFRL) and Defense Advanced Research Projects Agency (DARPA) under Contract No. FA8750-18-C0103
History
School
- Science
Department
- Computer Science
Published in
Neural Networks
Volume
152
Pages
70-79
Publisher
Elsevier BV
Version
- VoR (Version of Record)
Rights holder
© The Authors
Publisher statement
This is an Open Access Article. It is published by Elsevier under the Creative Commons Attribution 4.0 International Licence (CC BY 4.0). Full details of this licence are available at: https://creativecommons.org/licenses/by/4.0/
Acceptance date
2022-04-05
Publication date
2022-04-12
Copyright date
2022
ISSN
0893-6080
Publisher version
Language
- en