Short-term plasticity as cause-effect hypothesis testing in distal reward learning

Soltoggio, Andrea

1402.0710v5.pdf (1.76 MB)

Short-term plasticity as cause-effect hypothesis testing in distal reward learning

journal contribution

posted on 2014-10-15, 09:07 authored by Andrea SoltoggioAndrea Soltoggio

Asynchrony, overlaps and delays in sensory-motor signals introduce ambiguity as to which stimuli, actions, and rewards are causally related. Only the repetition of reward episodes helps distinguish true cause-effect relationships from coincidental occurrences. In the model proposed here, a novel plasticity rule employs short and long-term changes to evaluate hypotheses on cause-effect relationships. Transient weights represent hypotheses that are consolidated in long-term memory only when they consistently predict or cause future rewards. The main objective of the model is to preserve existing network topologies when learning with ambiguous information flows. Learning is also improved by biasing the exploration of the stimulus-response space towards actions that in the past occurred before rewards. The model indicates under which conditions beliefs can be consolidated in long-term memory, it suggests a solution to the plasticity-stability dilemma, and proposes an interpretation of the role of short-term plasticity.

Funding

European Community’s Seventh Framework Programme FP7/2007-2013, Challenge 2 Cognitive Systems, Interaction, Robotics under Grant Agreement No. 248311-AMARSi

History

School

Science

Department

Computer Science

Published in

Biological Cybernetics

Volume

109

Issue

1

Pages

75 - 94

Citation

SOLTOGGIO, A., 2015. Short-term plasticity as cause-effect hypothesis testing in distal reward learning. Biological Cybernetics, 109 (1), pp.75-94.

Publisher

Version

AM (Accepted Manuscript)

Publisher statement

This work is made available according to the conditions of the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0) licence. Full details of this licence are available at: https://creativecommons.org/licenses/by-nc-nd/4.0/

Acceptance date

2014-08-06

Publication date

2014-09-05

Notes

This article was published in the journal, Biological Cybernetics. The final publication is available at Springer via http://dx.doi.org/

DOI

https://doi.org/10.1007/s00422-014-0628-0

ISSN

0340-1200

eISSN

1432-0770

Publisher version

http://dx.doi.org/10.1007/s00422-014-0628-0

Language

en

Administrator link

https://repository.lboro.ac.uk/account/articles/9401591

Usage metrics

Keywords

Short-term plasticity Transient weights Distal reward Operant learning Plasticity versus stability Memory consolidation Artificial Intelligence and Image Processing Information and Computing Sciences not elsewhere classified

Licence

CC BY-NC-ND 4.0

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

Short-term plasticity as cause-effect hypothesis testing in distal reward learning

Funding

European Community’s Seventh Framework Programme FP7/2007-2013, Challenge 2 Cognitive Systems, Interaction, Robotics under Grant Agreement No. 248311-AMARSi

History

School

Department

Published in

Volume

Issue

Pages

Citation

Publisher

Version

Publisher statement

Acceptance date

Publication date

Notes

DOI

ISSN

eISSN

Publisher version

Language

Administrator link

Usage metrics

Categories

Keywords

Licence

Exports