Loughborough University
Browse

Designing explainable cognitive systems and explaining neural networks with plastic dynamical systems

Download (15.21 MB)

The most powerful Artificial Intelligence (AI) to date is based on neural networks (NNs) and features a critical flaw, the lack of explainability, which undermines trust in its decision-making. Building explainable cognitive machines, as well as analysing the existing ones, is impeded by the lack of unambiguous technical definitions for cognition and its constituents, such as representations of memories and categories, behavioural code, learning and thinking, and for explainability itself. Employing dynamical systems (DS) approach to cognition, we introduce a cohesive, self-consistent, and self-contained mathematical framework for designing an explainable artificial cognitive system to specification, anchored in technical definitions we propose. Our definition of cognition is an interpretation in terms of DS theory of the existing definition from computational neuroscience, which builds upon an earlier idea of cognition as self-organisation of a vector field. This vector field, serving as both the behavioural code and the substrate for memory imprinting, evolves obeying some learning rules. We apply our framework to achieve two main goals. Firstly, we design a “perfect” explainable cognitive system that overcomes common NN issues, such as catastrophic forgetting, spurious memories, lack of lifelong learning, and limited explainability, while incorporating additional functions seen in humans and animals. Secondly, we reveal learning rules for the vector field of a simple NN, and benchmark them against those of the “perfect” system. We use these rules to partly explain how the NN learns, to interpret connection weights, and to highlight the inherent lack of explainability. Our framework fosters deeper comprehension of the existing AI, and paves the way for developing explainable AI with cognitive functions closer to those of biological systems.

Funding

Maths DTP 2021/22 Loughborough University

Engineering and Physical Sciences Research Council

Find out more...

History

School

  • Science

Department

  • Mathematical Sciences

Published in

SSRN

Publisher

Elsevier

Version

  • SMUR (Submitted Manuscript Under Review)

Rights holder

© Elsevier

Publisher statement

All materials on SSRN are the copyright of Elsevier or are reproduced with permission from other copyright owners. All rights are reserved. The materials on this web site may be retrieved and downloaded solely for personal non-commercial use. No materials may otherwise be copied, modified, published, broadcast or otherwise distributed. Social Science Research Network, SSRN, SSRN.com and/or any other names of products or services provided by Elsevier or referred to on this web site are either trademarks or registered trademarks of Elsevier.

Copyright date

2024

Language

  • en

Depositor

Natalia Janson. Deposit date: 7 February 2025

Usage metrics

    Loughborough Publications

    Categories

    No categories selected

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC