Loughborough University
Browse
09146155.pdf (2.4 MB)

Learning to learn sequential network attacks using hidden Markov models

Download (2.4 MB)
journal contribution
posted on 2020-07-21, 08:34 authored by Timothy Chadza, Kostas KyriakopoulosKostas Kyriakopoulos, Sangarapillai LambotharanSangarapillai Lambotharan
The global surge of cyber-attacks in the form of sequential network attacks has propelled the need for robust intrusion detection and prediction systems. Such attacks are difficult to reveal using current intrusion detection systems, since each individual attack phase may appear benign when examined outside of its context. In addition, there are challenges in building supervised learning models for such attacks, since there are limited labelled datasets available. Hence, there is a need for updating already built models to specific operational environments and for addressing the concept drift. Hidden Markov models (HMMs) is a popular framework for sequential modelling, however, in addition to the above challenges, the model parameters are difficult to optimise. This paper proposes a transfer learning (TL) approach that exploits already learned knowledge, gained from a labelled source dataset, and adapts it on a different, unlabelled target dataset. The datasets may be from a different but related domain. Five unsupervised HMM techniques are developed utilising a TL approach and evaluated against conventional machine learning approaches. Baum-Welch (BW), Viterbi training, gradient descent, differential evolution (DE) and simulated annealing, are deployed for the detection of attack stages in the network traffic, as well as, forecasting both the next most probable attack stage and its method of manifestation. Specifically, for the prediction of the three next most likely states and observations, TL with DE achieved a maximum accuracy improvement of 48.3%, and 27.4%, respectively. Finally, the actual detection prediction for the three next most probable states and methods of manifestation reaches 78.9% and 96.3% using TL with BW and DE, respectively.

Funding

Communications Signal Processing Based Solutions for Massive Machine-to-Machine Networks (M3NETs) : EP/R006385/1

History

School

  • Mechanical, Electrical and Manufacturing Engineering

Published in

IEEE Access

Volume

8

Pages

134480 - 134497

Publisher

Institute of Electrical and Electronics Engineers (IEEE)

Version

  • VoR (Version of Record)

Rights holder

© The Authors

Publisher statement

This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/

Acceptance date

2020-07-16

Publication date

2020-07-22

Copyright date

2020

eISSN

2169-3536

Language

  • en

Depositor

Dr Kostas Kyriakopoulos. Deposit date: 20 July 2020

Usage metrics

    Loughborough Publications

    Categories

    No categories selected

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC