Investigating the effective use of machine learning algorithms in network intruder detection systems

Al-Mandhari, Intisar; Guan, Lin; Edirisinghe, Eran A.

File(s) under permanent embargo

Reason: This item is currently closed access.

Investigating the effective use of machine learning algorithms in network intruder detection systems

conference contribution

posted on 2019-02-07, 09:25 authored by Intisar Al-Mandhari, Lin GuanLin Guan, Eran A. Edirisinghe

Research into the use of machine learning techniques for network intrusion detection, especially carried out with respect to the popular public dataset, KDD cup 99, have become commonplace during the past decade. The recent popularity of cloud-based computing and the realization of the associated risks are the main reasons for this research thrust. The proposed research demonstrates that machine learning algorithms can be effectively used to enhance the performance of existing intrusion detection systems despite the high misclassification rates reported in the literature. This paper reports on an empirical investigation to determine the underlying causes of the poor performance of some of the well-known machine learning classifiers. Especially when learning from minor classes/attacks. The main factor is that the KDD cup 99 dataset, which is popularly used in most of the existing research, is an imbalanced dataset due to the nature of the specific intrusion detection domain, i.e. some attacks being rare and some being very frequent. Therefore, there is a significant imbalance amongst the classes in the dataset. Based on the number of the classes in the dataset, the imbalance dataset issue can be considered a binary problem or a multi-class problem. Most of the researchers focus on conducting a binary class classification as conducting a multi-class classification is complex. In the research proposed in this paper, we consider the problem as a multi-class classification task. The paper investigates the use of different machine learning algorithms in order to overcome the common misclassification problems that have been faced by researchers who used the imbalance KDD cup 99 dataset for their investigations. Recommendations are made as for which classifier is best for the classification of imbalanced data.

History

School

Science

Department

Computer Science

Published in

Future of Information and Communications Conference , FICC 2018

Citation

AL-MANDHARI, I., GUAN, L. and EDIRISINGHE, E.A., 2018. Investigating the effective use of machine learning algorithms in network intruder detection systems. IN: Arai, K., Kapoor, S. and Bhatia, R. (eds). Future of Information and Communication Conference (FICC 2018), Singapore, Singapore, 5-6 April 2018, pp.145-161.

Publisher

Version

VoR (Version of Record)

Publisher statement

This work is made available according to the conditions of the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0) licence. Full details of this licence are available at: https://creativecommons.org/licenses/by-nc-nd/4.0/

Publication date

2018

Notes

This conference paper is closed access.

DOI

https://doi.org/10.1007/978-3-030-03405-4_10

ISBN

9783030034047;9783030034054

Publisher version

https://doi.org/10.1007/978-3-030-03405-4_10

Book series

Advances in Intelligent Systems and Computing;887

Language

en

Location

Singapore

Administrator link

https://repository.lboro.ac.uk/account/articles/9400955

Usage metrics

Keywords

Cloud computing Data mining Intrusion detection Imbalanced dataset Information and Computing Sciences not elsewhere classified

Licence

CC BY-NC-ND 4.0

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

File(s) under permanent embargo

Investigating the effective use of machine learning algorithms in network intruder detection systems

History

School

Department

Published in

Citation

Publisher

Version

Publisher statement

Publication date

Notes

DOI

ISBN

Publisher version

Book series

Language

Location

Administrator link

Usage metrics

Categories

Keywords

Licence

Exports