• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
成果搜索

author:

Li, Jieling (Li, Jieling.) [1] | Zhang, Hao (Zhang, Hao.) [2] (Scholars:张浩) | Liu, Yanhua (Liu, Yanhua.) [3] (Scholars:刘延华) | Liu, Zhihuang (Liu, Zhihuang.) [4]

Indexed by:

EI Scopus SCIE

Abstract:

Network intrusion detection plays an important role as tools for managing and identifying potential threats, which presents various challenges. Redundant features and difficult marking in data cause a long-term problem in network traffic detection. In this paper, we propose a semi-supervised machine learning framework based on multi-strategy feature filtering, principal component analysis (PCA), and an improved Tri-Light Gradient Boosting Machine (Tri-LightGBM) based on stratified sampling. This multi-strategy feature filtering method employing Fisher score and Information gain can select features that have good category discrimination and are more relevant to category labels. After that, we combine PCA to convert multiple features into comprehensive features, which are used as the input of the Tri-LightGBM model. Tri-LightGBM can exploit unlabeled data cooperatively and maintain a large disagreement among the base learners. Moreover, we propose a stratified sampling based on labeled categories to reduce the probability of being selected as the same category during the model update process. Thus, the Tri-LightGBM based on stratified sampling can compensate for the classification error rate caused by the imbalance of the dataset. The semi-supervised machine learning framework is evaluated on two intrusion detection evaluation datasets, namely UNSW-NB15 and CIC-IDS-2017. The evaluation results show that the multi-strategy feature filtering method can increase the accuracy, recall, precision, and F-measure by up to 0.5%, and reduce the false-positive rate by up to 0.5%. Furthermore, the precision rate of minority categories can be increased by about 1-2%.

Keyword:

Fisher score Information gain Network intrusion detection PCA Tri-LightGBM

Community:

  • [ 1 ] [Li, Jieling]Fuzhou Univ, Coll Comp & Data Sci, Fuzhou 350116, Peoples R China
  • [ 2 ] [Zhang, Hao]Fuzhou Univ, Coll Comp & Data Sci, Fuzhou 350116, Peoples R China
  • [ 3 ] [Liu, Yanhua]Fuzhou Univ, Coll Comp & Data Sci, Fuzhou 350116, Peoples R China
  • [ 4 ] [Liu, Zhihuang]Fuzhou Univ, Coll Comp & Data Sci, Fuzhou 350116, Peoples R China
  • [ 5 ] [Li, Jieling]Fuzhou Univ, Fujian Key Lab Network Comp & Intelligent Informa, Fuzhou 350116, Peoples R China
  • [ 6 ] [Zhang, Hao]Fuzhou Univ, Fujian Key Lab Network Comp & Intelligent Informa, Fuzhou 350116, Peoples R China
  • [ 7 ] [Liu, Yanhua]Fuzhou Univ, Fujian Key Lab Network Comp & Intelligent Informa, Fuzhou 350116, Peoples R China
  • [ 8 ] [Liu, Zhihuang]Fuzhou Univ, Fujian Key Lab Network Comp & Intelligent Informa, Fuzhou 350116, Peoples R China

Reprint 's Address:

  • 张浩

    [Zhang, Hao]Fuzhou Univ, Coll Comp & Data Sci, Fuzhou 350116, Peoples R China;;[Zhang, Hao]Fuzhou Univ, Fujian Key Lab Network Comp & Intelligent Informa, Fuzhou 350116, Peoples R China

Show more details

Related Keywords:

Source :

JOURNAL OF SUPERCOMPUTING

ISSN: 0920-8542

Year: 2022

Issue: 11

Volume: 78

Page: 13122-13144

3 . 3

JCR@2022

2 . 5 0 0

JCR@2023

ESI Discipline: COMPUTER SCIENCE;

ESI HC Threshold:61

JCR Journal Grade:2

CAS Journal Grade:3

Cited Count:

WoS CC Cited Count: 6

SCOPUS Cited Count: 9

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 0

Online/Total:126/10008996
Address:FZU Library(No.2 Xuyuan Road, Fuzhou, Fujian, PRC Post Code:350116) Contact Us:0591-22865326
Copyright:FZU Library Technical Support:Beijing Aegean Software Co., Ltd. 闽ICP备05005463号-1