Journal of Information Security Reserach ›› 2024, Vol. 10 ›› Issue (11): 1012-.

Previous Articles     Next Articles

Abnormal Traffic Detection in the Internet of Things Based on Imbalanced Data

Zhang Guanghua, Wang Ziyu, and Cai Mingwei   

  1. (School of Information Science and Engineering, Hebei University of Science Technology, Shijiazhuang 050018)
  • Online:2024-11-10 Published:2024-11-10

基于不平衡数据的物联网异常流量检测

张光华王子昱蔡明伟   

  1. (河北科技大学信息科学与工程学院石家庄050018)
  • 通讯作者: 张光华 博士,教授,CCF高级会员.主要研究方向为网络与信息安全、密码学. xian_software@163.com
  • 作者简介:张光华 博士,教授,CCF高级会员.主要研究方向为网络与信息安全、密码学. xian_software@163.com 王子昱 硕士研究生.主要研究方向为网络与信息安全、物联网安全. 909441858@qq.com 蔡明伟 硕士,实验师.主要研究方向为物联网系统与应用. 925330829@qq.com

Abstract: In order to deal with the problem of data category imbalance, which puts forward the low performance of the abnormal traffic detection model of the Internet of things, this paper proposes an abnormal traffic detection method based on category imbalance. Firstly, the Kmeans SMOTEENN algorithm based on MD (Mahalanobis distance) is used to generate noisefree data to effectively achieve balanced data sample distribution. Secondly, aiming at the low performance of the abnormal traffic detection model, a model combining the CNN (convolutional neural network) and the BiLSTM (Bidirectional long shortterm memory) is constructed. By extracting the local convolution features and key features of abnormal traffic. Finally, classification is performed through the fully connected layer and Softmax classifier. Experimental results show that compared with existing abnormal traffic detection methods, the proposed method achieved significant improvements in evaluation indicators such as accuracy, recall, precision and F1 value. The model can accurately identify abnormal behaviors in traffic with an accuracy rate as high as 99.43%.

Key words: IoT, abnormal traffic detection, deep learning, sampling algorithms, CNNBiLSTM

摘要: 为应对数据类别不平衡问题,导致物联网异常流量检测模型性能低下,提出一种基于不平衡数据的物联网异常流量检测方法.首先,采用基于马氏距离(Mahalanobis distance, MD)的Kmeans SMOTEENN算法生成无噪声的数据,以有效实现数据样本分布均衡.其次,针对异常流量检测模型性能低下,构建了卷积神经网络(convolutional neural network, CNN)和双向长短期记忆网络(bidirectional long shortterm memory, BiLSTM)相结合的模型,提取异常流量的局部卷积特征以及关键特征.最后,通过全连接层和分类器进行分类.实验结果显示,相较于现有异常流量检测方法,所提出的方法在准确率、召回率、精确率和F1值等评价指标上均取得显著提升.该模型能够准确识别流量中的异常行为,准确率高达99.43%.

关键词: 物联网, 异常流量, 深度学习, 采样算法, CNNBiLSTM

CLC Number: