基于ELMoTextCNN的网络欺凌检测模型

摘要/Abstract

摘要： 网络欺凌检测是网络空间信息内容安全的重要研究内容，也关乎青少年在线安全.针对目前网络欺凌检测方案存在的训练样本少、难以处理多义词、分类性能不太理想等问题，提出一种ELMoTextCNN检测模型.该模型首先采用迁移学习思想，利用预训练的ELMo(embeddings from language models)生成动态词向量，不仅解决了网络欺凌样本规模小的问题，而且由于ELMo采用了双向长短期记忆(bidirectional long shortterm memory, BiLSTM)网络结构，会根据上下文推断每个词对应的词向量，能够根据语境理解多义词.该模型再通过擅长处理短文本数据的TextCNN(text convolutional neural network)提取文本特征，最后经过全连接层输出分类结果.实验结果证明，提出的ELMoTextCNN检测方法能够处理一词多义，并获得更好的分类检测效果.

关键词: 网络欺凌检测, 深度学习, 迁移学习, ELMo模型, TextCNN模型

Abstract: Cyberbullying detection is an important research content on cyberspace information content security, and it is also related to youth online security. Aiming at the problems of few training samples, difficulty in processing polysemous words and unsatisfactory classification performance in current cyberbullying detection schemes, an ELMoTextCNN detection model is proposed. The model first adopts the idea of transfer learning and uses pretrained embeddings from language models (ELMo) to generate dynamic word vectors, which not only solves the problem of small cyberbullying sample size, but also because ELMo uses the bidirectional long shortterm memory (BiLSTM) network structure, it will infer the word vector corresponding to each word based on the context, and can understand polysemous words according to context. The model extracts text features through a text convolutional neural network (TextCNN), which is good at processing short text data, and finally outputs the classification results through a fully connected layer. Experimental results prove that the proposed ELMoTextCNN detection method can handle the ambiguity of a word and obtain better classification and detection results.

Key words: cyberbullying detection;deep learning, transfer learning ;ELMo model, TextCNN model

中图分类号:

TP391

叶水欢, 葛寅辉, 陈波, 于泠, . 基于ELMoTextCNN的网络欺凌检测模型[J]. 信息安全研究, 2023, 9(9): 868-.

参考文献

［1］Belsey B. What is cyberbullying［EBOL］. ［20211208］. https:cyberbullying.orgwhatiscyberbullying［2］Smith P K, Mahdavi J, Carvalho M, et al. Cyberbullying: Its nature and impact in secondary school pupils［J］. Journal of Child Psychology and Psychiatry, 2008, 49(4): 376385［3］共青团中央维护青少年权益部. 2020年全国未成年人互联网使用情况研究报告［EBOL］. (20210720)［20211212］. http:www.cnnic.cnhlwfzyjhlwxzbgqsnbg202107P020210720571098696248.pdf［4］曹文, 张香兰. 小学生网络欺凌现状及其对策——基于山东14所小学的调查［J］. 少年儿童研究, 2020, 33(6): 1623［5］黄天红. 联邦与州协同治理：美国防治青少年网络欺凌的立法实践［J］. 世界教育信息, 2021, 34(7): 7377［6］中华人民共和国教育部. 未成年人学校保护规定［EBOL］. (20210601)［20211212］. http:www.moe.gov.cnsrcsiteA02s5911moe_621202106t20210601_534640.html［7］Xu J M, Jun K S, Zhu X, et al. Learning from bullying traces in social media［C］ Proc of the 2012 Conf of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Stroudsburg, PA: ACL, 2012: 656666［8］Waseem Z, Hovy D. Hateful symbols or hateful people? Predictive features for hate speech detection on twitter［C］ Proc of the NAACL Student Research Workshop. Stroudsburg, PA: ACL, 2016: 8893［9］Bengio Y, Ducharme R, Vincent P, et al. A neural probabilistic language model［J］. Journal of Machine Learning Research, 2003, 3(6): 11371155［10］Mikolov T, Chen K, Corrado G, et al. Efficient estimation of word representations in vector space［J］. arXiv preprint, arXiv:1301.3781, 2013［11］Gada M, Damania K, Sankhe S. Cyberbullying detection using LSTMCNN architecture and its applications［C］ Proc of the 2021 Int Conf on Computer Communication and Informatics. Piscataway, NJ: IEEE, 2021: 16［12］Pennington J, Socher R, Manning C D. GloVe: Global vectors for word representation［C］ Proc of the 2014 Conf on Empirical Methods in Natural Language Processing. Stroudsburg, PA: ACL, 2014: 15321543［13］Banerjee V,Telavane J, Gaikwad P, et al. Detection of cyberbullying using deep neural network［C］ Proc of the 5th Int Conf on Advanced Computing & Communication Systems. Piscataway, NJ: IEEE, 2019: 604607［14］Peters M E, Neumann M, Iyyer M, et al. Deep contextualized word representation［C］ Proc of the 2018 Conf of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Stroudsburg, PA: ACL, 2018: 22272237［15］Zhao R, Zhou A, Mao K. Automatic detection of cyberbullying on social networks based on bullying features［C］ Proc of the 17th Int Conf on Distributed Computing and Networking. New York: ACM, 2016: 16［16］n E P, Yeniterzi R. Cyberbullying detection using deep learning and word embedding analysis［C］ Proc of the 28th Signal Processing and Communications Applications Conf. Piscataway, NJ: IEEE, 2020: 14［17］刘小乐, 方勇, 黄诚, 等. 基于深度图卷积神经网络的Exploit Kit攻击活动检测方法［J］. 信息安全研究, 2022, 8(7): 685693［18］Kim Y. Convolutional neural networks for sentence classification［C］ Proc of the 2014 Conf on Empirical Methods in Natural Language Processing. Stroudsburg, PA: ACL, 2014: 17461751［19］Zhang Xiang, Zhao Junbo, LeCun Y. Characterlevel convolutional networks for text classification［C］ Proc of the 28th Int Conf on Neural Information Processing Systems. Cambridge, MA: MIT Press, 2015: 649657［20］Le H T, Cerisara C, Denis A. Do convolutional networks need to be deep for text classification［C］ Proc of AAAI Workshop on Affective Content Analysis. Menlo Park, CA: AAAI, 2018: 2936［21］Huang Gao, Liu Zhuang, Maaten V, et al. Densely connected convolutional networks［C］ Proc of the 2017 IEEE Conf on Computer Vision and Pattern Recognition. Piscataway, NJ: IEEE, 2017: 770778［22］Laxmi S T,Rismala R, Nurrahmi H. Cyberbullying detection on Indonesian twitter using Doc2Vec and convolutional neural network［C］ Proc of the 9th Int Conf on Information and Communication Technology. Piscataway, NJ: IEEE, 2021: 8286［23］Agrawal S,Awekar A. Deep learning for detecting cyberbullying across multiple social media platforms［C］ Proc of the 2018 European Conf on Information Retrieval. Berlin: Springer, 2018: 141153［24］Mahat M. Detecting cyberbullying across multiple social media platforms using deep learning［C］ Proc of the 2021 Int Conf on Advance Computing and Innovative Technologies in Engineering. Piscataway, NJ: IEEE, 2021: 299301［25］何力, 郑灶贤, 项凤涛, 等. 基于深度学习的文本分类技术研究进展［J］. 计算机工程, 2021, 47(2): 111［26］Akhter M P, Zheng Jiangbin, Naqvi I R, et al. Abusive language detection from social media comments using conventional machine learning and deep learning approaches［J］. Multimedia Systems, 2021, 27(6): 16［27］Li Chen, Zhang Xu, Qaosar M, et al. Multifactor based stock price prediction using hybrid neural networks with attention mechanism［C］ Proc of the 2019 Dependable Autonomic and Secure Computing. Piscataway, NJ: IEEE, 2019: 961966［28］Reynolds K,Kontostathis A, Edwards L. Using machine learning to detect cyberbullying［C］ Proc of the 10th Int Conf on Machine Learning and Applications and Workshops. Piscataway, NJ: IEEE, 2011: 241244［29］Hosseinmardi H, Mattson S A, Rafiq R I, et al. Analyzing labeled cyberbullying incidents on the instagram social network［C］ Proc of the Int Conf on Social Informatics. Berlin: Springer, 2015: 4966［30］Apeksha K. Cyberbullying detection in tweets［EBOL］. ［20211002］. https:github.comapeksha104CyberbullyingDetectioninTweetsblobmasternew_cleaned_data.csv［31］DataTurks. Tweets dataset for detection of cyber trolls［EBOL］. ［20210827］. https:www.kaggle.comdataturksdatasetfordetectionofcybertrolls［32］AndrewZ. Cyberbullying detection bot［EBOL］. ［20210925］. https:github.comAndrewZeitlerCyberBullyingDetectionBottreemastercyberdata

[1]	王耀辉, 王可, 宫良一, 付豫豪, 王跃达, 李婧, . 基于异构图的恶意域名检测方法研究[J]. 信息安全研究, 2023, 9(E1): 38-.
[2]	郑丽娜, 杜彦辉, . 基于深度学习的HTTP慢速DoS攻击检测研究[J]. 信息安全研究, 2023, 9(E1): 72-.
[3]	王志强, 王姿旖, 王庆德, 徐华福, . 基于LightGBM的区块链异常交易检测技术研究[J]. 信息安全研究, 2023, 9(9): 877-.
[4]	李敬. 基于卷积神经网络的加密代理流量识别方法[J]. 信息安全研究, 2023, 9(8): 722-.
[5]	张鹏飞. 基于机器学习的入侵检测模型对比研究[J]. 信息安全研究, 2023, 9(8): 739-.
[6]	杜林, 许传淇. 基于BERT的漏洞文本特征分类技术研究[J]. 信息安全研究, 2023, 9(7): 687-.
[7]	蒋明, 张宗凯, 刘熙尧, 郭标, 胡家馨, 张硕, . 基于多注意力机制的孪生网络图像隐写分析方法[J]. 信息安全研究, 2023, 9(6): 573-.
[8]	刘亦纯, 张光华, 宿景芳. 基于多级度量差值的神经网络后门检测方法[J]. 信息安全研究, 2023, 9(6): 587-.
[9]	王志强, 都迎迎, 林雨衡, 陈旭东, . 基于文本关键词的对抗样本生成技术研究[J]. 信息安全研究, 2023, 9(4): 338-.
[10]	王桂江, 黄润才, 马诗语, 黄小刚, 王承茂. 微博截图中的用户观点定位方法研究[J]. 信息安全研究, 2022, 8(9): 908-.
[11]	王中华, 徐杰, 韩健, 臧天宁. 基于卷积神经网络的恶意区块链域名检测方法[J]. 信息安全研究, 2022, 8(8): 760-.
[12]	颜祺, 牛彦杰, 陈国友. 基于深度学习的信息高保密率传输方法[J]. 信息安全研究, 2022, 8(8): 793-.
[13]	周梓馨, 张功萱, 寇小勇, 杨威. 一种基于自注意力机制的深度学习侧信道攻击方法[J]. 信息安全研究, 2022, 8(8): 812-.
[14]	刘小乐, 方勇, 黄诚, 许益家. 基于深度图卷积神经网络的Exploit Kit攻击活动检测方法[J]. 信息安全研究, 2022, 8(7): 685-.
[15]	李晓明, 任琳琳, 王汝墨, 刘家译, 李忠林, 刘学君, 沙芸, 万园春. 油料储运工控系统业务安全数据集研究[J]. 信息安全研究, 2022, 8(6): 570-.