基于操作注意力和数据增强的内部威胁检测被引量：1

Insider threat detection based on operational attention and data augmentation

下载PDF

导出

摘要内部威胁是组织中出现重大安全隐患的主要原因之一,也是一个长期的挑战。通过分析现有的内部威胁数据,指出内部威胁检测最大的挑战在于数据不平衡、有标注的威胁样本少。内部威胁检测的经典数据集CMU-C R4.2共有322万条日志数据,其中标记出的恶意操作日志仅7 423条;日志中的大多数操作类型与恶意行为无关,如泄露企业数据这一恶意行为仅与两种类型操作高度相关,而其余的40多种类型操作的日志则可能对检测造成干扰。针对这一挑战,设计了一种基于操作注意力和数据增强的数据处理框架。该框架首先对操作进行异常评估,对低异常评分的操作进行掩码操作,使模型更好地关注与恶意行为相关的操作,可以被认为是一种操作的硬注意力机制。通过分析内部威胁数据集的特点,设计了3种规则对恶意样本进行数据增强,以增加样本的多样性和缓解正负样本严重不平衡的问题。将有监督的内部威胁检测视作一个时序分类问题,在长短期记忆卷积神经网络(LSTM-FCN)模型中加入残差连接以实现多粒度的检测,并使用精确率、召回率等指标实施评估,要优于现有的基线模型;另外,在ITD-Bert、TextCNN等多种经典模型上实施基于操作注意力和数据增强的数据处理框架,结果表明所提方法能够有效提升内部威胁检测模型的性能。 In recent years,there has been an increased focus on the issue of insider threats.Insider threats are a major cause security breaches in organizations and pose an ongoing challenge.By analyzing the existing insider threat data,it was identified that the biggest challenge in insider threat detection lies in data imbalance and the limited number of labeled threat samples.In the Cert R4.2 dataset,which is a classic dataset for insider threat detection,there are over 3.22 million log data,but only 7,423 are marked as malicious operation logs.Furthermore,most of the operation types in the logs are not related to malicious behavior,and only two types of operations are highly correlated with malicious behavior,such as leaking company data,creating interference in the detection process.To address this challenge,a data processing framework was designed based on operational attention and data augmentation.Anomaly evaluation was first performed on operations by the framework,and operations with low anomaly scores were then masked.This makes the model better focus on operations related to malicious behavior,which can be considered as a hard attention mechanism for operations.Next,the characteristics of the insider threat dataset were analyzed,and three rules were designed for data augmentation on malicious samples to increase the diversity of samples and alleviate the substantial imbalance between positive and negative samples.Supervised insider threat detection was regarded as a time-series classification problem.Residual connections were added to the LSTM-FCN model to achieve multi-granularity detection,and indicators such as precision rate and recall rate were used to evaluate the model.The results indicate superior performance over existing baseline models.Moreover,the data processing framework was implemented on various classic models,such as ITD-Bert and TextCNN,and the results show that the methods effectively improve the performance of insider threat detection models.

作者冯冠云付才吕建强韩兰胜 FENG Guanyun;FU Cai;LYU Jianqiang;HAN Lansheng(Hubei Engineering Research Center on Big Data Security,Hubei Key Laboratory of Distributed System Security,Wuhan 430074,China;School of Cyber Science and Engineering,Huazhong University of Science and Technology,Wuhan 430074,China)

机构地区分布式系统安全湖北省重点实验室华中科技大学网络空间安全学院

出处《网络与信息安全学报》 2023年第3期102-112,共11页 Chinese Journal of Network and Information Security

基金国家自然科学基金(62072200,62172176) 国家重点研发计划(2022YFB3103400)。

关键词内部威胁检测硬注意力数据增强神经网络 Insider threat detection hard attention data augmentation neural network

分类号 TP309.2 [自动化与计算机技术—计算机系统结构]

引文网络
相关文献

同被引文献5

1张光华,闫风如,张冬雯,刘雪峰.基于LSTM-Attention的内部威胁检测模型[J].信息网络安全,2022(2):1-10. 被引量：6
2郭世泽,张磊,潘雨,陶蔚,白玮,郑奇斌,刘艺,潘志松.内部威胁发现检测方法研究综述[J].数据采集与处理,2022,37(3):488-501. 被引量：1
3邵志鹏,李伟伟,周诚.基于机器学习的电力网络安全检测技术研究[J].自动化与仪表,2022,37(9):104-108. 被引量：7
4孙小双,王宇.基于自动编码器的内部威胁检测技术[J].计算机工程与设计,2022,43(10):2725-2730. 被引量：1
5张安勤,王小慧.基于时序异常检测的动力电池安全预警[J].计算机应用,2023,43(12):3799-3805. 被引量：1

引证文献1

1刘爱国.基于异常检测的内部威胁识别系统开发[J].移动信息,2024,46(5):152-154.

1邹芸.综合护理干预在神经外科高血压脑出血患者护理中的效果[J].中文科技期刊数据库（全文版）医药卫生,2023(6):156-159.
2曲长新.无痛肠镜下钛夹联合高频电凝电切术治疗大肠息肉的临床效果研究[J].中国科技期刊数据库医药,2023(7):75-78.
3李言曼,李绍斌,屈金燕,刘留.基于概率校准平衡随机森林算法的轨道电路故障诊断方法[J].现代电子技术,2023,46(13):176-182. 被引量：5
4刘卫明.优质护理在小儿肺炎中的应用分析[J].中文科技期刊数据库（引文版）医药卫生,2023(6):139-142.
5张钰,邵若洋,刘启发.FLT3突变急性髓系白血病的全程管理[J].临床血液学杂志,2023,36(5):303-308. 被引量：3
6唐芳.“多规合一”背景下的空间规划实施评估指标体系构建实证研究[J].中文科技期刊数据库（全文版）社会科学,2023(7):103-107.
7张嘉鑫,常金烁,牛思雨,陈嘉一,孙秋雨,程姝媛,梁帅.关于河北省自建房安全问题及建议[J].管理科学与工程,2023,12(3):259-265.
8姜卫,杨春侠,崔鸿知.基于随机子空间法的风电塔筒模态参数识别研究[J].能源与节能,2023(6):8-13. 被引量：1
9黄雪莲,潘哲.面向单元的国土空间规划全周期管控平台设计与实现[J].中文科技期刊数据库（全文版）自然科学,2023(6):69-74.
10刘欣逸,宁博,王明,杨超,商迪,李冠宇.基于句法增强的细粒度情感三元组抽取方法[J].计算机研究与发展,2023,60(7):1649-1660. 被引量：7

网络与信息安全学报

2023年第3期

浏览历史

内容加载中请稍等...

基于操作注意力和数据增强的内部威胁检测被引量：1

同被引文献5

引证文献1

相关作者

相关机构

相关主题

浏览历史

基于操作注意力和数据增强的内部威胁检测 被引量：1

同被引文献5

引证文献1

相关作者

相关机构

相关主题

浏览历史

基于操作注意力和数据增强的内部威胁检测被引量：1