期刊文献+

基于操作注意力和数据增强的内部威胁检测 被引量:1

Insider threat detection based on operational attention and data augmentation
在线阅读 下载PDF
导出
摘要 内部威胁是组织中出现重大安全隐患的主要原因之一,也是一个长期的挑战。通过分析现有的内部威胁数据,指出内部威胁检测最大的挑战在于数据不平衡、有标注的威胁样本少。内部威胁检测的经典数据集CMU-C R4.2共有322万条日志数据,其中标记出的恶意操作日志仅7 423条;日志中的大多数操作类型与恶意行为无关,如泄露企业数据这一恶意行为仅与两种类型操作高度相关,而其余的40多种类型操作的日志则可能对检测造成干扰。针对这一挑战,设计了一种基于操作注意力和数据增强的数据处理框架。该框架首先对操作进行异常评估,对低异常评分的操作进行掩码操作,使模型更好地关注与恶意行为相关的操作,可以被认为是一种操作的硬注意力机制。通过分析内部威胁数据集的特点,设计了3种规则对恶意样本进行数据增强,以增加样本的多样性和缓解正负样本严重不平衡的问题。将有监督的内部威胁检测视作一个时序分类问题,在长短期记忆卷积神经网络(LSTM-FCN)模型中加入残差连接以实现多粒度的检测,并使用精确率、召回率等指标实施评估,要优于现有的基线模型;另外,在ITD-Bert、TextCNN等多种经典模型上实施基于操作注意力和数据增强的数据处理框架,结果表明所提方法能够有效提升内部威胁检测模型的性能。 In recent years,there has been an increased focus on the issue of insider threats.Insider threats are a major cause security breaches in organizations and pose an ongoing challenge.By analyzing the existing insider threat data,it was identified that the biggest challenge in insider threat detection lies in data imbalance and the limited number of labeled threat samples.In the Cert R4.2 dataset,which is a classic dataset for insider threat detection,there are over 3.22 million log data,but only 7,423 are marked as malicious operation logs.Furthermore,most of the operation types in the logs are not related to malicious behavior,and only two types of operations are highly correlated with malicious behavior,such as leaking company data,creating interference in the detection process.To address this challenge,a data processing framework was designed based on operational attention and data augmentation.Anomaly evaluation was first performed on operations by the framework,and operations with low anomaly scores were then masked.This makes the model better focus on operations related to malicious behavior,which can be considered as a hard attention mechanism for operations.Next,the characteristics of the insider threat dataset were analyzed,and three rules were designed for data augmentation on malicious samples to increase the diversity of samples and alleviate the substantial imbalance between positive and negative samples.Supervised insider threat detection was regarded as a time-series classification problem.Residual connections were added to the LSTM-FCN model to achieve multi-granularity detection,and indicators such as precision rate and recall rate were used to evaluate the model.The results indicate superior performance over existing baseline models.Moreover,the data processing framework was implemented on various classic models,such as ITD-Bert and TextCNN,and the results show that the methods effectively improve the performance of insider threat detection models.
作者 冯冠云 付才 吕建强 韩兰胜 FENG Guanyun;FU Cai;LYU Jianqiang;HAN Lansheng(Hubei Engineering Research Center on Big Data Security,Hubei Key Laboratory of Distributed System Security,Wuhan 430074,China;School of Cyber Science and Engineering,Huazhong University of Science and Technology,Wuhan 430074,China)
出处 《网络与信息安全学报》 2023年第3期102-112,共11页 Chinese Journal of Network and Information Security
基金 国家自然科学基金(62072200,62172176) 国家重点研发计划(2022YFB3103400)。
关键词 内部威胁检测 硬注意力 数据增强 神经网络 Insider threat detection hard attention data augmentation neural network
  • 相关文献

同被引文献5

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部