Abstract
Text classification is one of the key tasks in natural language processing and is widely used in related applications such as question answering, recommendation, and sentiment analysis. To extract complex semantic features from text data and to capture global graph information, this paper proposes a text classification model that fuses graph embeddings with BERT (bidirectional encoder representations from Transformers) embeddings. The model introduces a dual-level attention mechanism that weighs both the importance of different node types and the importance of different neighboring nodes of the same type, and it employs a pretrained BERT model to obtain context-dependent embeddings that resolve polysemy. Treating all words and documents as nodes, the model builds a single heterogeneous graph for the entire corpus, thereby recasting text classification as node classification. The dual-level attention mechanism, comprising type-level attention and node-level attention, is integrated into a graph convolutional network: type-level attention captures how important each type of node is to a given node, while node-level attention captures how important each neighboring node of the same type is to that node. Finally, the local semantic information extracted by BERT is combined with the globally informed graph embedding produced by the graph convolutional network to obtain the final document representation, which is used to complete the classification. Comparative experiments against seven baseline models on four widely used public datasets show that the proposed model improves text classification accuracy.
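To make the mechanism concrete, the following is a minimal PyTorch sketch of one dual-level attention layer over a two-type (word/document) heterogeneous graph. Everything here (class, parameter, and variable names, the dense adjacency matrices, the mean-pooled type embedding) is an illustrative assumption for exposition, not the authors' implementation.

import torch
import torch.nn as nn
import torch.nn.functional as F

class DualLevelAttention(nn.Module):
    def __init__(self, in_dim, out_dim, node_types=("word", "doc")):
        super().__init__()
        self.node_types = node_types
        # per-type projections into a shared space
        self.W = nn.ModuleDict({t: nn.Linear(in_dim, out_dim, bias=False)
                                for t in node_types})
        # type-level attention: importance of each neighbor *type*
        self.type_attn = nn.Linear(2 * out_dim, 1, bias=False)
        # node-level attention: importance of each neighbor *node*
        self.node_attn = nn.Linear(2 * out_dim, 1, bias=False)

    def forward(self, target_type, H, A):
        # H[t]: (M_t, in_dim) features of all type-t nodes
        # A[t]: (N, M_t) 0/1 adjacency from the N target nodes to type-t nodes
        z = self.W[target_type](H[target_type])               # (N, d) targets
        type_scores, messages = [], []
        for t in self.node_types:
            z_nb = self.W[t](H[t])                            # (M_t, d)
            deg = A[t].sum(-1, keepdim=True).clamp(min=1.0)
            type_emb = (A[t] @ z_nb) / deg                    # (N, d) mean over type-t neighbors
            type_scores.append(self.type_attn(torch.cat([z, type_emb], -1)))
            # node-level scores, restricted to actual edges
            pair = torch.cat([z.unsqueeze(1).expand(-1, z_nb.size(0), -1),
                              z_nb.unsqueeze(0).expand(z.size(0), -1, -1)], -1)
            e = F.leaky_relu(self.node_attn(pair)).squeeze(-1)  # (N, M_t)
            e = e.masked_fill(A[t] == 0, float("-inf"))
            alpha = torch.softmax(e, dim=-1).nan_to_num()     # rows with no type-t neighbors -> 0
            messages.append(alpha @ z_nb)                     # (N, d)
        beta = torch.softmax(torch.cat(type_scores, -1), dim=-1)  # (N, num_types)
        out = sum(beta[:, i:i + 1] * m for i, m in enumerate(messages))
        return F.elu(out)

# toy usage: 4 documents, 10 words, 16-dim input features
H = {"word": torch.randn(10, 16), "doc": torch.randn(4, 16)}
A = {"word": (torch.rand(4, 10) > 0.5).float(), "doc": torch.eye(4)}
doc_emb = DualLevelAttention(16, 8)("doc", H, A)              # (4, 8)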
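The final fusion step can be sketched in the same spirit. Below, the BERT pooled [CLS] vector stands in for the local semantics, and a scalar interpolation weight lam combines it with the GCN document embedding; lam, the projection layer, and interpolation itself (rather than, say, concatenation) are assumptions, not the paper's confirmed design.

import torch
import torch.nn as nn
from transformers import BertModel

class BertGraphFusion(nn.Module):
    def __init__(self, graph_dim, num_classes, lam=0.7,
                 bert_name="bert-base-uncased"):
        super().__init__()
        self.bert = BertModel.from_pretrained(bert_name)
        # map BERT's hidden size onto the graph embedding dimension
        self.proj = nn.Linear(self.bert.config.hidden_size, graph_dim)
        self.classifier = nn.Linear(graph_dim, num_classes)
        self.lam = lam  # weight on the graph branch (a tunable assumption)

    def forward(self, input_ids, attention_mask, graph_emb):
        # graph_emb: (B, graph_dim) document embeddings from the GCN branch
        out = self.bert(input_ids=input_ids, attention_mask=attention_mask)
        local = self.proj(out.pooler_output)       # (B, graph_dim) local semantics
        fused = self.lam * graph_emb + (1 - self.lam) * local
        return self.classifier(fused)              # (B, num_classes) logits

In training, graph_emb would come from a stack of layers like DualLevelAttention above, with the whole model optimized end-to-end under a cross-entropy loss.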
Authors
CHANG Huixia (常慧霞), LI Xiaozhong (李孝忠)
College of Artificial Intelligence, Tianjin University of Science and Technology, Tianjin 300457, China
Source
Journal of Tianjin University of Science & Technology (《天津科技大学学报》), 2025, Issue 1, pp. 72-80 (9 pages)
Keywords
text classification
graph convolutional neural network
attention mechanism
BERT