期刊文献+
共找到510篇文章
< 1 2 26 >
每页显示 20 50 100
Effects of data smoothing and recurrent neural network(RNN)algorithms for real-time forecasting of tunnel boring machine(TBM)performance 被引量:1
1
作者 Feng Shan Xuzhen He +1 位作者 Danial Jahed Armaghani Daichao Sheng 《Journal of Rock Mechanics and Geotechnical Engineering》 SCIE CSCD 2024年第5期1538-1551,共14页
Tunnel boring machines(TBMs)have been widely utilised in tunnel construction due to their high efficiency and reliability.Accurately predicting TBM performance can improve project time management,cost control,and risk... Tunnel boring machines(TBMs)have been widely utilised in tunnel construction due to their high efficiency and reliability.Accurately predicting TBM performance can improve project time management,cost control,and risk management.This study aims to use deep learning to develop real-time models for predicting the penetration rate(PR).The models are built using data from the Changsha metro project,and their performances are evaluated using unseen data from the Zhengzhou Metro project.In one-step forecast,the predicted penetration rate follows the trend of the measured penetration rate in both training and testing.The autoregressive integrated moving average(ARIMA)model is compared with the recurrent neural network(RNN)model.The results show that univariate models,which only consider historical penetration rate itself,perform better than multivariate models that take into account multiple geological and operational parameters(GEO and OP).Next,an RNN variant combining time series of penetration rate with the last-step geological and operational parameters is developed,and it performs better than other models.A sensitivity analysis shows that the penetration rate is the most important parameter,while other parameters have a smaller impact on time series forecasting.It is also found that smoothed data are easier to predict with high accuracy.Nevertheless,over-simplified data can lose real characteristics in time series.In conclusion,the RNN variant can accurately predict the next-step penetration rate,and data smoothing is crucial in time series forecasting.This study provides practical guidance for TBM performance forecasting in practical engineering. 展开更多
关键词 Tunnel boring machine(TBM) Penetration rate(PR) Time series forecasting recurrent neural network(RNN)
在线阅读 下载PDF
Optimized Phishing Detection with Recurrent Neural Network and Whale Optimizer Algorithm
2
作者 Brij Bhooshan Gupta Akshat Gaurav +3 位作者 Razaz Waheeb Attar Varsha Arya Ahmed Alhomoud Kwok Tai Chui 《Computers, Materials & Continua》 SCIE EI 2024年第9期4895-4916,共22页
Phishing attacks present a persistent and evolving threat in the cybersecurity land-scape,necessitating the development of more sophisticated detection methods.Traditional machine learning approaches to phishing detec... Phishing attacks present a persistent and evolving threat in the cybersecurity land-scape,necessitating the development of more sophisticated detection methods.Traditional machine learning approaches to phishing detection have relied heavily on feature engineering and have often fallen short in adapting to the dynamically changing patterns of phishingUniformResource Locator(URLs).Addressing these challenge,we introduce a framework that integrates the sequential data processing strengths of a Recurrent Neural Network(RNN)with the hyperparameter optimization prowess of theWhale Optimization Algorithm(WOA).Ourmodel capitalizes on an extensive Kaggle dataset,featuring over 11,000 URLs,each delineated by 30 attributes.The WOA’s hyperparameter optimization enhances the RNN’s performance,evidenced by a meticulous validation process.The results,encapsulated in precision,recall,and F1-score metrics,surpass baseline models,achieving an overall accuracy of 92%.This study not only demonstrates the RNN’s proficiency in learning complex patterns but also underscores the WOA’s effectiveness in refining machine learning models for the critical task of phishing detection. 展开更多
关键词 Phishing detection recurrent neural network(RNN) Whale Optimization Algorithm(WOA) CYBERSECURITY machine learning optimization
在线阅读 下载PDF
Optimizing the Clinical Decision Support System (CDSS) by Using Recurrent Neural Network (RNN) Language Models for Real-Time Medical Query Processing
3
作者 Israa Ibraheem Al Barazanchi Wahidah Hashim +4 位作者 Reema Thabit Mashary Nawwaf Alrasheedy Abeer Aljohan Jongwoon Park Byoungchol Chang 《Computers, Materials & Continua》 SCIE EI 2024年第12期4787-4832,共46页
This research aims to enhance Clinical Decision Support Systems(CDSS)within Wireless Body Area Networks(WBANs)by leveraging advanced machine learning techniques.Specifically,we target the challenges of accurate diagno... This research aims to enhance Clinical Decision Support Systems(CDSS)within Wireless Body Area Networks(WBANs)by leveraging advanced machine learning techniques.Specifically,we target the challenges of accurate diagnosis in medical imaging and sequential data analysis using Recurrent Neural Networks(RNNs)with Long Short-Term Memory(LSTM)layers and echo state cells.These models are tailored to improve diagnostic precision,particularly for conditions like rotator cuff tears in osteoporosis patients and gastrointestinal diseases.Traditional diagnostic methods and existing CDSS frameworks often fall short in managing complex,sequential medical data,struggling with long-term dependencies and data imbalances,resulting in suboptimal accuracy and delayed decisions.Our goal is to develop Artificial Intelligence(AI)models that address these shortcomings,offering robust,real-time diagnostic support.We propose a hybrid RNN model that integrates SimpleRNN,LSTM layers,and echo state cells to manage long-term dependencies effectively.Additionally,we introduce CG-Net,a novel Convolutional Neural Network(CNN)framework for gastrointestinal disease classification,which outperforms traditional CNN models.We further enhance model performance through data augmentation and transfer learning,improving generalization and robustness against data scarcity and imbalance.Comprehensive validation,including 5-fold cross-validation and metrics such as accuracy,precision,recall,F1-score,and Area Under the Curve(AUC),confirms the models’reliability.Moreover,SHapley Additive exPlanations(SHAP)and Local Interpretable Model-agnostic Explanations(LIME)are employed to improve model interpretability.Our findings show that the proposed models significantly enhance diagnostic accuracy and efficiency,offering substantial advancements in WBANs and CDSS. 展开更多
关键词 Computer science clinical decision support system(CDSS) medical queries healthcare deep learning recurrent neural network(RNN) long short-term memory(LSTM)
在线阅读 下载PDF
An INS/GNSS integrated navigation in GNSS denied environment using recurrent neural network 被引量:14
4
作者 Hai-fa Dai Hong-wei Bian +1 位作者 Rong-ying Wang Heng Ma 《Defence Technology(防务技术)》 SCIE EI CAS CSCD 2020年第2期334-340,共7页
In view of the failure of GNSS signals,this paper proposes an INS/GNSS integrated navigation method based on the recurrent neural network(RNN).This proposed method utilizes the calculation principle of INS and the mem... In view of the failure of GNSS signals,this paper proposes an INS/GNSS integrated navigation method based on the recurrent neural network(RNN).This proposed method utilizes the calculation principle of INS and the memory function of the RNN to estimate the errors of the INS,thereby obtaining a continuous,reliable and high-precision navigation solution.The performance of the proposed method is firstly demonstrated using an INS/GNSS simulation environment.Subsequently,an experimental test on boat is also conducted to validate the performance of the method.The results show a promising application prospect for RNN in the field of positioning for INS/GNSS integrated navigation in the absence of GNSS signal,as it outperforms extreme learning machine(ELM)and EKF by approximately 30%and 60%,respectively. 展开更多
关键词 INERTIAL NAVIGATION system(INS) Global NAVIGATION satellite system(GNSS) Integrated NAVIGATION recurrent neural network(RNN)
在线阅读 下载PDF
Remaining Useful Life Prediction for a Roller in a Hot Strip Mill Based on Deep Recurrent Neural Networks 被引量:11
5
作者 Ruihua Jiao Kaixiang Peng Jie Dong 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2021年第7期1345-1354,共10页
Accurate estimation of the remaining useful life(RUL)and health state for rollers is of great significance to hot rolling production.It can provide decision support for roller management so as to improve the productiv... Accurate estimation of the remaining useful life(RUL)and health state for rollers is of great significance to hot rolling production.It can provide decision support for roller management so as to improve the productivity of the hot rolling process.In addition,the RUL prediction for rollers is helpful in transitioning from the current regular maintenance strategy to conditional-based maintenance.Therefore,a new method that can extract coarse-grained and fine-grained features from batch data to predict the RUL of the rollers is proposed in this paper.Firstly,a new deep learning network architecture based on recurrent neural networks that can make full use of the extracted coarsegrained fine-grained features to estimate the heath indicator(HI)is developed,where the HI is able to indicate the health state of the roller.Following that,a state-space model is constructed to describe the HI,and the probabilistic distribution of RUL can be estimated by extrapolating the HI degradation model to a predefined failure threshold.Finally,application to a hot strip mill is given to verify the effectiveness of the proposed methods using data collected from an industrial site,and the relatively low RMSE and MAE values demonstrate its advantages compared with some other popular deep learning methods. 展开更多
关键词 Hot strip mill prognostics and health management(PHM) recurrent neural network(RNN) remaining useful life(RUL) roller management.
在线阅读 下载PDF
New Stability Criteria for Recurrent Neural Networks with a Time-varying Delay 被引量:2
6
作者 Hong-Bing Zeng Shen-Ping Xiao Bin Liu 《International Journal of Automation and computing》 EI 2011年第1期128-133,共6页
This paper deals with the stability of static recurrent neural networks (RNNs) with a time-varying delay. An augmented Lyapunov-Krasovskii functional is employed, in which some useful terms are included. Furthermore... This paper deals with the stability of static recurrent neural networks (RNNs) with a time-varying delay. An augmented Lyapunov-Krasovskii functional is employed, in which some useful terms are included. Furthermore, the relationship among the timevarying delay, its upper bound and their difierence, is taken into account, and novel bounding techniques for 1- τ(t) are employed. As a result, without ignoring any useful term in the derivative of the Lyapunov-Krasovskii functional, the resulting delay-dependent criteria show less conservative than the existing ones. Finally, a numerical example is given to demonstrate the effectiveness of the proposed methods. 展开更多
关键词 STABILITY recurrent neural networks rnns time-varying delay DELAY-DEPENDENT augmented Lyapunov-Krasovskii functional.
在线阅读 下载PDF
Robust exponential stability analysis of a larger class of discrete-time recurrent neural networks 被引量:1
7
作者 ZHANG Jian-hai ZHANG Sen-lin LIU Mei-qin 《Journal of Zhejiang University-Science A(Applied Physics & Engineering)》 SCIE EI CAS CSCD 2007年第12期1912-1920,共9页
The robust exponential stability of a larger class of discrete-time recurrent neural networks (RNNs) is explored in this paper. A novel neural network model, named standard neural network model (SNNM), is introduced t... The robust exponential stability of a larger class of discrete-time recurrent neural networks (RNNs) is explored in this paper. A novel neural network model, named standard neural network model (SNNM), is introduced to provide a general framework for stability analysis of RNNs. Most of the existing RNNs can be transformed into SNNMs to be analyzed in a unified way. Applying Lyapunov stability theory method and S-Procedure technique, two useful criteria of robust exponential stability for the discrete-time SNNMs are derived. The conditions presented are formulated as linear matrix inequalities (LMIs) to be easily solved using existing efficient convex optimization techniques. An example is presented to demonstrate the transformation procedure and the effectiveness of the results. 展开更多
关键词 Standard neural network model (SNNM) Robust exponential stability recurrent neural networks rnns DISCRETE-TIME Time-delay system Linear matrix inequality (LMI)
在线阅读 下载PDF
Global stability of interval recurrent neural networks 被引量:1
8
作者 袁铸钢 刘志远 +1 位作者 裴润 申涛 《Journal of Beijing Institute of Technology》 EI CAS 2012年第3期382-386,共5页
The robust global exponential stability of a class of interval recurrent neural networks(RNNs) is studied,and a new robust stability criterion is obtained in the form of linear matrix inequality.The problem of robus... The robust global exponential stability of a class of interval recurrent neural networks(RNNs) is studied,and a new robust stability criterion is obtained in the form of linear matrix inequality.The problem of robust stability of interval RNNs is transformed into a problem of solving a class of linear matrix inequalities.Thus,the robust stability of interval RNNs can be analyzed by directly using the linear matrix inequalities(LMI) toolbox of MATLAB.Numerical example is given to show the effectiveness of the obtained results. 展开更多
关键词 recurrent neural networks(rnns interval systems linear matrix inequalities(LMI) global exponential stability
在线阅读 下载PDF
ENDPOINT DETECTOR OF NOISY SPEECH SIGNAL USING A RECURRENT NEURAL NETWORK
9
作者 韦晓东 胡光锐 《Journal of Shanghai Jiaotong university(Science)》 EI 1999年第1期60-63,共4页
IntroductionEndpointdetectionofspeechsignalisimportantinmanyareasofspeechprocessingtechnology,suchasspeechen... IntroductionEndpointdetectionofspeechsignalisimportantinmanyareasofspeechprocessingtechnology,suchasspeechenhancement,speechr... 展开更多
关键词 SPEECH ENDPOINT detection recurrent neural network(RNN) immunity learning
在线阅读 下载PDF
A Prediction Method of Trend-Type Capacity Index Based on Recurrent Neural Network
10
作者 Wenxiao Wang Xiaoyu Li +2 位作者 Yin Ding Feizhou Wu Shan Yang 《Journal of Quantum Computing》 2021年第1期25-33,共9页
Due to the increase in the types of business and equipment in telecommunications companies,the performance index data collected in the operation and maintenance process varies greatly.The diversity of index data makes... Due to the increase in the types of business and equipment in telecommunications companies,the performance index data collected in the operation and maintenance process varies greatly.The diversity of index data makes it very difficult to perform high-precision capacity prediction.In order to improve the forecasting efficiency of related indexes,this paper designs a classification method of capacity index data,which divides the capacity index data into trend type,periodic type and irregular type.Then for the prediction of trend data,it proposes a capacity index prediction model based on Recurrent Neural Network(RNN),denoted as RNN-LSTM-LSTM.This model includes a basic RNN,two Long Short-Term Memory(LSTM)networks and two Fully Connected layers.The experimental results show that,compared with the traditional Holt-Winters,Autoregressive Integrated Moving Average(ARIMA)and Back Propagation(BP)neural network prediction model,the mean square error(MSE)of the proposed RNN-LSTM-LSTM model are reduced by 11.82%and 20.34%on the order storage and data migration,which has greatly improved the efficiency of trend-type capacity index prediction. 展开更多
关键词 recurrent neural network(RNN) Long Short-Term Memory(LSTM)network capacity prediction
在线阅读 下载PDF
Hyperparameter Tuning for Deep Neural Networks Based Optimization Algorithm 被引量:3
11
作者 D.Vidyabharathi V.Mohanraj 《Intelligent Automation & Soft Computing》 SCIE 2023年第6期2559-2573,共15页
For training the present Neural Network(NN)models,the standard technique is to utilize decaying Learning Rates(LR).While the majority of these techniques commence with a large LR,they will decay multiple times over ti... For training the present Neural Network(NN)models,the standard technique is to utilize decaying Learning Rates(LR).While the majority of these techniques commence with a large LR,they will decay multiple times over time.Decaying has been proved to enhance generalization as well as optimization.Other parameters,such as the network’s size,the number of hidden layers,drop-outs to avoid overfitting,batch size,and so on,are solely based on heuristics.This work has proposed Adaptive Teaching Learning Based(ATLB)Heuristic to identify the optimal hyperparameters for diverse networks.Here we consider three architec-tures Recurrent Neural Networks(RNN),Long Short Term Memory(LSTM),Bidirectional Long Short Term Memory(BiLSTM)of Deep Neural Networks for classification.The evaluation of the proposed ATLB is done through the various learning rate schedulers Cyclical Learning Rate(CLR),Hyperbolic Tangent Decay(HTD),and Toggle between Hyperbolic Tangent Decay and Triangular mode with Restarts(T-HTR)techniques.Experimental results have shown the performance improvement on the 20Newsgroup,Reuters Newswire and IMDB dataset. 展开更多
关键词 Deep learning deep neural network(DNN) learning rates(LR) recurrent neural network(RNN) cyclical learning rate(CLR) hyperbolic tangent decay(HTD) toggle between hyperbolic tangent decay and triangular mode with restarts(T-HTR) teaching learning based optimization(TLBO)
在线阅读 下载PDF
Multimodal emotion recognition based on deep neural network 被引量:1
12
作者 Ye Jiayin Zheng Wenming +2 位作者 Li Yang Cai Youyi Cui Zhen 《Journal of Southeast University(English Edition)》 EI CAS 2017年第4期444-447,共4页
In order to increase the accuracy rate of emotion recognition in voiceand video,the mixed convolutional neural network(CNN)and recurrent neural network(RNN)ae used to encode and integrate the two information sources.F... In order to increase the accuracy rate of emotion recognition in voiceand video,the mixed convolutional neural network(CNN)and recurrent neural network(RNN)ae used to encode and integrate the two information sources.For the audio signals,several frequency bands as well as some energy functions are extacted as low-level features by using a sophisticated audio technique,and then they are encoded w it a one-dimensional(I D)convolutional neural network to abstact high-level features.Finally,tiese are fed into a recurrent neural network for te sake of capturing dynamic tone changes in a temporal dimensionality.As a contrast,a two-dimensional(2D)convolutional neural network and a similar RNN are used to capture dynamic facial appearance changes of temporal sequences.The method was used in te Chinese Natral Audio-'Visual Emotion Database in te Chinese Conference on Pattern Recognition(CCPR)in2016.Experimental results demonstrate that te classification average precision of the proposed metiod is41.15%,which is increased by16.62%compaed with te baseline algorithm offered by the CCPR in2016.It is proved ta t te proposed method has higher accuracy in te identification of emotional information. 展开更多
关键词 emotion recognition convolutional neural network ( CNN) recurrent neural networks ( RNN)
在线阅读 下载PDF
Seismic-inversion method for nonlinear mapping multilevel well–seismic matching based on bidirectional long short-term memory networks
13
作者 Yue You-Xi Wu Jia-Wei Chen Yi-Du 《Applied Geophysics》 SCIE CSCD 2022年第2期244-257,308,共15页
In this paper,the recurrent neural network structure of a bidirectional long shortterm memory network(Bi-LSTM)with special memory cells that store information is used to characterize the deep features of the variation... In this paper,the recurrent neural network structure of a bidirectional long shortterm memory network(Bi-LSTM)with special memory cells that store information is used to characterize the deep features of the variation pattern between logging and seismic data.A mapping relationship model between high-frequency logging data and low-frequency seismic data is established via nonlinear mapping.The seismic waveform is infinitely approximated using the logging curve in the low-frequency band to obtain a nonlinear mapping model of this scale,which then stepwise approach the logging curve in the high-frequency band.Finally,a seismic-inversion method of nonlinear mapping multilevel well–seismic matching based on the Bi-LSTM network is developed.The characteristic of this method is that by applying the multilevel well–seismic matching process,the seismic data are stepwise matched to the scale range that is consistent with the logging curve.Further,the matching operator at each level can be stably obtained to effectively overcome the problems that occur in the well–seismic matching process,such as the inconsistency in the scale of two types of data,accuracy in extracting the seismic wavelet of the well-side seismic traces,and multiplicity of solutions.Model test and practical application demonstrate that this method improves the vertical resolution of inversion results,and at the same time,the boundary and the lateral characteristics of the sand body are well maintained to improve the accuracy of thin-layer sand body prediction and achieve an improved practical application effect. 展开更多
关键词 bidirectional recurrent neural networks long short-term memory nonlinear mapping well–seismic matching seismic inversion
在线阅读 下载PDF
Lightweight and highly robust memristor-based hybrid neural networks for electroencephalogram signal processing
14
作者 童霈文 徐晖 +5 位作者 孙毅 汪泳州 彭杰 廖岑 王伟 李清江 《Chinese Physics B》 SCIE EI CAS CSCD 2023年第7期582-590,共9页
Memristor-based neuromorphic computing shows great potential for high-speed and high-throughput signal processing applications,such as electroencephalogram(EEG)signal processing.Nonetheless,the size of one-transistor ... Memristor-based neuromorphic computing shows great potential for high-speed and high-throughput signal processing applications,such as electroencephalogram(EEG)signal processing.Nonetheless,the size of one-transistor one-resistor(1T1R)memristor arrays is limited by the non-ideality of the devices,which prevents the hardware implementation of large and complex networks.In this work,we propose the depthwise separable convolution and bidirectional gate recurrent unit(DSC-BiGRU)network,a lightweight and highly robust hybrid neural network based on 1T1R arrays that enables efficient processing of EEG signals in the temporal,frequency and spatial domains by hybridizing DSC and BiGRU blocks.The network size is reduced and the network robustness is improved while ensuring the network classification accuracy.In the simulation,the measured non-idealities of the 1T1R array are brought into the network through statistical analysis.Compared with traditional convolutional networks,the network parameters are reduced by 95%and the network classification accuracy is improved by 21%at a 95%array yield rate and 5%tolerable error.This work demonstrates that lightweight and highly robust networks based on memristor arrays hold great promise for applications that rely on low consumption and high efficiency. 展开更多
关键词 MEMRISTOR LIGHTWEIGHT ROBUST hybrid neural networks depthwise separable convolution bidirectional gate recurrent unit(BiGRU) one-transistor one-resistor(1T1R)arrays
在线阅读 下载PDF
基于多空间维度联合方法改进的BiLSTM出水氨氮预测方法
15
作者 王雷 张煜 +3 位作者 赵艺琨 刘明勇 刘子航 李杰 《中国农村水利水电》 北大核心 2025年第2期17-24,共8页
出水氨氮作为衡量污水处理厂水质处理工艺的重要指标之一,准确预测污水处理厂出水水质中的氨氮含量对于及时调整处理工艺,保障水环境安全有着重要的作用。提出了一种基于联合多空间维度(Multi-spatial Dimensional Cooperative Attenti... 出水氨氮作为衡量污水处理厂水质处理工艺的重要指标之一,准确预测污水处理厂出水水质中的氨氮含量对于及时调整处理工艺,保障水环境安全有着重要的作用。提出了一种基于联合多空间维度(Multi-spatial Dimensional Cooperative Attention)改进的双向长短期记忆网络(Bi-directional Long Short-Term Memory,BiLSTM)的水质预测模型,首先通过皮尔逊(Pearson)系数法筛选出与出水氨氮相关性较强的总氮、污泥沉降比和温度3个指标作为模型输入,联合3个维度的强相关信息对未来6 h的出水氨氮进行预测。结果表明,MDCA-BiLSTM模型在融合残差序列后对出水氨氮的预测准确率R2为0.979,并在太平污水处理厂和文昌污水处理厂两个站点收集到的数据集上总氮、总磷和溶解氧的均方根误差分别为0.002、0.003、0.001和0.004、0.003、0.002;预测精度分别为0.959、0.947、0.971和0.962、0.951、0.983;与BiLSTM相比,均方根误差分别降低了0.007、0.007、0.007和0.017、0.006、0.005;预测精度分别提高了0.176、0.183、0.258和0.098、0.109、0.11。同时,该模型在面对未来6、12和24 h的预测步长时,仍能够达到0.956、0.933和0.917的预测精度,说明改进后的模型在预测准确性和鲁棒性方面表现出显著优势。该方法能够有效提高污水处理厂出水氨氮的及其他指标的预测准确性,可作为水资源循环和管理决策的一种有效参考手段,具有较强的实际应用价值。 展开更多
关键词 水质参数 时序预测 时序卷积网络 双向长短期记忆循环神经网络 注意力机制
在线阅读 下载PDF
基于语义分类的物联网固件中第三方组件识别
16
作者 马峰 于丹 +2 位作者 杨玉丽 马垚 陈永乐 《计算机工程与设计》 北大核心 2025年第1期274-281,共8页
为扩大物联网固件中第三方组件识别范围,从软件供应链层面研究物联网固件安全,提出一种基于语义短文本分类的第三方组件识别方法。通过固件解压提取内部第三方组件和模拟组件运行的方式获取组件语义输出数据,利用Skip-gram将语义输出转... 为扩大物联网固件中第三方组件识别范围,从软件供应链层面研究物联网固件安全,提出一种基于语义短文本分类的第三方组件识别方法。通过固件解压提取内部第三方组件和模拟组件运行的方式获取组件语义输出数据,利用Skip-gram将语义输出转化为词嵌入表示,通过卷积神经网络和双向门控循环单元分别提取语义信息局部特征和全局特征,经过多头注意力机制区分关键语义特征,输入到Softmax分类器中实现可用于识别组件的语义信息分类。通过在10个流行的物联网生产商发布的5453个固件上进行实验,验证了该方法可有效识别第三方组件。 展开更多
关键词 物联网 软件供应链 固件安全 短文本分类 卷积神经网络 双向门控循环单元 多头注意力
在线阅读 下载PDF
融合RNN与稀疏自注意力的文本摘要方法
17
作者 刘钟 唐宏 +1 位作者 王宁喆 朱传润 《计算机工程》 北大核心 2025年第1期312-320,共9页
随着深度学习的高速发展,基于序列到序列(Seq2Seq)架构的文本摘要方法成为研究焦点,但现有大多数文本摘要模型受限于长期依赖,忽略了注意力机制复杂度以及词序信息对文本摘要生成的影响,生成的摘要丢失关键信息,偏离原文内容与意图,影... 随着深度学习的高速发展,基于序列到序列(Seq2Seq)架构的文本摘要方法成为研究焦点,但现有大多数文本摘要模型受限于长期依赖,忽略了注意力机制复杂度以及词序信息对文本摘要生成的影响,生成的摘要丢失关键信息,偏离原文内容与意图,影响用户体验。为了解决上述问题,提出一种基于Transformer改进的融合递归神经网络(RNN)与稀疏自注意力的文本摘要方法。首先采用窗口RNN模块,将输入文本按窗口划分,每个RNN对窗口内词序信息进行压缩,并通过窗口级别的表示整合为整个文本的表示,进而增强模型捕获局部依赖的能力;其次采用基于递归循环机制的缓存模块,循环缓存上一文本片段的信息到当前片段,允许模型更好地捕获长期依赖和全局信息;最后采用稀疏自注意力模块,通过块稀疏矩阵对注意力矩阵按块划分,关注并筛选出重要令牌对,而不是在所有令牌对上平均分配注意力,从而降低注意力的时间复杂度,提高长文本摘要任务的效率。实验结果表明,该方法在数据集text8、enwik8上的BPC分数相比于LoBART模型降低了0.02,在数据集wikitext-103以及ptb上的PPL分数相比于LoBART模型分别降低了1.0以上,验证了该方法的可行性与有效性。 展开更多
关键词 序列到序列架构 文本摘要 Transformer模型 递归神经网络 递归循环机制 稀疏自注意力机制
在线阅读 下载PDF
基于链路质量预测的UANET改进蚁群路由算法
18
作者 曾囿钧 周劼 +3 位作者 刘友江 曹韬 杨大龙 刘羽 《太赫兹科学与电子信息学报》 2025年第3期240-246,共7页
无人机自组网(UANET)可通过多跳转发增大通信范围,其中路由算法承担数据包传输路径规划的任务。针对高动态网络下,无人机定位偏差带来的定向天线波束对不准所造成的增益衰减问题,提出一种基于链路质量预测的蚁群路由算法(LQP-ACO)。该... 无人机自组网(UANET)可通过多跳转发增大通信范围,其中路由算法承担数据包传输路径规划的任务。针对高动态网络下,无人机定位偏差带来的定向天线波束对不准所造成的增益衰减问题,提出一种基于链路质量预测的蚁群路由算法(LQP-ACO)。该算法利用双向门控循环单元-全连接神经网络(BiGRU-FCNN)预测无人机节点之间的链路质量,然后根据预测的链路质量,利用蚁群算法寻找最优的2条路径进行业务数据传输。仿真结果表明,提出的路由算法相较于传统的Dijkstra算法,在随机路点(RWP)及随机游走(RW)移动模型下,丢包率分别降低了2.75%、4.5%。 展开更多
关键词 无人机自组网路由 蚁群优化算法 双向门控循环单元 全连接神经网络(FCNN)
在线阅读 下载PDF
面向样本不平衡的网络入侵检测方法
19
作者 王肖 李大鹏 《无线通信技术》 2025年第1期6-12,共7页
针对当前网络入侵检测方法特征信息提取不足、网络异常流量样本数量不平衡导致入侵检测准确率低的问题,提出一种结合卷积神经网络(Convolutional Neural Network,CNN)、双向门控循环单元(Bidirectional Gated Recurrent Unit,BiGRU)与... 针对当前网络入侵检测方法特征信息提取不足、网络异常流量样本数量不平衡导致入侵检测准确率低的问题,提出一种结合卷积神经网络(Convolutional Neural Network,CNN)、双向门控循环单元(Bidirectional Gated Recurrent Unit,BiGRU)与注意力机制的网络入侵检测方法。首先,对网络流量数据进行数据预处理;然后,通过一维卷积神经网络提取其局部特征,双向门控循环单元提取其长距离序列特征;最后,融合注意力机制使关键信息得到更好的表达。此外,引入Equalization Loss v2(EQL v2)作为损失函数对少数类样本进行加权,以解决网络流量样本不平衡的问题。在CICIDS2017数据集上的结果表明,所提方法能够有效改善原始数据集中的样本不平衡问题,提高对网络入侵的检测准确率和异常流量样本的检测能力。 展开更多
关键词 网络入侵检测 样本不平衡 卷积神经网络 双向门控循环控制单元 注意力机制
在线阅读 下载PDF
基于MSRC-BiGRU-SA的人体活动识别
20
作者 芦平 于增辉 华国环 《中国电子科学研究院学报》 2025年第1期25-32,共8页
针对目前基于可穿戴传感器的复杂人体活动分类算法大多忽略对多尺度特征的提取和关键特征捕捉的问题,文中提出一种多尺度残差卷积网络叠加双向门控循环单元和自注意力机制(MSRC-BiGRU-SA)的模型。首先,通过MSRC模块充分提取传感器数据... 针对目前基于可穿戴传感器的复杂人体活动分类算法大多忽略对多尺度特征的提取和关键特征捕捉的问题,文中提出一种多尺度残差卷积网络叠加双向门控循环单元和自注意力机制(MSRC-BiGRU-SA)的模型。首先,通过MSRC模块充分提取传感器数据的多尺度空间和时间特征并有效融合原始数据的特征信息,增强特征的表达能力和鲁棒性;其次,利用BiGRU模块充分捕捉时间序列的前后依赖关系;最后,通过SA模块增强模型对复杂活动关键特征的捕捉能力以提升分类性能。实验结果表明,在公开数据集上,该模型对复杂活动的分类准确率达到97.50%,相较于原始CNN-BiGRU模型提升了5.77%,与现有先进模型相比,具有更好的识别效果。 展开更多
关键词 复杂人体活动识别 卷积神经网络 双向门控循环单元 可穿戴传感器 深度学习
在线阅读 下载PDF
上一页 1 2 26 下一页 到第
使用帮助 返回顶部