期刊文献+

基于注意力和深度学习的双耳语音增强算法

Binaural speech enhancement algorithm based on attention and deep learning
原文传递
导出
摘要 为了降低双耳语音中噪声和混响的影响,提高语音质量和可懂度,提出了一种基于注意力机制和改进的卷积循环神经网络的双耳语音增强算法.在该算法中,首先提取双耳语音的谱特征和双耳线索,对谱特征应用通道注意力得到可靠的谱特征,同时对双耳线索应用空间注意力得到可靠的双耳线索作为神经网络的输入特征.然后,构建了将模型注意力作为卷积循环神经网络编解码层的跳跃连接的神经网络结构,并利用双向长短期记忆网络获取时序信息.实验结果表明:在不同噪声与混响的条件下,所提出的算法具有更好的性能. In order to reduce the influence of noise and reverberation in binaural speech,and improve speech quality and intelligibility,a binaural speech enhancement algorithm based on attention mechanism and improved convolutional recurrent neural network was proposed.In this algorithm,the spectral features and binaural cues of binaural speech were first extracted,and channel attention was applied to the spectral features to obtain reliable spectral features,while reliable binaural cues were obtained by applying spatial attention to the binaural cues,then the two features were combined as the input of neural network.A neural network structure that uses model attention as a skip connection for the encode layer and decode layer of convolutional recurrent neural network,and the bidirectional long and short-term memory network was used to obtain time domain information.Experimental results show that the proposed algorithm has better performance in different noise and reverberation conditions.
作者 李如玮 李秋艳 赵丰年 刘尚枫 LI Ruwei;LI Qiuyan;ZHAO Fengnian;LIU Shangfeng(Faculty of Information Technology,Beijing University of Technology,Beijing 100124,China)
出处 《华中科技大学学报(自然科学版)》 EI CAS CSCD 北大核心 2023年第9期125-131,166,共8页 Journal of Huazhong University of Science and Technology(Natural Science Edition)
基金 国家自然科学基金项目资助(61971016).
关键词 双耳语音增强 注意力机制 卷积循环神经网络 双耳线索 Gammatone滤波器 binaural speech enhancement attention mechanism convolutional recurrent neural network binaural cues Gammatone filter
  • 相关文献

参考文献3

二级参考文献14

共引文献11

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部