期刊文献+

基于MLER的语音/音乐分类方法 被引量:6

Speech/music discrimination based on a modified low energy ratio
原文传递
导出
摘要 音频分类是音频信号处理中一项重要的预处理工作。该文描述了一种基于能量的分类方法,将音频信号分为语音和音乐2种类型。分类的过程分为3个阶段,首先计算优化低能量率MLER(modified low energy ratio)作为特征,然后利用初级分类器得到初步分类的结果,最后利用音频类别的前后相关性,使用上下文分类器修正初始分类得到最终分类的结果。该文重点对MLER中参数的合理选取范围进行了讨论,并对传统的初始分类器作了改进,用非参数分类器和参数分类器代替原有的Bayes硬判决的方法,避免了由于门限选择不当所带来的分类错误。实验表明,使用参数分类器时,对纯语音和纯音乐分类效果很好,正确率达99%以上。 Audio classification is an important pretreatment for audio signal processing.This paper presents a speech/music discrimination method based on the energy of the audio signal.The first step extracts the modified low energy ratio.A junior classifier then gives a primary speech/music discrimination result with a context-based classifier then used to refine the primary result based on the class correlation between adjacent audio frames.This paper focuses on selection of the most appropriate parameters and impr...
出处 《清华大学学报(自然科学版)》 EI CAS CSCD 北大核心 2008年第S1期720-724,共5页 Journal of Tsinghua University(Science and Technology)
关键词 MLER(modified LOW ENERGY ratio) 音频分类 非参数分类器 参数分类器 MLER(modified low energy ratio) audio classification nonparametric classifier parametric classifier
  • 相关文献

参考文献6

  • 1LIN Rueishiang,CHEN Linghwei.A new approach for audioclassification and segmentation using gabor wavelets andfisher linear discriminator[].International Journal ofPattern Recognition and Artif icial Intelligence.2005
  • 2Ajmera J,McCowan I,Bourlard H.Speech/musicsegmentation using entropy and dynamism features in aHMM classification framework[].Speech Communication.2003
  • 3Saunder J.Real-time discrimination of broadcastspeech/music[].ICASSP’.1996
  • 4Scheirer E,Slaney M.Construction and evaluation of arobust multifeature speech/music discrimination[].ICASSP’.1997
  • 5Wang W Q,Gao W,Ying D W.A fast and robustspeech/music discrimination approach[].ICICS-PCM.2003
  • 6Lu L,Zhang H J,Jiang H.Content Analysis for Audio Classification and Segmentation[].IEEE Transactions on Speech and Audio Processing.2002

同被引文献48

  • 1陈功,王振力,张建兵.基于短时能量的语音/音乐快速分类[J].电子技术应用,2006,32(1):53-55. 被引量:3
  • 2董婧,赵晓晖,应娜.基于二进小波变换的基音检测算法[J].吉林大学学报(工学版),2006,36(6):978-982. 被引量:2
  • 3陈功,张雄伟.一种基于灰关联分析的语音/音乐分类方法[J].声学技术,2007,26(2):262-267. 被引量:8
  • 4SAUNDERS J. Real-time discrimination of broadcast speech/music [ C]//Proceedings of the IEEE Conference on Acoustics, Speech, and Signal Processing: ICASSP 96. Washington, DC: IEEE Computer Society, 1996, 2:993 -996.
  • 5SCHEIRER E, SLANEY M. Construction and evaluation of a robust multifeature music/speech discriminator [ C ]// Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing: ICASSP97. Washington, DC: IEEE Computer Society, 1997,2: 1331.
  • 6WOLD E, BLUM T, KEISLAR D, et al. Content-based classification search and retrieval of audio[ J]. IEEE Multimedia Magazine, 1996,3(3): 27 -36.
  • 7CORTIZO E, ZURERA M, FERRERAS F. Application of Fisher linear discriminant analysis to speech/music classification [ C]// International Conference on Computer as a Tool: EUROCON 2005. Wash- ington, DC: IEEE Press, 2005:21-24.
  • 8QURESHI A F, KIRANYAZ S, GABBOUJ M. A genetic audio classification and segmentation approach for multimedia indexing and retrieval[ J]. IEEE Transactions on Speech and Audio Processing, 2006, 9(3) : 517 - 523.
  • 9SARIKAYA R, PELLOM B L, HANSEN J H L. Wavelet packet transform features with application to speaker identification [ C]// IEEE Nordic Signal Processing Symposium: NORSIG 98. Washington, DC: IEEE Press, 1998:81-84.
  • 10GROFTT S, LAVNER Y. Time-scale modification of audio signals using enhanced WSOLA with management of transients[J]. IEEE Transactions on Audio, Speech, and Language Processing, 2008, 16(1): 106-115.

引证文献6

二级引证文献20

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部