期刊文献+

基于文字笔画方向直方图的文本图像文种识别 被引量:3

Script Identification of Document Image Based on Stroke Direction Histogram
在线阅读 下载PDF
导出
摘要 针对文本图像文种识别中特征提取速度和识别精度之间的矛盾,提出了一种基于文字笔画方向直方图的文种识别方法,利用笔画方向直方图对不同文种文字的笔画方向分布差异进行描述并提取特征,采用支持向量机对所提特征进行训练和分类,实现文字种类识别。在实验中选用有质量退化的中、英、俄、日、韩、阿拉伯等10种不同语言文字文本图像。实验结果表明,本方法运算速度快,有较高的识别准确率并对图像质量退化有较好鲁棒性。 Considering the contradiction between the speed of feature extraction and accuracy of identification results in script identification of document image,this paper proposes a new script identification algorithm based on the difference of the stroke direction distribution,and defines the stroke direction histogram,which describes the distribution of the stroke direction effectively.The Support Vector Machine(SVM) is applied for training and classifying the features extracted based on the stroke direction histogram to identify scripts in different languages.Experiments have been performed upon degraded document images,which include ten kinds of languages(Chinese,Russian,English,Japanese,Korean,Arabic,etc).Experimental results confirm that the proposed algorithm can identify scripts accurately and efficiently,and is robust to degraded images.
出处 《信息工程大学学报》 2011年第2期231-237,共7页 Journal of Information Engineering University
基金 国家自然科学基金资助项目(60970172)
关键词 文本图像 文种识别 笔画方向直方图 支持向量机 document image script identification stroke direction histogram support vector machine
  • 相关文献

参考文献9

  • 1Airami S,Manjula D.A survey of script identification techniques for multi-Script document images[J].International Journal of Recent Transactions in Engineering,2009,1(2):246-249.
  • 2Andrew Busch,Wageeh W Boles,Sridha Sridharan.Texture for script identification[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2005,27(11):1720-1732.
  • 3Tan T N.Rotation invariant texture features and their use in automatic script identification[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,1998,20(7):751-756.
  • 4曾理,唐远炎,陈廷槐.基于多尺度小波纹理分析的文字种类自动识别[J].计算机学报,2000,23(7):699-704. 被引量:26
  • 5Hiremath P S,Shivashankar S.Wavelet based co-occurrence histogram features for texture classification with an application to script identification in a document images[J].Pattern Recognition Letters,2008,29(9):1182-1189.
  • 6朱华光,平西建,程娟.基于二元树复数小波变换的文种自动识别[J].数据采集与处理,2008,23(6):766-771. 被引量:4
  • 7Padma M C,Vijaya P A.Entropy based texture features useful for automatic script identification[J].International Journal on Computer Science and Engineering,2010,2(2):115-120.
  • 8Vailaya A.Jain A K,Zhang H J.On image classification:city vs.Landscape[C]///IEEE Workshop on Content-based Access of Image and Video Libraries.1998,6(21):3-8.
  • 9Vapnik V.The nature of statstical learning theory[M].New York:Springer-Vedag,1995.

二级参考文献12

  • 1Wood S L, Yao X Z, Krishnamurthi K, et al. 1995 Language identification for printed text independent of segmentation[C]//Proceedings of the 1995 International Conference on Image Processing. [s.l.]:IEEE, 1995,9 :428-431.
  • 2Spitz A L. Determination of the script and language content of document images[J]. IEEE PAMI, 1997, 19(3):235-245.
  • 3Peake G S, Tan T N. Script and language identification from document images [C]//Proc of Workshop Document Image analysis. [S.l.]: IEEE, 1997, 5 : 10- 17.
  • 4Kingsbury N G. The dual-tree complex wavelet transform: a new technique for shift invariance and directional filters[C]//Proc 8th IEEE DSP Workshop. [S.l.]:IEEE, 1998:86-89.
  • 5Kingsbury N G. Image processing with complex wavelets[J]. Philos, Math Phys Sci, 1999, 357 (1760) : 2543-2560.
  • 6Kingsbury N G. Complex wavelets for shift invariant analysis and filtering of signals[J].Appl Comput Harmon, 2001, 10(3) :234-253.
  • 7Selesnick I W. Hilbert transform pairs of wavelet bases[J]. IEEE Signal Processing Letters, 2001, 8 (6) : 170-173.
  • 8Abdelnour A F, Selesnick I W. Symmetric nearly shift-invariant tight frame wavelets [J]. IEEE Transactions on Signal Processing, 2005, 53 (1) : 231-239.
  • 9Kingsbury N G. A dual-tree complex wavelet transform with improved orthogonality and symmetry properties [C]//IEEE Int Conf Image Processing. [S.l]:IEEE,2000, 2,275-378.
  • 10Tan T N,IEEE Trans Pattern Anal Machine Intell,1998年,20卷,7期,751页

共引文献25

同被引文献3

引证文献3

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部