期刊文献+

基于HNC理论的中文文本词汇链构造方法

Constructing Method of Chinese Text Lexical Chains Based on HNC Theory
在线阅读 下载PDF
导出
摘要 [目的/意义]词汇链是文本中一系列词汇关联而成的语义链。构造词汇链有助于读者把握文本主题,对知识元构建、自动文摘生成等领域研究有重要价值。[方法/过程]借鉴HNC理论的概念层次原理对词汇语义进行形式化描述,利用HNC的符号和同行优先准则,及依存句法揭示的多义词上下文信息,对词语语义进行消歧处理,进而运用HNC语义相似度计算方法获得词语的语义关联性。[结果/结论]在此基础上,提出词汇链构建算法,并对词汇链进行优选处理,获得优选词汇链。实验结果表明,基于HNC理论和依存句法提出的词汇链构造方法的可接受度较高。 [ Purpose/Significance] The lexical chain is a semantic chain connected with a series of words in the text. Building lexical chains helps readers to grasp the theme of the text, and has an important value on knowledge element building and automatic abstract generation. [ Method/Process] Based on the HNC ( Hierarchical Network of Concepts) theory, this paper describes the lexical semantics for- mally, using the symbols and the same line priority criteria of the HNC, and the polysemy contextual information revealed by dependency syntax. [ Result/Conclusion] Then the semantic relatedness of words is obtained by applying HNC semantic similarity calculation. On this basis, a lexical chain building algorithm is proposed, and the preferred lexical chain is achieved after optimization processing. Experiment results show that the proposed lexical chain building method is of a high acceptance degree.
作者 王宇 伍力慧
出处 《情报杂志》 CSSCI 北大核心 2016年第2期182-187,共6页 Journal of Intelligence
关键词 词汇链 语义计算 HNC理论 依存句法 lexical chain semantic calculation HNC theory dependency syntax
  • 相关文献

参考文献14

  • 1Morris J ,Hirst G. Lexieal Cohesion Computed by Thesauri Rela- tions as an Indicator of the Structure of Text [ J ]. Computational Linguistics,1991,17( 1 ) : 21-48.
  • 2Halliday Mak, Hasan R. Cohesion in English [ M ]. London, UK: Longman, 1976.
  • 3Barzilay R, Ethadad M. Using Lexical Chains for Text Summa- rization [ C ]. In Proceedings of the Intelligent Sealable Text Summarization Workshop( ISTS. 97 ), Madrid, 1997.
  • 4Silber H G, McCoy K F. Efficiently Computed Lexical Chains as an Intermediate Representation for Automatic Text Summariza- tion [ J ]. Computational Linguistics,2002,28 (4) : 487-496.
  • 5Ercan G, Cicekli Y. Using Lexical Chains for Keyword Extraction [ J ]. Information Processing & Management,2007,43 ( 6 ) : 1705 -1714.
  • 6刘铭,王晓龙,刘远超.基于词汇链的关键短语抽取方法的研究[J].计算机学报,2010,33(7):1246-1255. 被引量:14
  • 7张明宝,马静.一种面向语义的信息检索方法[J].情报学报,2009,28(4):509-515. 被引量:4
  • 8张明宝,谢宗旺.一种基于知网的中文词汇链构建算法研究[J].软件导刊,2008,7(10):51-53. 被引量:4
  • 9宋培彦,杨代庆.基于语义网络的中文词汇链构造方法[J].图书情报工作,2011,55(22):26-29. 被引量:6
  • 10Banerjee S, Pedersen T. Extended Gloss Overlaps as a Measure of Semantic Relatedness [ C ]//IJCAI. 2003 ( 3 ) : 805-810.

二级参考文献91

共引文献160

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部