
A Web Information Retrieval Method Based on Web Page Segmentation (cited by: 3)

Information Retrieval Method based on Page Segmentation
Abstract: This paper proposes a Web information retrieval algorithm based on web page content segmentation. Because web pages are semi-structured, the algorithm divides each page into regions (topic areas) according to its HTML tags and its content. It first builds an HTML tag tree and then merges nodes in the tree using both content similarity and visual similarity. During retrieval and ranking, it uses this region information, together with the user's query, to search and order the relevant results.
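The pipeline outlined in the abstract (tag tree, node merging, region-aware ranking) can be illustrated with a minimal Python sketch. The abstract does not give the paper's actual similarity measures, thresholds, or ranking formula, so everything here is an assumption for illustration: `BLOCK_TAGS`, the `threshold` value, bag-of-words cosine similarity as the content similarity, "same tag name" as a crude stand-in for visual similarity, and scoring a page by its best-matching region.

```python
import math
import re
from collections import Counter
from html.parser import HTMLParser

BLOCK_TAGS = {"div", "p", "td", "li", "h1", "h2", "h3"}  # assumption: region-forming tags


class Node:
    def __init__(self, tag, parent=None):
        self.tag, self.parent, self.children, self.text = tag, parent, [], ""


class TreeBuilder(HTMLParser):
    """Build a simplified tag tree; only block-level tags become nodes."""

    def __init__(self):
        super().__init__()
        self.root = Node("root")
        self.current = self.root

    def handle_starttag(self, tag, attrs):
        if tag in BLOCK_TAGS:
            node = Node(tag, self.current)
            self.current.children.append(node)
            self.current = node

    def handle_endtag(self, tag):
        # Close the current node only when the end tag matches it.
        if tag in BLOCK_TAGS and self.current.tag == tag:
            self.current = self.current.parent

    def handle_data(self, data):
        # Text inside inline tags is attributed to the enclosing block node.
        self.current.text += data


def tokens(text):
    return Counter(re.findall(r"\w+", text.lower()))


def cosine(a, b):
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def regions(node):
    """Yield (tag, text) for every node that carries its own text (pre-order)."""
    if node.text.strip():
        yield node.tag, node.text.strip()
    for child in node.children:
        yield from regions(child)


def segment(html, threshold=0.3):
    """Merge adjacent regions that share a tag (visual proxy) and similar content."""
    builder = TreeBuilder()
    builder.feed(html)
    merged = []
    for tag, text in regions(builder.root):
        if merged:
            prev_tag, prev_text = merged[-1]
            if prev_tag == tag and cosine(tokens(prev_text), tokens(text)) >= threshold:
                merged[-1] = (tag, prev_text + " " + text)
                continue
        merged.append((tag, text))
    return [text for _, text in merged]


def rank(query, pages):
    """Order page ids by the score of their best-matching region (assumed scoring)."""
    q = tokens(query)
    scores = {
        pid: max((cosine(q, tokens(s)) for s in segment(html)), default=0.0)
        for pid, html in pages.items()
    }
    return sorted(scores, key=scores.get, reverse=True)
```

Scoring each page by its best region, rather than by the whole page, is one plausible way to use region information: a page with one tightly focused region on the query topic can outrank a page where the query terms are diluted across unrelated text.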
Source: Library and Information Service (《图书情报工作》), CSSCI, Peking University Core Journal, 2009, No. 3, pp. 108-110, 114 (4 pages)
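Funding: One of the research outcomes of the Huai'an Science and Technology Plan project "Web-Based Science and Technology Plan Project Management System" (project no. HAG08081)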
Keywords: web page segmentation; information retrieval; HTML tag; similarity
