期刊文献+

一种基于最近搜索周期被引用频率的改进WPR算法

Improved WPR Algorithm Based on Referenced Frequency in Recent Search Cycle
在线阅读 下载PDF
导出
摘要 针对WPR(Weighted PageRank)算法存在的在网页搜索方面的主题漂移和偏重旧网页的现象,综合网页的主题特征和最近搜索周期网页的被引用频率两个因素,提出了一种改进的算法WTFPR(Weighted Topic Frequency PageRank)。该算法通过内容分析,采用改进的TD-IDF算法来解决网页相关性,改善主题漂移现象;通过网页的最近搜索周期的被引用频率来提高那些较新而且价值较高的网页的PR值,从而改善偏重旧网页的现象。仿真结果表明,改进后的算法与WPR算法相比获得了更好的效果。 For the topic drift and bias towards the old pages of WPR(Weighted PageRank)algorithm exist in the Web search,consolidated two factors of Web pages' topic features and referenced frequency in recent search cycle,we proposed an improved algorithm WTFPR(Weighted Topic Frequency PageRank).The algorithm uses improved TD-IDF algorithm to solve relevance of page by content analysis to reduce the topic drift.The algorithm improves the PR value of new and has high quality by referenced frequency of pages in recent search cycle,reducing bias towards the old pages.Simulation results show that the improved algorithm obtaines better results compared to WPR.
出处 《计算机科学》 CSCD 北大核心 2016年第2期86-88,共3页 Computer Science
关键词 主题特征 被引用频率 偏重旧网页 搜索周期 主题漂移 Topic features Referenced frequency Bias towards the old pages Search cycle Topic drift
  • 相关文献

参考文献3

二级参考文献36

  • 1戚华春,黄德才,郑月锋.具有时间反馈的PageRank改进算法[J].浙江工业大学学报,2005,33(3):272-275. 被引量:27
  • 2Page L, Brin S, Motwani R, et al. The PageRank Citation Ranking: Bringing Order to the Web[R]. Califonia, USA: Stanford Digital Library, Tech. Rep.: SIDL-WP-1999-0120, 1998.
  • 3Haveliwala T H. Topic-sensitive PageRank[C]//Proceedings of the 11 th International Conference on World Wide Web. Hawaii, USA: ACM Press, 2002.
  • 4Richardson M, Domingos E The Intelligent Surfer: Probabilistic Combination of Link and Content Information in PageRank[J]. Advances in Neural Information Processing Systems, 2002, (14): 1441-1448.
  • 5Haveliwala T. Effcien Computationof PageRank[R]. Califonia, USA: Computer Science Department, Stanford University, Technical Report: 1999-31-386, 1999.
  • 6Fung B C M,Wang K,Ester M.Hierarchical document clustering//Wang John ed.The Encyclopedia of Data Warehousing and Mining,idea Group.2005:970-975.
  • 7Salton G.The SMART Retrieval System-Experiments in Automatic Document Processing.Englewood Cliffs,New Jersey:Prentice Hall Inc,1971.
  • 8Wang Y,Julia H.Document clustering with semantic analysis//Proceedings of the 39th Hawaii International Conferences on System Sciences.Hawaii,US,2006:54-63.
  • 9Hotho A,Staab S,Stumme G.Wordnet improves text document clustering//Proceedings of the Semantic Web Workshop at SIGIR-2003,26th Annual International ACM SIGIR Conference.Toronto,Canada,2003:541-550.
  • 10Hall P,Dowling G.Approximate string matching.Computing Survey,1980,12(4):381-402.

共引文献301

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部