摘要
针对基于数据图的关系数据库关键词查询结果的排序问题,提出了基于多因素的结果二度排序法。该方法结合结果结构权重和信息检索中常用的内容匹配,首先采用结果路径权重衡量关键词之间的关联紧密程度对结果粗排序;然后,对于结构权重相等的结果,引入信息元组中的关键词词频和包含关键词的信息量对结果细排序。实验分析表明,该排序方法能将与查询条件高度相关的结果排在前面,提高结果的查准率。
Abstract: Aiming at the problem of ranking keyword search results in relational database based on data graph, this paper presented two factors based results ranking method which combined result' s structure weight and content matching. It often used content matching in field of information retrieve. Firstly, it used result' s path weight to measure the compact degree among keywords, results could get a rough ranking according that. Then for results with the same structure weight, keyword frequency in information tuple and amount of information that contain keywords were used to further ranking. Experiment and analysis shows that this ranking method can improve result precision by making highly relevant results in front of result set.
出处
《计算机应用研究》
CSCD
北大核心
2014年第2期440-442,447,共4页
Application Research of Computers
基金
江西省教育厅科技项目(GJJ12349
GJJ12345
GJJ12347)
江西省科技厅青年科学基金资助项目(20122BAB211035)
江西省自然基金资助项目(20122BAB201045)
江西省教育厅重点项目(赣教技字[12770]号)
关键词
关系数据库
数据图
关键词查询
关键词词频
信息量
排序
relational database
data graph
keyword search
keyword frequency
amount of information
ranking