北京邮电大学学报

  • EI核心期刊

北京邮电大学学报 ›› 2008, Vol. 31 ›› Issue (5): 78-81.doi: 10.13190/jbupt.200805.78.323

• 研究报告 • 上一篇    下一篇

元搜索引擎结果合成算法研究

李红梅1,2, 丁振国1, 周水生3, 周利华1
  

  1. 1. 西安电子科技大学 计算机学院, 西安 710071; 2. 河北农业大学 信息科学与技术学院, 保定071001;
    3. 西安电子科技大学 理学院, 西安 710071
  • 收稿日期:2007-12-19 修回日期:1900-01-01 出版日期:2008-10-30 发布日期:2008-10-30
  • 通讯作者: 李红梅

Research on Results Merging in Meta Search Engine
 

LI Hong-mei1,2, DING Zhen-guo1, ZHOU Shui-Sheng3, ZHOU Li-hua1
  

  1. 1. School of Computer Science and Technology, Xidian University, Xi’an 710071, China;
    2. College of Information Science and Technology, Agricultural University of Hebei, Baoding 071001,China;
    3. School of Science , Xidian University , Xi’an 710071, China
  • Received:2007-12-19 Revised:1900-01-01 Online:2008-10-30 Published:2008-10-30
  • Contact: LI Hong-mei

摘要:

提出了一种基于文本/位置分析和群决策的查询结果合成算法.在充分考虑搜索结果文本信息的基础之上,提出查询匹配度的概念,并对搜索结果的标题和短文摘进行相关度分析,通过将文本分析与规范化的搜索结果排序值相结合来计算文档的相关分值.在估计非相关文档的相关分值时,针对不同假设条件分别进行了讨论,并提出改进的影子文档算法.然后,采用基于群决策的合成方法对相关分值进行合并,实现搜索结果的一致性排序.实验结果表明采用该算法,搜索结果的相关性明显优于Round-robin、CombSum和CombMNZ 3种合成算法.

关键词: 元搜索, 信息检索, 搜索结果合成, 文本分析

Abstract:

A result merging method based on text / rank analysis and Group Decision Making activity is proposed. Utilizing text-based information obtained from search results, a definition of query-match grade is presented, and an approach on text normalization for meta search is described. The relevant scores of relevant documents are normalized by incorporating text analysis measure with existing rank-based method. When estimating scores of non-relevant documents, different assumptions are discussed, and an improved shadow document method is proposed as well. So a merging method based on Group Decision Making activity is adopted to sort the search results. Four different search engines are tested in the practical web environment. The experimental results show that this method is more effective than other three merging methods: Round-robin, CombSum and CombMNZ.

Key words: meta search, information retrieval, search results merging, text analysis

中图分类号: