Journal of Beijing University of Posts and Telecommunications

  • EI核心期刊

JOURNAL OF BEIJING UNIVERSITY OF POSTS AND TELECOM ›› 2013, Vol. 36 ›› Issue (4): 81-84.doi: 10.13190/jbupt.201304.81.wangxw

• Reports • Previous Articles     Next Articles

Cross-Lingual Pseudo Relevance Feedback Based on Bilingual Topics

WANG Xu-wen, WANG Xiao-jie, SUN Yue-ping   

  1. Center for Intelligence Science and Technology, Beijing University of Posts and Telecommunications, Beijing 100876, China
  • Received:2012-11-15 Online:2013-08-31 Published:2013-05-22

Abstract:

A cross-lingual pseudo relevance feedback model based on bilingual topics is proposed for cross language information retrieval task. The latent Dirichlet allocation (LDA) model is extended to the bilingual topic model, each topic could generate a source language token and a target language token. A strategy on how to choose topics and words for cross language query expansion is given, and the secondary retrieval is performed on the basis of the refined query translation. Experiments show that this model outperforms monolingual LDA-based feedback method as well as classical techniques based on vector space model.

Key words: pseudo relevance feedback, latent Dirichlet allocation, bilingual topics, cross language information retrieval, query expansion