首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Document reranking by term distribution and maximal marginal relevance for chinese information retrieval
Authors:Lingpeng Yang  Donghong Ji  Munkew Leong
Institution:Institute for Infocomm Research, Media Understanding, 21 Heng Mui Keng Terrace, Singapore 119613, Singapore
Abstract:In this paper, we propose a document reranking method for Chinese information retrieval. The method is based on a term weighting scheme, which integrates local and global distribution of terms as well as document frequency, document positions and term length. The weight scheme allows randomly setting a larger portion of the retrieved documents as relevance feedback, and lifts off the worry that very fewer relevant documents appear in top retrieved documents. It also helps to improve the performance of maximal marginal relevance (MMR) in document reranking. The method was evaluated by MAP (mean average precision), a recall-oriented measure. Significance tests showed that our method can get significant improvement against standard baselines, and outperform relevant methods consistently.
Keywords:Relevance feedback  Term extraction  Term weighting  Maximal marginal relevance  Chinese information retrieval
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号