首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于关键词共现分析的检索结果聚类研究
引用本文:李枫林,何洲芳.基于关键词共现分析的检索结果聚类研究[J].情报学报,2011,30(8).
作者姓名:李枫林  何洲芳
作者单位:武汉大学信息资源研究中心,武汉,430072
基金项目:教育部人文社会科学重点研究基地重大项目
摘    要:随着互联网规模的急剧扩张,提升信息检索的效用变得相当困难.本文首先通过特定算法提取每篇文档的关键词,然后运用统计方法计量不同文档的共现关键词并形成相应的共现关键词标签矩阵,最后利用层次聚类算法对共现关键词标签进行聚类并形成相应的层次标签树来构造文档聚类束.该方法可以对源搜索引擎返回的结果进行有效的分类,使用户在更高主题层次上查看检索词的相关信息,准确地找到感兴趣的信息.通过与Lingo算法的比较,显示本文算法所得的标签更具可读性和概括性,同时F-measure评价指标也表明本算法在文本聚类的质量上有了一定的提升.

关 键 词:关键词  共现  聚类  检索结果

Study on Clustering of Retrieval Results Based on Co-occurrence Analysis of Keywords
Li Fenglin,He Zhoufang.Study on Clustering of Retrieval Results Based on Co-occurrence Analysis of Keywords[J].Journal of the China Society for Scientific andTechnical Information,2011,30(8).
Authors:Li Fenglin  He Zhoufang
Institution:Li Fenglin and He Zhoufang (Center for Studies of Information Resources of Wuhan University,Wuhan 430072)
Abstract:The continuous growth in the size of the Internet is creating difficulties for improving efficiency of information retrieval.First of all,this paper extracts the keywords from each document through a specific algorithm. Secondly,it has applied statistical techniques to measure the quantities of co-occurrence keywords for forming the label matrix of them,and finally agglomerated them into higher-level clusters by hierarchical clustering algorithm in order to classify the results which return from the source ...
Keywords:co-occurrence  clustering  retrieval results  
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号