首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到19条相似文献,搜索用时 203 毫秒
1.
董丕彦  马巍 《情报科学》2004,22(8):967-970
本文介绍了利用相关词进行提问扩展的算法。该算法建立在检索词模糊聚类的基础上,聚类以检索词在文献中共同出现为标准,与提问中检索词相关的群集形成提问的上下文,群集中属于上下文的检索词可用于提问的扩展。实验表明该算法提高了检准率。  相似文献   

2.
文章在分析传统查询扩展方法不足的基础上,提出了基于用户反馈的查询扩展方法,结合WordNet和概念相似性方法为用户提供用于查询扩展的候选集,最后用户从中挑选出扩展检索词用于查询扩展,从而为用户返回更加准确的检索结果.  相似文献   

3.
人们在利用搜索引擎进行信息检索时,较少的检索词难以反映用户真正的检索意图,因此对用户输入的检索词进行扩展尤为必要。对传统的查询扩展进行了改进,通过建立领域本体,借助本体及本体的推理机制,将用户输入的检索词从直接和间接两方面扩展为语义联系的查询关键词集合,以提高信息检索质量和效率。  相似文献   

4.
针对目前常用的信息检索算法普遍存在查询性能不高的问题。本文提出了一种基于AWAR算法的信息检索扩展查询模型,该模型首先采用传统向量空间模型算法对检索目标进行初检,然后利用最小完全加权置信度阈值生成完全加权关联规则,最后根据规则提取扩展词,得到查询结果。实验表明,基于AWAR算法的信息检索扩展查询模型的检索性能比传统向量空间模型算法和基于局部上下文分析的查询扩展的检索算法要高。  相似文献   

5.
本文主要研究了查询语义树的生成策略、用户查询语义的提取机制,以及查询语义树中语义边界的确定方法。通过查询语义树产生候选扩展词,再计算候选扩展词与所有查询项在初检局部文档集合中的共现度,用以评估扩展词质量,使得扩展词与用户查询所蕴涵的主题具有较强的语义相关性。实验结果表明,与传统向量空间模型检索算法比较,查询性能有明显的改善。  相似文献   

6.
杨韦洁  高珑  苏静 《现代情报》2014,34(7):78-82,87
针对传统数字图书馆中基于关键字的P2P查询扩展存在对用户检索词语义信息解释不足的缺陷,本文提出一种P2P环境下基于语义的节点查询扩展方法,通过把关键字关联表和本体相结合,实现了一种个性化查询扩展方法,同时利用这种扩展方法实现P2P中基于兴趣网络的搜索,能够较大幅度提升检索效率。  相似文献   

7.
信息检索的关键在于检索词的处理,文章针对信息检索中的检索词运用方法进行分析,并提出检索词的处理技巧,以帮助用户提高信息检索效率.  相似文献   

8.
Web智能检索中动态相关反馈技术研究   总被引:5,自引:1,他引:5  
随着网络技术的发展 ,网络逐步成为巨大的、分布广泛的综合了复杂文本、图像、声音等的信息资源中心。由于网络提供的信息大多缺乏结构上的统一性、组织上的有序性 ,所以非常有必要帮助用户有效收集信息并选择感兴趣的信息推荐给用户 ,做到用户信息检索的智能化 ,从而降低网络查询时间 ,提高网络检索精度。本文介绍的动态相关反馈技术可在用户进行Web浏览及查询时动态跟踪用户兴趣。在用户进行Web检索时通过用户即时反馈 ,确定用户兴趣模式 ,从而实现智能Web检索 ,使得检索结果最大程度地满足用户需要。1 Web智能检索中的动态…  相似文献   

9.
在Web信息检索中,为了明确用户的查询需求,很多搜索引擎和全文数据库提供了相关词提示功能。本文简要介绍了Web信息检索中相关词提示的获取技术,并对相关词提示效果进行实际调查分析。从关键词库中随机抽取若干关键词,在选定的搜索引擎和全文数据库上进行信息检索,获取抽样关键词的相关提示词。通过关键词检索、人工打分和数据统计,进行查询扩展分析、查询式专指度分析和查准率分析,给出相关词提示在改善检索效果和用户满意度方面的综合评价。  相似文献   

10.
赵文娟  刘忠宝  郭慧 《情报科学》2019,37(5):108-114
【目的/意义】传统的信息检索技术主要是基于关键词匹配的信息推送,该方法容易出现漏检和误检的情 况。语义检索通过语义分析获得用户真正的检索意图,实现精准检索。【方法/过程】本文在对语义检索的原理和模 型进行描述的基础上,提出了基于本体概念树模型的词元扩展算法,通过对词元的语义相似性、语义相关性进行计 算,得出词元的语义关联度,关联度超过一定阈值的词元的集合即为扩展后的词元集。【结果/结论】该方法既考虑 了具有继承关系的词元间的语义相似性,也考虑了具有相同属性词元间的语义关联度,结论更具参考价值。  相似文献   

11.
Broken hypertext links are a frequent problem in the Web. Sometimes the page which a link points to has disappeared forever, but in many other cases the page has simply been moved to another location in the same web site or to another one. In some cases the page besides being moved, is updated, becoming a bit different to the original one but rather similar. In all these cases it can be very useful to have a tool that provides us with pages highly related to the broken link, since we could select the most appropriate one. The relationship between the broken link and its possible linkable pages, can be defined as a function of many factors. In this work we have employed several resources both in the context of the link and in the Web to look for pages related to a broken link. From the resources in the context of a link, we have analyzed several sources of information such as the anchor text, the text surrounding the anchor, the URL and the page containing the link. We have also extracted information about a link from the Web infrastructure such as search engines, Internet archives and social tagging systems. We have combined all of these resources to design a system that recommends pages that can be used to recover the broken link. A novel methodology is presented to evaluate the system without resorting to user judgments, thus increasing the objectivity of the results, and helping to adjust the parameters of the algorithm. We have also compiled a web page collection with true broken links, which has been used to test the full system by humans.  相似文献   

12.
本文分析了正方法,查询修正中的用户信息行为,吸收网页抓取、检索与浏览并重的思想,综合考虑用户Web搜索过程中的行为特点、查询修正所用词汇的可用来源,给出一个新的面向Web搜索的查询修正解决方案.  相似文献   

13.
Web信息检索系统中的网页质量分析方法评价   总被引:1,自引:0,他引:1  
李树青  崔慧智 《情报科学》2008,26(5):729-734
改进对高质量网页的检索精度,将会极大提高Web信息检索系统的用户满意度。首先提出了信息检索中的“有用性”指标,并据此论述了基于网页质量分析方法的Web信息检索模型,然后提出了网页质量直接测度指标和网页质量间接测度指标。最后,详细介绍了各种网页质量指标的相关研究内容和方法,并做出了针对性的评价。  相似文献   

14.
基于页面链接挖掘的Web教育信息检索   总被引:2,自引:0,他引:2  
王成云  王乐乐 《情报科学》2004,22(4):475-477,487
教育信息检索是教育信息应用于教育科研与教育教学的关键环节,而Web页面链接挖掘是对Web页面之间的链接结构进行挖掘。本文对Web链接结构挖掘在教育信息检索方面上进行了研究,介绍了Web挖掘的概念、分类,以及HITS与Page—rank等算法,并提出了一种基于样本模式特征提取的信息检索方法。  相似文献   

15.
全文检索搜索引擎中文信息处理技术研究   总被引:2,自引:0,他引:2  
唐培丽  胡明  解飞  刘钢 《情报科学》2006,24(6):895-899,909
本文深入分析了全文检索中文搜索引擎的关键技术,提出了一种适用于全文检索搜索引擎的中文分词方案,既提高了分词的准确性,又能识别文中的未登录词。针对向量空间信息检索模型,本文设计了一个综合考虑中文词在Web文本中的位置、长度以及频率等重要因素的词条权重计算函数,并且用量化的方法表示出其重要性,能够较准确地反映出词条在Web文档中的重要程度。最后对分词算法进行了测试,测试表明该方法能够提高分词准确度满足实用的要求。  相似文献   

16.
Pre-adoption expectations often serve as an implicit reference point in users’ evaluation of information systems and are closely associated with their goals of interactions, behaviors, and overall satisfaction. Despite the empirically confirmed impacts, users’ search expectations and their connections to tasks, users, search experiences, and behaviors have been scarcely studied in the context of online information search. To address the gap, we collected 116 sessions from 60 participants in a controlled-lab Web search study and gathered direct feedback on their in-situ expected information gains (e.g., number of useful pages) and expected search efforts (e.g., clicks and dwell time) under each query during search sessions. Our study aims to examine (1) how users’ pre-search experience, task characteristics, and in-session experience affect their current expectations and (2) how user expectations are correlated with search behaviors and satisfaction. Our results with both quantitative and qualitative evidence demonstrate that: (1) user expectation is significantly affected by task characteristics, previous and in-situ search experience; (2) user expectation is closely associated with users’ browsing behaviors and search satisfaction. The knowledge learned about user expectation advances our understanding of users’ search behavioral patterns and their evaluations of interaction experience and will also facilitate the design, implementation, and evaluation of expectation-aware user models, metrics, and information retrieval (IR) systems.  相似文献   

17.
As an information medium, video offers many possible retrieval and browsing modalities, far more than text, image or audio. Some of these, like searching the text of the spoken dialogue, are well developed, others like keyframe browsing tools are in their infancy, and others not yet technically achievable. For those modalities for browsing and retrieval which we cannot yet achieve we can only speculate as to how useful they will actually be, but we do not know for sure. In our work we have created a system to support multiple modalities for video browsing and retrieval including text search through the spoken dialogue, image matching against shot keyframes and object matching against segmented video objects. For the last of these, automatic segmentation and tracking of video objects is a computationally demanding problem which is not yet solved for generic natural video material, and when it is then it is expected to open up possibilities for user interaction with objects in video, including searching and browsing. In this paper we achieve object segmentation by working in a closed domain of animated cartoons. We describe an interactive user experiment on a medium-sized corpus of video where we were able to measure users’ use of video objects versus other modes of retrieval during multiple-iteration searching. Results of this experiment show that although object searching is used far less than text searching in the first iteration of a user’s search it is a popular and useful search type once an initial set of relevant shots have been found.  相似文献   

18.
Topic distillation is one of the main information needs when users search the Web. Previous approaches for topic distillation treat single page as the basic searching unit, which has not fully utilized the structure information of the Web. In this paper, we propose a novel concept for topic distillation, named sub-site retrieval, in which the basic searching unit is sub-site instead of single page. A sub-site is the subset of a website, consisting of a structural collection of pages. The key of sub-site retrieval includes (1) extracting effective features for the representation of a sub-site using both the content and structure information, (2) delivering the sub-site-based retrieval results with a friendly and informative user interface. For the first point, we propose Punished Integration algorithm, which is based on the modeling of the growth of websites. For the second point, we design a user interface to better illustrate the search results of sub-site retrieval. Testing on the topic distillation task of TREC 2003 and 2004, sub-site retrieval leads to significant improvement of retrieval performance over the previous methods based on single pages. Furthermore, time complexity analysis shows that sub-site retrieval can be integrated into the index component of search engines.  相似文献   

19.
陈洁 《情报探索》2020,(2):114-119
[目的/意义]旨在为信息检索相关性研究提供参考。[方法/过程]以CNKI为数据源,采用定性方法,从信息检索的历史脉络和研究学派进行梳理总结,分析信息检索的影响因素和发展趋势。[结果/结论]信息检索相关性是用户、系统的相关性的综合体,任何一方都不能脱离。相关性应该是以用户为关键,系统为基础,研究用户与检索系统的交互、认知以及真实需求的描述与反馈。随着信息检索相关性研究的深入,系统观与用户观将会相互交融,检索技术与用户需求将会协调统一,共同推进检索相关性的发展。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号