共查询到19条相似文献,搜索用时 234 毫秒
1.
[目的/意义]旨在为信息检索相关性研究提供参考。[方法/过程]以CNKI为数据源,采用定性方法,从信息检索的历史脉络和研究学派进行梳理总结,分析信息检索的影响因素和发展趋势。[结果/结论]信息检索相关性是用户、系统的相关性的综合体,任何一方都不能脱离。相关性应该是以用户为关键,系统为基础,研究用户与检索系统的交互、认知以及真实需求的描述与反馈。随着信息检索相关性研究的深入,系统观与用户观将会相互交融,检索技术与用户需求将会协调统一,共同推进检索相关性的发展。 相似文献
2.
基于向量空间模型的信息检索系统的设计 总被引:8,自引:1,他引:7
向量空间检索模型最早提出于1958年,它在信息检索系统中有着良好的性能表现。先对该模型作了介绍,然后在向量空间模型的基础上,根据文献标引、索引项权重的确定、相似度测算这三个步骤设计了一个信息检索系统,总结了该系统要用到的核心技术,并评价了该系统。 相似文献
3.
相关性反馈是应用于信息检索中的一项重要技术。它对于提高信息检索的质量是非常有效的。就用户提问与相关性反馈、标准的相关性反馈界面以及如何获得相关信息等问题进行了探讨。 相似文献
4.
相关性是情报学研究中的一个基本概念。用户在利用网络获取信息的过程中,对信息进行相关性判断是信息选择行为中的核心环节。文章对影响相关性判断的因素进行了分析,认为次序效应对用户判断检索结果相关性有一定的影响;了解这一影响,无论对信息检索系统的设计还是对提高用户的信息检索效率来说都是非常有意义的。 相似文献
5.
6.
个性化信息检索系统的用户模型研究 总被引:1,自引:0,他引:1
分析了传统信息检索系统实现个性化信息检索的必然性,对个性化信息检索系统的信息代理Agent、用户模型等相关问题进行了探讨,提出了一个基于向量空间的个性化信息检索用户模型。 相似文献
7.
8.
相关性是信息学科尤其是信息检索领域的一个相对热门的研究课题。立足于相关性评价,在简要论述研究现状后,着重分析检索结果相关性的两个基本评价指标-查全率和查准率二值改进算法以及基于布尔模型、向量空间模型和概率模型的计算方法,从系统和用户两个角度出发,提出一些提高信息检索相关性计算准确性的改进建议。 相似文献
9.
相关性是衡量信息检索系统性能的重要指标之一,对信息检索相关性进行深入分析对于信息检索系统的有效开发具有重要的理论指导意义.从系统、用户以及当前的网络环境信息检索三个方面分析了信息检索相关性的评价方法及改善策略. 相似文献
10.
应用于信息检索的统计语言模型研究进展 总被引:1,自引:0,他引:1
统计语言模型作为一种自然语言处理的工具,已经被证明有能力处理大规模真实文本.而统计语言模型和IR相结合后所形成的SLM-IR模型的提出,是信息检索模型研究上的重大进展.本文介绍了统计语言模型在信息检索领域的基本模型及相关问题,重点分析了Lemur工具箱和标题语言模型的原理及模型,最后从整体上介绍了该领域的国际动态和研究进展情况. 相似文献
11.
How to merge and organise query results retrieved from different resources is one of the key issues in distributed information retrieval. Some previous research and experiments suggest that cluster-based document browsing is more effective than a single merged list. Cluster-based retrieval results presentation is based on the cluster hypothesis, which states that documents that cluster together have a similar relevance to a given query. However, while this hypothesis has been demonstrated to hold in classical information retrieval environments, it has never been fully tested in heterogeneous distributed information retrieval environments. Heterogeneous document representations, the presence of document duplicates, and disparate qualities of retrieval results, are major features of an heterogeneous distributed information retrieval environment that might disrupt the effectiveness of the cluster hypothesis. In this paper we report on an experimental investigation into the validity and effectiveness of the cluster hypothesis in highly heterogeneous distributed information retrieval environments. The results show that although clustering is affected by different retrieval results representations and quality, the cluster hypothesis still holds and that generating hierarchical clusters in highly heterogeneous distributed information retrieval environments is still a very effective way of presenting retrieval results to users. 相似文献
12.
13.
14.
《Information processing & management》1999,35(2):121-139
It is well-known that relevance feedback is a method significant in improving the effectiveness of information retrieval systems. Improving effectiveness is important since these information retrieval systems must gain access to large document collections distributed over different distant sites. As a consequence, efforts to retrieve relevant documents have become significantly greater. Relevance feedback can be viewed as an aid to the information retrieval task. In this paper, a relevance feedback strategy is presented. The strategy is based on back-propagation of the relevance of retrieved documents using an algorithm developed in a neural approach. This paper describes a neural information retrieval model and emphasizes the results obtained with the associated relevance back-propagation algorithm in three different environments: manual ad hoc, automatic ad hoc and mixed ad hoc strategy (automatic plus manual ad hoc). 相似文献
15.
相关反馈技术在知识检索中的应用 总被引:4,自引:1,他引:4
本文从相关反馈技术的基本原理出发,综合应用信息管理与机器学习,探讨了相关反馈技术在知识检索中的应用模式、相关反馈检索算法和相关反馈学习算法,最后提出了对该领域研究的建议。 相似文献
16.
《Information processing & management》2005,41(4):759-775
Lately there has been intensive research into the possibilities of using additional information about documents (such as hyperlinks) to improve retrieval effectiveness. It is called data fusion, based on the intuitive principle that different document and query representations or different methods lead to a better estimation of the documents' relevance scores.In this paper we propose a new method of document re-ranking that enables us to improve document scores using inter-document relationships. These relationships are expressed by distances and can be obtained from the text, hyperlinks or other information. The method formalizes the intuition that strongly related documents should not be assigned very different weights. 相似文献
17.
In this paper we propose an effective sentence retrieval method that consists of incorporating query-independent features into standard sentence retrieval models. To meet this aim, we apply a formal methodology and consider different query-independent features. In particular, we show that opinion-based features are promising. Opinion mining is an increasingly important research topic but little is known about how to improve retrieval algorithms with opinion-based components. In this respect, we consider here different kinds of opinion-based features to act as query-independent evidence and study whether this incorporation improves retrieval performance. On the other hand, information needs are usually related to people, locations or organizations. We hypothesize here that using these named entities as query-independent features may also improve the sentence relevance estimation. Finally, the length of the retrieval unit has been shown to be an important component in different retrieval scenarios. We therefore include length-based features in our study. 相似文献
18.
本文通过研究注意力与搜索引擎、注意力与网络数据库、注意力与检索结果的相关性,论证了注意力与网络信息检索的互动及其影响。 相似文献
19.
Linda Schamber Michael B. Eisenberg Michael S. Nilan 《Information processing & management》1990,26(6)
Although relevance judgments are fundamental to the design and evaluation of all information retrieval systems, information scientists have not reached a consensus in defining the central concept of relevance. In this paper we ask two questions: What is the meaning of relevance? and What role does relevance play in information behavior? We attempt to address these questions by reviewing literature over the last 30 years that presents various views of relevance as topical, user-oriented, multidimensional, cognitive, and dynamic. We then discuss traditional assumptions on which most research in the field has been based and begin building a case for an approach to the problem of definition based on alternative assumptions. The dynamic, situational approach we suggest views the user — regardless of system — as the central and active determinant of the dimensions of relevance. We believe that relevance is a multidimensional concept; that it is dependent on both internal (cognitive) and external (situational) factors; that it is based on a dynamic human judgment process; and that it is a complex but systematic and measurable phenomenon. 相似文献