The problem of language in Web searching has been discussed primarily in the area of cross-language information retrieval (CLIR). However, much CLIR research centers on investigating the effectiveness of automatic translation techniques. The case study reported here explored bilingual user behaviors, perceptions, and preferences with respect to the capability of the Web as a multilingual information resource. Twenty-eight bilingual academic users from Myongji University in Korea were recruited for the study. Findings show that the subjects did not use Web search engines as multilingual tools. For search queries, they selected the language that represented their information need most accurately, depending on the type of information task, rather than choosing their first language. Subjects expressed concerns about the accuracy of machine translation of scholarly terminology and preferred to have user control over multilingual Web searches.

This article summarizes the author's dissertation regarding search strategies of millennial undergraduate students in Web and library online information retrieval systems. Millennials bring a unique set of search characteristics and strategies to their research since they have never known a world without the Web. Through the use of search engines, they have come to expect that searches can be completed using an easy, single search box that produces complete articles on just about any topic. However, the reliability and quality of much of the information on the Web is questionable. Conversely, information that resides in library information retrieval systems is often of high scholarly content, but there is not a single, user-friendly system for millennials to use. A case study investigation was conducted at the State University of New York at Oswego, a public undergraduate university in Central New York. The goal of the study was to identify the search strategies of a sampling of millennial undergraduate students in both Web information retrieval systems and library information retrieval systems in order to expand upon existing theories about millennials and to suggest interface design enhancements, which could make online library information retrieval systems easier for millennials to use. The findings recommend improving database search functionality, such as natural-language searching, spell-checking, and uniform full-text wording and placement. Results also suggested that increasing the availability of full-text articles would contribute to millennials’ use of library information retrieval systems.

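Web information retrieval research applies the results of text retrieval research and, combined with ideas from Web graph theory, studies information retrieval on the Web; it is an effective route to Web knowledge discovery. The traditional HITS method yields information of rather low precision, while PageRank, as a general-purpose search method, cannot be applied to retrieving information on a specific topic. Building on a thorough analysis of existing algorithms such as PageRank and HITS and of similarity measures for Web documents, the paper proposes RG-HITS, an algorithm for discovering information relevant to a specific topic on the Web. It combines Web hyperlinks, the information relevance derived from the knowledge representation of Web pages, and the HITS method to search the Web for knowledge related to a specific topic.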
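For readers unfamiliar with the baseline this abstract builds on, the following is a minimal sketch of plain HITS (hub and authority scoring over a link graph). It is not RG-HITS, whose relevance weighting the abstract does not specify; the function names, the toy graph, and the fixed iteration count are illustrative assumptions.

```python
import math


def normalize(scores):
    """Scale scores to unit Euclidean length (leave all-zero scores unchanged)."""
    norm = math.sqrt(sum(v * v for v in scores.values())) or 1.0
    return {k: v / norm for k, v in scores.items()}


def hits(graph, iterations=50):
    """Plain HITS. graph maps each page to the list of pages it links to."""
    nodes = set(graph) | {t for targets in graph.values() for t in targets}
    hubs = {n: 1.0 for n in nodes}
    auths = {n: 1.0 for n in nodes}
    for _ in range(iterations):
        # Authority score: sum of the hub scores of pages linking to the page.
        new_auths = {n: 0.0 for n in nodes}
        for src, targets in graph.items():
            for dst in targets:
                new_auths[dst] += hubs[src]
        # Hub score: sum of the authority scores of the pages it links to.
        new_hubs = {
            n: sum(new_auths[d] for d in graph.get(n, [])) for n in nodes
        }
        auths = normalize(new_auths)
        hubs = normalize(new_hubs)
    return hubs, auths


# Hypothetical three-page graph: pages "a" and "b" both link to "c".
toy_graph = {"a": ["c"], "b": ["c"], "c": []}
hubs, auths = hits(toy_graph)
```

A topic-specific variant such as the one the abstract describes would presumably weight links or pages by content relevance before or during this iteration; that step is deliberately left out here because the abstract gives no details.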

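Web information searching behavior is closely tied to users' daily lives. User-cognition-oriented Web information searching applies the cognitive viewpoint and social cognitive theory to Web information searching, and is a retrieval paradigm distinct from both traditional information retrieval and user-oriented information retrieval. After introducing user-cognition-oriented information searching models, including the cognitive interaction model, the information problem-solving model, and the model of Web information searching behavior using search engines, the paper analyzes the factors that influence the Web information searching process in terms of user factors, the information environment, and the social context.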

This project investigated how academic users search for information on their real-life research tasks. This article presents the findings of the first of two studies. The study data were collected at the Queensland University of Technology (QUT) in Brisbane, Australia. Eleven PhD students' searching behaviors on personal research topics were observed as they interacted with information retrieval (IR) systems. The analysis of search logs uncovered the characteristics of research tasks and the corresponding search strategies.

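Information Retrieval Visualization   Cited by: 6 (self-citations: 0, citations by others: 6)
The development of Java and WWW browser technology has made Web-based information visualization possible. Visualization refers to the theories, methods, and techniques that use computer graphics, image processing, and human-computer interaction to convert data into graphics or images displayed on screen and to process them interactively. Information retrieval visualization is the application of data visualization to information retrieval: users interact with a networked information retrieval system through a graphical interface, assess the results of each search during the retrieval process, and refine their questions or queries, thereby improving recall and precision. This paper mainly discusses the principles of human-computer interface design and information retrieval visualization techniques, and introduces several information retrieval visualization models.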

Social tagging systems have gained increasing popularity as a method of annotating and categorizing a wide range of different web resources. Web search that utilizes social tagging data suffers from an extreme example of the vocabulary mismatch problem encountered in traditional information retrieval (IR). This is due to the personalized, unrestricted vocabulary that users choose to describe and tag each resource. Previous research has proposed the utilization of query expansion to deal with search in this rather complicated space. However, non-personalized approaches based on relevance feedback and personalized approaches based on co-occurrence statistics only showed limited improvements. This paper proposes a novel query expansion framework based on individual user profiles mined from the annotations and resources the user has marked. The underlying theory is to regularize the smoothness of word associations over a connected graph using a regularizer function on terms extracted from top-ranked documents. The intuition behind the model is the prior assumption of term consistency: the most appropriate expansion terms for a query are likely to be associated with, and influenced by, terms extracted from the documents ranked highly for the initial query. The framework also simultaneously incorporates annotations and web documents through a Tag-Topic model in a latent graph. The experimental results suggest that the proposed personalized query expansion method can produce better results than both the classical non-personalized search approach and other personalized query expansion methods. Hence, the proposed approach significantly benefits personalized web search by leveraging users’ social media data.

Transaction logs from online search engines are valuable for two reasons: First, they provide insight into human information-seeking behavior. Second, log data can be used to train user models, which can then be applied to improve retrieval systems. This article presents a study of logs from PubMed®, the public gateway to the MEDLINE® database of bibliographic records from the medical and biomedical primary literature. Unlike most previous studies on general Web search, our work examines user activities with a highly-specialized search engine. We encode user actions as string sequences and model these sequences using n-gram language models. The models are evaluated in terms of perplexity and in a sequence prediction task. They help us better understand how PubMed users search for information and provide an enabler for improving users’ search experience.
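The idea of encoding user actions as strings and scoring them with an n-gram model can be sketched as follows. This is only an illustration under stated assumptions: the single-letter action codes ("Q" for a query, "A" for viewing an abstract), the bigram order, and the add-one smoothing are all hypothetical choices, not details taken from the PubMed study.

```python
import math
from collections import Counter

START, END = "^", "$"  # hypothetical session boundary symbols


def train_bigram(sequences):
    """Count bigrams over action strings, one character per user action."""
    context_counts, bigram_counts, vocab = Counter(), Counter(), set()
    for seq in sequences:
        padded = START + seq + END
        vocab.update(padded)
        for prev, cur in zip(padded, padded[1:]):
            context_counts[prev] += 1
            bigram_counts[(prev, cur)] += 1
    return context_counts, bigram_counts, len(vocab)


def perplexity(model, seq):
    """Perplexity of one action string under an add-one smoothed bigram model."""
    context_counts, bigram_counts, vocab_size = model
    padded = START + seq + END
    log_prob = 0.0
    for prev, cur in zip(padded, padded[1:]):
        # Add-one smoothing keeps unseen transitions from having zero probability.
        p = (bigram_counts[(prev, cur)] + 1) / (context_counts[prev] + vocab_size)
        log_prob += math.log2(p)
    transitions = len(padded) - 1
    return 2 ** (-log_prob / transitions)


# Hypothetical sessions: "Q" = issue a query, "A" = view an abstract.
model = train_bigram(["QAQA", "QAA", "QQA"])
typical = perplexity(model, "QAQA")
unusual = perplexity(model, "AAQQ")
```

Lower perplexity means the model finds a session less surprising, so with these toy sessions a query-then-abstract pattern scores lower than a session that starts with abstract views.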

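Evaluation and Comparison of Chinese and English Web Search Tools   Cited by: 10 (self-citations: 1, citations by others: 9)
Two Web search tools, AltaVista and Sohu, were each tested with 10 search questions drawn from actual reference inquiries, and were then assessed and compared against evaluation criteria covering five aspects: composition of the index database, search functions, search effectiveness, display of search results, and user burden. Because the total amount of relevant information on the Web cannot be estimated, the paper omits the calculation of recall R, but where needed adopts two new indicators, the duplicate rate Rr and the dead-link rate Rd, as criteria for evaluating search effectiveness. On the basis of this evaluation, the paper analyzes where Chinese search tools fall short, and how evaluation criteria should differ according to the characteristics of Chinese and English search tools.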
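The abstract names a duplicate rate and a dead-link rate but does not define them. The sketch below assumes each is simply a fraction of the returned results (duplicates over total, dead links over total) and adds precision in the same style; the record fields and example URLs are hypothetical.

```python
def evaluate_results(results):
    """Compute result-list metrics.

    results: list of dicts with keys 'url' (str), 'relevant' (bool),
    and 'alive' (bool). All three rates are fractions of returned results,
    which is an assumed definition rather than the paper's.
    """
    total = len(results)
    if total == 0:
        return {"precision": 0.0, "duplicate_rate": 0.0, "dead_link_rate": 0.0}

    seen = set()
    duplicates = 0
    for record in results:
        # A result counts as a duplicate if its URL already appeared earlier.
        if record["url"] in seen:
            duplicates += 1
        else:
            seen.add(record["url"])

    relevant = sum(1 for record in results if record["relevant"])
    dead = sum(1 for record in results if not record["alive"])
    return {
        "precision": relevant / total,
        "duplicate_rate": duplicates / total,
        "dead_link_rate": dead / total,
    }


# Hypothetical judged result list for one query.
sample = [
    {"url": "http://example.com/a", "relevant": True, "alive": True},
    {"url": "http://example.com/b", "relevant": True, "alive": False},
    {"url": "http://example.com/a", "relevant": True, "alive": True},
    {"url": "http://example.com/c", "relevant": False, "alive": True},
]
metrics = evaluate_results(sample)
```

Recall is absent for the reason the abstract gives: it needs the total number of relevant documents on the Web, which cannot be estimated.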

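Ontology-Based Personalized Retrieval   Cited by: 4 (self-citations: 0, citations by others: 4)
Most current search tools are designed for all users and do not consider an individual user's particular information needs. This paper proposes an ontology-based personalized retrieval method that automatically learns from the history of a user's queries, builds a user interest model, and uses it to infer the real intent behind the user's new questions, meeting the user's particular information needs. The method is suited to intelligent information retrieval in specific Internet domains, for specific user groups, and on enterprise networks.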

In the field of information retrieval (IR), researchers and practitioners are often faced with a demand for valid approaches to evaluate the performance of retrieval systems. The Cranfield experiment paradigm has been dominant for the in-vitro evaluation of IR systems. Alternative to this paradigm, laboratory-based user studies have been widely used to evaluate interactive information retrieval (IIR) systems, and at the same time investigate users’ information searching behaviours. Major drawbacks of laboratory-based user studies for evaluating IIR systems include the high monetary and temporal costs involved in setting up and running those experiments, the lack of heterogeneity amongst the user population and the limited scale of the experiments, which usually involve a relatively restricted set of users. In this paper, we propose an alternative experimental methodology to laboratory-based user studies. Our novel experimental methodology uses a crowdsourcing platform as a means of engaging study participants. Through crowdsourcing, our experimental methodology can capture user interactions and searching behaviours at a lower cost, with more data, and within a shorter period than traditional laboratory-based user studies, and therefore can be used to assess the performances of IIR systems. In this article, we show the characteristic differences of our approach with respect to traditional IIR experimental and evaluation procedures. We also perform a use case study comparing crowdsourcing-based evaluation with laboratory-based evaluation of IIR systems, which can serve as a tutorial for setting up crowdsourcing-based IIR evaluations.

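Personalized Retrieval Based on User Interests   Cited by: 8 (self-citations: 0, citations by others: 8)
Most current search tools are designed for all users and do not consider an individual user's interests and preferences. This paper proposes a personalized retrieval method based on user interests. The method automatically learns from the history of a user's queries, builds a user interest model, and uses it to infer the real intent behind the user's new questions. Experimental results show that the method is better suited to retrieval with keywords that span multiple categories, and that it can improve retrieval precision.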

An instruction program that covers search tools on the Web should incorporate a user's information need as the basis for teaching these tools. This article lays the theoretical groundwork for such a query-based approach, deriving its argument from the literature published in the early days of databases in libraries. A review of recent literature as well as tutorials on the Web reveals that librarians do not routinely consider a user's query in their training in the use of Web-based search tools. Other training elements are more prevalent, in particular the nature of search tools, search strategies and the evaluation of results found in a search. When training recommends a context for searching the Web, the advice tends to be of a general nature. These results suggest that librarians may not routinely be teaching queries as a strategy for selecting and using search tools on the Web. Suggestions for further research are presented to explore these findings.

The number of Web users whose first language is not English continues to grow, as does the amount of content provided in languages other than English. This poses new challenges for actors on the Web, such as in which language(s) content should be offered, how search tools should deal with mono- and multilingual content, and how users can make the best use of navigation and search options suited to their individual linguistic skills. How should these challenges be dealt with? Technological approaches to non-English (or in general, cross-language) Web search have made considerable progress; however, translation remains a hard problem. This precludes a low-cost but high-quality blanket all-language coverage of the whole Web. In this paper, we propose a user-centric approach to answering questions of where to best concentrate efforts and investments. Drawing on linguistic research, we describe data on the availability of content and access to it in first and second languages across the Web. We then present three studies that investigated the impact of the availability (or not) of first-language content and access forms on user behaviour and attitudes. The results indicate that non-English languages are under-represented on the Web and that this is partly due to content-creation, link-setting and link-following behaviour. They also show that user satisfaction is influenced both by the cognitive effort of searching and the availability of alternative information in that language. These findings suggest that more cross-language tools are desirable. However, they also indicate that context (such as user groups’ domain expertise or site type) should be considered when tradeoffs between information quality and multilinguality need to be taken into account.

BioSYNTHESIS is a prototype intelligent retrieval system under development as part of the IAIMS project at Georgetown University. The aim is to create an integrated system that can retrieve information located on disparate computer systems. The project work has been divided into two phases: BioSYNTHESIS I, development of a single menu to access various databases which reside on different computers; and BioSYNTHESIS II, development of a search component that facilitates complex searching for the user. BioSYNTHESIS II will accept a user's query and conduct a search for appropriate information in the IAIMS databases at Georgetown. For information not available at Georgetown, such as full text, it will access selected remote systems and translate the search query as appropriate for the target system. The search through various computer systems and different databases with unique storage and retrieval structures will be transparent to the user. BioSYNTHESIS I is complete and available to users. The design work for BioSYNTHESIS II is under development and will continue as a multiyear technical research effort of the proposed Georgetown IAIMS implementation project.

New Mexico State University's Computing Research Lab has participated in research in all three phases of the US Government's Tipster program. Our work on information retrieval has focused on research and development of multilingual and cross-language approaches to automatic retrieval. The work on automatic systems has been supplemented by additional research into the role of the IR system user in interactive retrieval scenarios: monolingual, multilingual and cross-language. The combined efforts suggest that universal text retrieval, in which a user can find, access and use documents in the face of language differences and information overload, may be possible.

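[Purpose/Significance] To address the needs of the new generation of users for information retrieval systems, this paper proposes a theoretical model of a gamified information retrieval system, with the goals of stimulating users' interest in using the retrieval system, supporting their information retrieval and interaction, and encouraging continued use. [Method/Process] Drawing on the basic theory of gamification, related frameworks, and the mechanisms of information retrieval systems, different game elements are combined and, taking into account the relationships between game elements and rules, modules with specific functions are designed so that game elements can be applied in a non-game context. [Results/Conclusions] To build the theoretical model, 20 game elements are identified and combined by function into 12 types of game modules, comprising 5 simple modules and 7 compound modules, which give the information retrieval system game functions. The proposed construction approach and theoretical model address gaps in current research on gamified information retrieval and provide a theoretical framework for developing gamified information retrieval systems and for subsequent related research.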

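A Review of Research Progress in Web Retrieval Systems   Cited by: 3 (self-citations: 0, citations by others: 3)
The large volume of heterogeneous, distributed, and dynamic information on the Web has caused "information overload". How to carry out Web-oriented information retrieval on the basis of traditional information retrieval techniques has become an important research topic. The author reviews Web retrieval technology of recent years, giving a fairly comprehensive overview of developments in the field in terms of the classification of Web retrieval systems, Web information acquisition, technologies related to Web information retrieval, and Web retrieval methods, and offers an outlook on future trends in Web retrieval systems.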

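[Purpose/Significance] With the rapid development of big data and artificial intelligence, research on user profiles has quietly emerged. Reviewing and organizing the current state of this research is of theoretical and practical significance for the long-term development of user profiling. [Method/Process] Based on the China National Knowledge Infrastructure (CNKI) database and the Web of Science database, visualization tools such as CiteSpace and VOSviewer are used to analyze quantitatively the publication volume, research institutions, cited journals, research hotspots, and hotspot density of current user profile research in library and information science. Mind maps are then used to describe in detail the research topics, algorithms and techniques, model construction, and practice and applications of user profiling in library and information science in China and abroad. [Results/Conclusions] As an emerging research tool, user profiling is still relatively weakly researched in China and leaves considerable room for exploration. Future research should focus on broadening the research topics of user profiling, emphasizing behavioral analysis, and optimizing research models.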

Document Recommender Systems: A Way to Improve Information Retrieval Efficiency   Cited by: 2 (self-citations: 0, citations by others: 2)
Traditional information retrieval (IR) systems have limitations in improving search performance in today’s information environment. They tend toward high recall and poor precision, and their results are only as good as the accuracy of the search query, which is usually difficult for the user to construct. It is also time-consuming for the user to evaluate each search result. The recommendation techniques developed since the early 1990s help solve these problems. This paper explains the basic process and major elements of document recommender systems, especially the two recommendation techniques of content-based filtering and collaborative filtering. It also discusses the evaluation issue and the problems that current document recommender systems face, which need to be taken into account in future system designs.
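The two recommendation techniques named in this abstract can be contrasted with a small sketch. It assumes sparse dictionaries for term weights and ratings, cosine similarity as the matching function, and a simple similarity-weighted sum for the collaborative score; none of these choices comes from the paper, and all names and data are hypothetical.

```python
import math


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(a[k] * b[k] for k in set(a) & set(b))
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def content_based(profile, docs):
    """Content-based filtering: rank documents by how closely their term
    weights match the user's own interest profile."""
    return sorted(docs, key=lambda d: cosine(profile, docs[d]), reverse=True)


def collaborative(target, ratings):
    """Collaborative filtering: score documents the target user has not
    rated using ratings from other users, weighted by user similarity."""
    scores = {}
    for user, their_ratings in ratings.items():
        if user == target:
            continue
        similarity = cosine(ratings[target], their_ratings)
        for doc, rating in their_ratings.items():
            if doc not in ratings[target]:
                scores[doc] = scores.get(doc, 0.0) + similarity * rating
    return sorted(scores, key=scores.get, reverse=True)


# Hypothetical data: term weights per document and ratings per user.
profile = {"retrieval": 1.0, "ranking": 1.0}
docs = {
    "d1": {"retrieval": 1.0, "ranking": 1.0},
    "d2": {"cooking": 1.0},
}
ratings = {
    "u1": {"d1": 5, "d2": 4},
    "u2": {"d1": 5, "d2": 4, "d3": 5},
    "u3": {"d4": 5},
}
by_content = content_based(profile, docs)
by_peers = collaborative("u1", ratings)
```

The content-based function needs only the documents' own text features, while the collaborative function needs no text at all and relies entirely on what similar users rated.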
