首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 765 毫秒
1.
元数据的应用需要开发适于所应用主题领域的规范词表来满足用户的检索需求,但目前对用户用什么词来进行查找却知之甚少。为了了解数字化教育图书馆用户在检索中使用什么样的词来进行查找,本文作者利用检索记录挖掘的方法来进行研究。在初步分析了40多万条检索记录中所含的100多万个检索词之后,作者重点分析了规范词在检索中被用户使用的情况,并且对比分析了哪些非规范词被用户使用.作者发现用户在查找信息的过程中对规范词的使用频率大大超过了非规范词的使用频率。对非规范词使用的进一步分析不仅可以提供补充更新规范词的来源,而且也可以为分析规范词非规范词之间建立对应的浯义关系提供重要的信息来源。  相似文献   

2.
基于RDF的叙词表研究   总被引:19,自引:4,他引:15  
毛军 《情报学报》2003,22(2):163-168
叙词表在网络中的存在以分布式的叙词关系单元为基础 ,在RDF提供的框架中提供对资源对象的标引和检索服务。XML和RDF对叙词的定义区别在语法和语义上 ,将叙词抽象为概念和词汇两个层次 ,概念之间的关系简化为相关和从属。在客户器 服务器模式下介绍RDF模式传递叙词条目的方法。最后介绍RDF方式的叙词和DC(都柏林核心集 )结合和语义网络 (SEMANTICWEB)两个具体应用。  相似文献   

3.
This paper provides an overview of the research into current medical vocabularies and their impact on searching the Web for health information. The Web provides growing opportunities for laypersons to gain knowledge about specific health conditions, though research to date has been incomplete. Many studies have examined aspects of controlled medical vocabularies. Other studies have examined aspects of medical Web searching vocabularies. In this context, there is a growing need to examine more closely laypersons' Web queries using controlled medical vocabularies that were designed to serve the needs of medical professionals. It may be the case that the average consumer of Web health services is not able to use correct medical terminology, and may not be able to choose analogous or synonymous terms from a search result list. Our review suggests a growing need for studies to examine the current applicability of controlled medical vocabularies as well as alternatives to semantic query by Web search engine users.  相似文献   

4.
[目的/意义]ResourceSync是由美国国家标准协会颁布的资源同步框架规范,可用于实现互联网上不同系统之间的资源同步。对该规范的基本原理及其在数字图书馆中的应用场景进行分析,可为ResourceSync在数字图书馆中的实际应用提供借鉴。[方法/过程]从框架结构、同步过程、资源发现机制3个方面分析ResourceSync的基本原理,在此基础上描述ResourceSync在数字图书馆中的3个应用场景,并对实际应用时需要注意的4个问题进行说明。[结果/结论]数字图书馆可以通过ResourceSync支持的3种发现机制来发布或获取资源列表、资源仓库、变化列表、变化仓库,并经过基准同步、增量同步、审校等操作步骤来实现数字图书馆与镜像站点、资源提供者、资源使用者之间的资源同步。  相似文献   

5.
Perhaps the greatest power of folksonomies, especially when set against controlled vocabularies like the Library of Congress Subject Headings, lies in their capacity to empower user communities to name their own resources in their own terms. This article analyzes the potential and limitations of both folksonomies and controlled vocabularies for transgender materials by analyzing the subject headings in WorldCat records and the user-generated tags in LibraryThing for books with transgender themes. A close examination of the subject headings and tags for twenty books on transgender topics reveals a disconnect between the language used by people who own these books and the terms authorized by the Library of Congress and assigned by catalogers to describe and organize transgender-themed books. The terms most commonly assigned by users are far less common or non-existent in WorldCat. The folksonomies also provide spaces for a multiplicity of representations, including a range of gender expressions, whereas these entities are often absent from Library of Congress Subject Headings and WorldCat. While folksonomies are democratic and respond quickly to shifts and expansions of categories, they lack control and may inhibit findability of resources. Neither tags nor subject headings are perfect systems by themselves, but they may complement each other well in library catalogs. Bringing users’ voices into catalogs through the addition of tags might greatly enhance organization, representation, and retrieval of transgender-themed materials.  相似文献   

6.
DC词表的维护:实践、策略与模型   总被引:2,自引:1,他引:2  
DCMI维护着一个包含众多元数据术语的词表,其中最主要的是包括15个元素的都柏林核心。然而,DCMI所关注和讨论的并不只是这个核心集,以及围绕这个核心集的实践、策略和模型,DC元数据模型已经由最初的单一模型进化成一个完整的框架,可以通过这个框架对应用DC元数据的各种实践进行评估。这些术语及其以前的版本现在都由统一资源标识符(URI)来标识,并建立了网页和形式化的schema文档,在注册系统中被编成索引,在具体的应用纲要(application profiles)中被引用。DCMI正在寻求一种简单易行的维护机制,一方面推荐使用不断变化和增长的核心标准,另一方面承认和重用其它的词表作为必要的补充,这两者之间需要建立一种平衡。  相似文献   

7.
Scientific repositories create a new environment for studying traditional information science issues. The interaction between indexing terms provided by users and controlled vocabularies continues to be an area of debate and study. This article reports and analyzes findings from a study that mapped the relationships between free text keywords and controlled vocabulary terms used in the sciences. Based on this study's findings recommendations are made about which vocabularies may be better to use in scientific data repositories.  相似文献   

8.
调查了Taxonomy Warehouse中医学受控词表的数量、规模、编制单位、学科分布、语种、应用等情况,介绍了UMLS、MeSH及ICD等主流医学词表的发展概况,认为语义网环境下,国外医学受控词表呈现出本体转化、智能更新与应用、用户协同编制、动态集成与分解、发布为关联数据等趋势。  相似文献   

9.
[目的/意义]以公共图书馆、博物馆、美术馆和群众艺术馆数字资源整合为例,探讨通过赋予受控词汇"身份",提高资源检索的效率的方法。[方法/过程]定义多维坐标系统空间面、主题坐标轴和坐标点;通过为词汇概念赋予标识符,建立概念与词汇的关联,按一定规则为词汇赋予"身份",以概念优选机制、关联数据技术与索引表构建作为其辅助。此外,通过解析用户检索词语义,构建语义标识符,并对概念标识符进行拆分、组合,利用测算标识符点距的方法建立语义标识符与概念标识符之间的映射关系,实现检索维度优选。[结果/结论]多维坐标系统的坐标关系模型以"面-线-点"的坐标关系处理层次为基础,以"概念定位-词汇定位-资源定位"的检索层次为依据,并结合优选、关联与索引,拆分、组合与点距等相关实现机制,通过量化方法来处理词汇关系,能够提高机器对词汇的理解。  相似文献   

10.
因特网已成为全球性的信息环境,我们可以跨越时空获取信息资源。然而,当用户使用自身社区的术语和元数据方案来检索信息时,现在的因特网技术还没有成熟到能提供专业信息设施。元数据担任很重要的角色来帮助实现社区导向的信息环境。本文进行了两项案例研究,包括建立专业主题词表(用于图书馆和图书馆信息科学资源主题网关(LIS)的核心主题词表)和用于地方社区门户网站的主题词表。从这两个案例可以看出小型的主题词表对专业服务是非常有用的,并且维护工作对词表的建立和应用是至关紧要的。为了在网络上建立专业信息环境,我们必须解决对元数据方案来说相悖的两个需求——社区中的专业化(或者本地化)和社区之间的互操作。这篇文章提供了一个概念模型来理解解决这个问题所要涉及的诸多方面。  相似文献   

11.
基于控制词集的中文信息动态自动聚类研究   总被引:1,自引:0,他引:1  
以专用词典为切分工具,建立以概念为基础的、具有主题分类特点的类目结构是中文信息动态自动聚类的一种适用方式。该文探索了基于控制词集的中文信息动态自动聚类技术,包括专用控制词集的构建,动态有限环境下的自动聚类程序、聚类算法,以及结合控制词集对聚类结果进行优化控制等,最后对实验结果给出了概要评价。  相似文献   

12.
受控词表的术语服务研究   总被引:2,自引:0,他引:2  
范炜 《图书情报工作》2012,56(14):34-39,97
明确受控词表在词汇控制和语义关联两方面的作用,阐释基于受控词表的术语服务内涵和意义,提出术语服务生命周期,抽象出术语服务的三层架构,在此基础上进行Web service设计,给出基础API服务集。最后对术语服务研究应用的一些重要问题进行相关讨论。  相似文献   

13.
There have been ample suggestions in the literature that terms added to documents from Flickr and Wikipedia can complement traditional methods of indexing and controlled vocabularies. At the same time, adding new metadata to existing metadata objects may not always add value to those objects. The potential added-value of using user-contributed (“social”) terms from Flickr and the English Wikipedia in image indexing is compared with using two expert-created controlled vocabularies—the Thesaurus for Graphic Materials and the Library of Congress Subject Headings—without those social terms. Experiments confirmed that the social terms did add value, relative to terms from the controlled vocabularies. The median rating for the usefulness of social terms was significantly higher than the baseline rating, but was lower than the ratings for the terms from the Thesaurus for Graphic Materials and the Library of Congress Subject Headings. Furthermore, complementing the controlled vocabulary terms with social terms more than doubled the average coverage of participants' terms for a photograph. The relationships between user demographics and users' perceptions of the value of terms were also investigated, as well as the relationships between user demographics and indexing quality, as measured by the number of terms participants assigned to a photograph. Participants with more tagging and indexing experience assigned a greater number of tags than did other participants.  相似文献   

14.
互联网的信息代理技术   总被引:1,自引:0,他引:1  
互联网用户要求为其在互联网的多种可利用的信息资源中查找、获得和维护所需相关信息提供先进的解决方案。智能的信息代理是自动计算的软件实体,可提供动态的信息搜寻,解决信息需求者和信息提供者之间的理解差异,并能及时处理困扰用户的“信息过载”方面的问题。本文在根据信息代理的特点进行系统分类的基础上,介绍有关构建信息智能代理的基本技术。  相似文献   

15.
Folksonomy与受控词汇在OPAC的应用研究   总被引:1,自引:0,他引:1  
主题表、叙词表等传统受控词汇形式的信息组织工具在Web2.0环境下凸显出缺陷,影响联机公共目录查询系统的检索质量。论文对分众分类法和受控词汇的优缺点进行分析,认为两者可以很好地互补,并且提出了一个应用在联机公共目录查询系统的可行性模式。  相似文献   

16.
在现有国内外医学检索语言研究的基础上,构建了一个跨语种一体化医学检索语言系统.通过中、英文医学词汇、中图法等级结构和MeSH树状结构及其相互之间的对应转换,该系统为医学信息检索提供更深层的医学知识导航及其跨语言检索接口.  相似文献   

17.
This study analyzed observations and interviews of 31 participants, who were divided into six age groups, to understand the influence of end-user goals and experience on Internet search approaches. Users who lacked experience approached the Internet similarly no matter what the age group. Children and older adults were more likely to lack online search experience than other users. In addition, children and older adults were more homogeneous than other users in that they had a narrow range of situational goals, whereas users in other groups had a wide range of situational goals. The study has implications for user services and research in end-user searching. An understanding of the influence of age, experience, and goals on Internet search patterns might guide how, how much, and in what format information should be presented in the future. Knowledge gained from this study can also form the basis of hypotheses for larger studies.  相似文献   

18.
Ensuring quick and consistent access to large collections of unstructured documents is one of the biggest challenges facing knowledge-intensive organizations. Designing specific vocabularies to index and retrieve documents is often deemed too expensive, full-text search being preferred despite its known limitations. However, the process of creating controlled vocabularies can be partly automated thanks to natural language processing and machine learning techniques. With a case study from the biopharmaceutical industry, we demonstrate how small organizations can use an automated workflow in order to create a controlled vocabulary to index unstructured documents in a semantically meaningful way.  相似文献   

19.
Abstract

Music catalogers have embraced in current cataloging the flexibility, intuitiveness, and expressivity afforded by the Library of Congress’s new faceted vocabularies. In order to realize the full benefits to discovery these vocabularies offer, however, methods for programmatically assigning faceted terms retrospectively to existing metadata for music resources are required in order to mitigate the costs of manual enhancement of legacy data. This article describes efforts to repurpose existing Library of Congress Subject Headings as faceted data, taking advantage of recent developments in the MARC 21 formats. Limitations inherent to this process and ideas for future directions are described.  相似文献   

20.
黄倩 《情报学报》2005,24(6):713-717
Internet的发展使互联网成为一个巨大的信息库,但是信息的获取质量却停滞不前。传统的搜索引擎大都基于关键字机械匹配,因而不具备理解文档内容的能力,导致查询效率普遍不高。由于计算机可以理解RDF描述和携带的元数据的含义,因此可以做到基于内容的精确检索。为此,本文提出一种基于RDF的科技论文搜索引擎的设计与实现方案。该方案包括含词汇集设计、RDF描述生成工具、运行于服务器端的RDF描述信息收集和解析程序、基于词汇集的查询四个模块。查询结果为搜索网页的元数据显示,从而使用户对查询结果有准确的了解,可大大提高用户的搜索效率。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号