Found 18 similar documents; search took 765 ms
1.
2.
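XML Retrieval Systems and Their Comparative Study* Total citations: 2, self-citations: 0, citations by others: 2
Discusses how XML retrieval differs from traditional information retrieval, the goals and tasks of XML retrieval, and the core research problems of XML retrieval systems, then introduces and compares several existing XML retrieval systems.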
3.
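Introduces the core problems of a unified retrieval platform for digital libraries. After introducing XML and Web Services, it proposes an overall model for unified digital library retrieval that combines the two, and presents an empirical study of how to build a unified retrieval platform with this model.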
4.
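Addresses structured retrieval of XML from an information retrieval perspective. Using an inverted-file approach with NEXI as the query language, the method is implemented on WHU-XML, an experimental XML-based digital library retrieval system; the paper analyzes in detail how the query language is parsed and which structured retrieval algorithm is used.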
5.
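Liu Dan. 《现代图书情报技术》, 2010, 26(5): 50-57
Applies XML text retrieval methods to long documents, using Chinese doctoral and master's theses as the data set. XML markup, indexing, keyword retrieval, and structured retrieval are each designed and implemented for this data set, yielding an XML-based retrieval system for Chinese doctoral and master's theses.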
6.
7.
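Unlike traditional information retrieval, XML retrieval must work at the element level, and its core is the construction of an element-level retrieval model. The relevance of contextual elements within an XML document, the duplication of information across elements, and the uneven size of elements are the core problems in building such a model. The proposed solution is to build an element-level XML retrieval model based on BM25, build a context-based element-level XML retrieval model (BM25E), filter out duplicate elements, select retrievable elements, and handle elements that are too small. 1 table, 1 figure, 19 references.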
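The abstract does not give the BM25E formula, so the following is only a minimal sketch of the general idea: plain BM25 applied with XML elements (rather than whole documents) as the retrieval unit, plus a simple size cutoff for elements that are too small. The function names, the path-to-tokens input format, and the `min_tokens` threshold are assumptions made for illustration, not the authors' model.

```python
import math
from collections import Counter


def bm25_element_scores(elements, query_terms, k1=1.2, b=0.75):
    """Score each XML element against the query with standard BM25.

    `elements` maps an element path to its token list (hypothetical format).
    """
    n = len(elements)
    avg_len = sum(len(toks) for toks in elements.values()) / n
    # Element frequency: number of elements containing each query term.
    ef = {t: sum(1 for toks in elements.values() if t in toks)
          for t in query_terms}
    scores = {}
    for path, toks in elements.items():
        tf = Counter(toks)
        score = 0.0
        for t in query_terms:
            if tf[t] == 0:
                continue
            idf = math.log(1 + (n - ef[t] + 0.5) / (ef[t] + 0.5))
            # Length normalisation uses the element length, not document length.
            norm = tf[t] + k1 * (1 - b + b * len(toks) / avg_len)
            score += idf * tf[t] * (k1 + 1) / norm
        scores[path] = score
    return scores


def drop_small_elements(scores, elements, min_tokens=3):
    """Discard elements shorter than `min_tokens` (an assumed cutoff)."""
    return {p: s for p, s in scores.items() if len(elements[p]) >= min_tokens}


example = {
    "/article/title": ["xml", "retrieval"],
    "/article/sec[1]": ["xml", "retrieval", "model", "bm25", "element"],
    "/article/sec[2]": ["image", "index", "database", "storage"],
}
ranked = drop_small_elements(
    bm25_element_scores(example, ["xml", "bm25"]), example)
```

Context propagation between parent and child elements and overlap filtering, both mentioned in the abstract, are deliberately left out of this sketch.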
8.
9.
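There are many ways to extract, store, convert, and display the feature values of image objects; the SIMIIRS system mainly uses a database approach and an XML approach. The article focuses on describing image resources in XML, building XML index documents for image information, and searching those XML documents to query and deliver image information.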
10.
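Traditional keyword retrieval is widely used for plain text and HTML documents, but it performs poorly on XML documents. This paper therefore introduces an improved genetic algorithm to study keyword retrieval over XML documents, and proposes an adaptive genetic training algorithm for XML document tags together with algorithms for semantic keyword retrieval and result ranking over XML documents.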
11.
This paper presents a theoretical methodology to evaluate XML retrieval systems and their filters. Theoretical evaluation is concerned with the formal investigation of qualitative properties of retrieval models. XML retrieval deals with retrieving those document components that specifically answer a query, and filters are a method of delivering the most focussed answers. Our theoretical evaluation critically analyzes how filters achieve this.
12.
This paper addresses the difficulty of retrieving precise information from large full-text repositories of magazine articles, and proposes an Extensible Markup Language (XML) vocabulary for improving retrieval rates. The hypothesis tested was as follows: magazine articles marked up with an XML vocabulary and indexed only by selected parts give more precise search results than the same search using a full-text index. The study was exploratory, with the following characteristics: 29 magazine articles were tested for results, and 8 scholars were interviewed to define 23 search strategies and evaluate results. The data showed that precision improved from 40.72% with full-text search to 62.84% using XML markup and searching only in specific labels. The library and information science community needs to revise the vocabulary and carry out more testing in order to obtain a valid vocabulary and provide more research results. The cultural characteristics and politics of the community of librarians and information managers are as important as technical issues if any technical proposal is to be implemented successfully and achieve interoperability.
13.
Content-oriented XML retrieval approaches aim at a more focused retrieval strategy: instead of retrieving whole documents, they should retrieve document components that are exhaustive with respect to the information need while being as specific as possible. In this article, we show that the evaluation methods developed for standard retrieval must be modified in order to deal with the structure of XML documents. More precisely, the size and overlap of document components must be taken into account. For this purpose, we propose a new effectiveness metric based on a concept space defined upon the notions of exhaustiveness and specificity of a search result. We compare the results of this new metric with the results obtained with the official metric used in INEX, the evaluation initiative for content-oriented XML retrieval.
Gabriella Kazai
14.
The majority of today's scholarly papers are authored in Microsoft Word. Some of those papers include simple and/or complex math. Authors have at their disposal multiple means to insert equations in Word documents, including several of Word's native equation editors and third‐party applications, such as Design Science's MathType. Building workflows that smoothly and accurately transform all of these formats into the appropriate XML markup for use in multiple rendering environments has many challenges. This paper clarifies the different forms of equations that can be encountered in Word documents and discusses the issues and idiosyncrasies of converting these various forms to MathML, LaTeX, and/or images in the JATS XML model. It also touches on workflow alternatives for handling equations in various rendering environments and how those downstream requirements may affect the means of equation extraction from Word documents.
15.
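Statistics on our university's use of the Chinese Scientific and Technical Papers and Citations Database (CSTPC) from 1996 to 1998 show a user satisfaction rate below 10%, with poor retrieval effectiveness and low retrieval efficiency. We therefore analyze the causes affecting CSTPC retrieval efficiency in terms of database structure, indexing quality, and update speed, and propose corresponding measures to improve it.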
16.
17.
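Classification Indexing of Economics Journal Articles Total citations: 2, self-citations: 0, citations by others: 2
Discusses classification indexing of economics journal articles on the basis of the Chinese Library Classification (《中国图书馆分类法》, 4th edition). Such indexing should start from a full understanding of the particular characteristics of these articles and follow three principles: thorough indexing, apt class assignment, and appropriate indexing depth. The work should proceed in three steps: analyzing the subject, determining the class, and assigning the class number. When handling alternative classes, class-number combination, and interdisciplinary topics, indexers should index everything and reflect it in multiple places, providing as many access points as possible and fully revealing all subjects contained in the document.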
18.
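Parsing XML Documents with JDOM and Its Application to Data Conversion* Total citations: 5, self-citations: 1, citations by others: 5
Differences in computing platforms and data storage models among enterprises, organizations, and digital libraries seriously hinder information exchange. To eliminate these "information silos", this paper combines Java's cross-platform nature with XML's role as a standard platform for information exchange, and uses JDOM to extract valid data from a database and save it as XML documents, meeting the need for multiple representations of data and for data exchange across heterogeneous databases.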
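The database-to-XML conversion described in this abstract can be illustrated roughly as follows. This is a sketch in Python using `sqlite3` and `xml.etree.ElementTree` as stand-ins; the paper itself uses Java and JDOM, and the table name, tag names, and helper function here are hypothetical. It assumes column names are valid XML element names and that the table name comes from a trusted source.

```python
import sqlite3
import xml.etree.ElementTree as ET


def rows_to_xml(conn, table, root_tag="records", row_tag="record"):
    """Dump every row of `table` into an XML tree, one child per column."""
    # The table name is interpolated directly, so it must be trusted input.
    cur = conn.execute(f'SELECT * FROM "{table}"')
    columns = [d[0] for d in cur.description]
    root = ET.Element(root_tag, {"source": table})
    for row in cur:
        rec = ET.SubElement(root, row_tag)
        for col, value in zip(columns, row):
            field = ET.SubElement(rec, col)
            # NULL values become empty elements.
            field.text = "" if value is None else str(value)
    return root


# Hypothetical sample data held in an in-memory database.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE books (id INTEGER, title TEXT)")
conn.executemany(
    "INSERT INTO books VALUES (?, ?)",
    [(1, "XML Retrieval"), (2, "JDOM in Practice")],
)
xml_text = ET.tostring(rows_to_xml(conn, "books"), encoding="unicode")
```

Because the output is plain XML, a consumer on another platform or database only needs an XML parser to read it, which is the exchange property the abstract relies on.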