首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到17条相似文献,搜索用时 98 毫秒
1.
INEX是当今信息检索领域最重要的国际评测会议之一.文章通过对INEX 2004年至2010年检索评价项目数量、项目类型、项目任务、测试集的变化,以及对IST 2007年至2010年所关注项目,参与INEX评测的机构的分析,了解XML检索领域的发展方向与趋势,以促进我国科研团队在XML语言检索评价领域更加深入的探索和发展.  相似文献   

2.
INEX与TREC是检索领域的两大检索系统评价平台,在检索技术发展迅速的今天依然保持强大生命力,在当今检索技术评价领域起着十分重要的作用。本篇文章通过对INEX与TREC的研究目标以及平台的构成要素包括三个方面:测试集、检索问题的构造、相关性评估的比较,找出INEX相对于TREC评测平台的创新及不同点,以便更加深入和全面地了解INEX的评测方法。  相似文献   

3.
基于域加权词频法的XML文档级检索实现与评价   总被引:1,自引:0,他引:1  
利用BM25F模型,通过实验,在INEX 04数据集的基础上,实现了对多个域(元素)词频进行加权的XML文档级检索。XML文档结构的确蕴含了一定的语义信息。利用这些语义信息,可以提高检索性能。表2。图1。参考文献16。  相似文献   

4.
随着社交网络的兴起和发展,互联网上出现了大量与商品有关的社会信息。如何利用这些社会信息结合商品元数据进行检索和推荐是信息检索领域中一个热门的研究问题。本文以社会图书检索为例,提出了一种通用的信息检索方法来解决这一问题。首先,通过分析原始图书数据集和图书的用户标签、用户评分和流行度等社会信息,从图书中提取不同的社会特征构建特征矩阵;然后分别计算图书在各种社会特征上的相似度,并使用不同的策略对搜索引擎返回的排序结果进行重排序;最后使用学习排序的方法进行重排结果融合,得到最终的图书检索结果。在实验中,使用该检索方法在INEX Social Book Search 2015和2016数据集上分别进行了训练和测试。结果表明,相比现有的技术,该检索方法能够有效提升图书检索的效果。  相似文献   

5.
吴世文 《新闻传播》2009,(10):13-15
本文采用比较的研究方法,对中西媒体会议新闻报道进行了对比分析。研究发现:西方媒体会议新闻报道所采取的新闻本位的报道观念、受众视角较之我国媒体对应的会议本位、会议主办者视角和严肃性.更符合会议新闻报道规律,值得学习与借鉴。  相似文献   

6.
中国广播电视学会各专业研究委员会负责人会议于3月20日在京召开。会议就研究委员会如何贯彻学会第二届理事会议精神,落实《关于专业研究委员会暂行管理办法》进行了讨论。会议由学会秘书长孙以森主持,常务副会长郝平南、何光,常务理事李振水讲了  相似文献   

7.
向熹 《新闻界》2001,(1):21-22
北京、上海、广州都提出要建立区域的“会议中心”;“会议经济”也成为经济研究人员的案头之题,而报纸尤其是机关报,却在为“会议新闻”头痛。   笔者参加的各种会议多了,对“会议新闻”的抱怨听多了,也着手作了些研究,才发现这原来不是一个小题目。本文陈述了近期的研究结果。   一、理一理“会议”的概念   “会议新闻多了,所以报纸一版不好看”;   “要改变现在的状况,必须改变会议新闻的写法,作到‘跳出会议写会议’”;   ……   这样论断与评价我们听得多了,它们传达着同样的信息:对新闻来说,会议不是什么…  相似文献   

8.
信息速递     
政策 中央政治局研究进一步改进会议和领导同志活动新闻报道 据新华社报道,中共中央政治局3月28日召开会议,研究进一步改进会议和领导同志活动新闻报道等工作。中共中央总书记胡锦涛主持会议。  相似文献   

9.
随着我国科学技术日新月异的发展,各学术团体和研究机构为了进行有效的学术交流,都要举行各种类型的学术会议(以下简称会议)。会议文献是会议的产物,具有很高的情报价值。在发达国家,会议文献的开发利用已具有很高的水平。相比之下,我国的会议文献开发利用工作还很落后。因此,有必要对怎样充分开发、有效利用我国会议文献这一问题进行研究。  相似文献   

10.
杨红娟 《新闻窗》2011,(3):128-128
会议是传达精神的载体,会议是探讨方针的手段。在实际生活中,每天都有形形色色的会议在召开。有的会议是为了探讨某一件事情,有的会议是为了研究某一个方案,有的会议是为了传达某一个政策,也有的会议是为了学习某一种精神。  相似文献   

11.
This special issue contains articles describing XML retrieval approaches developed and evaluated during the second year of INEX, the evaluation initiative for XML retrieval.  相似文献   

12.
TIJAH: Embracing IR Methods in XML Databases   总被引:1,自引:0,他引:1  
This paper discusses our participation in INEX (the Initiative for the Evaluation of XML Retrieval) using the TIJAH XML-IR system. TIJAHs system design follows a standard layered database architecture, carefully separating the conceptual, logical and physical levels. At the conceptual level, we classify the INEX XPath-based query expressions into three different query patterns. For each pattern, we present its mapping into a query execution strategy. The logical layer exploits score region algebra (SRA) as the basis for query processing. We discuss the region operators used to select and manipulate XML document components. The logical algebra expressions are mapped into efficient relational algebra expressions over a physical representation of the XML document collection using the pre-post numbering scheme. The paper concludes with an analysis of experiments performed with the INEX test collection.  相似文献   

13.
In this paper we evaluate the application of data fusion or meta-search methods, combining different algorithms and XML elements, to content-oriented retrieval of XML structured data. The primary approach is the combination of a probabilistic methods using Logistic regression and the Okapi BM-25 algorithm for estimation of document relevance or XML element relevance, in conjunction with Boolean approaches for some query elements. In the evaluation we use the INEX XML test collection to examine the relative performance of individual algorithms and elements and compare these to the performance of the data fusion approaches.  相似文献   

14.
Entity ranking has recently emerged as a research field that aims at retrieving entities as answers to a query. Unlike entity extraction where the goal is to tag names of entities in documents, entity ranking is primarily focused on returning a ranked list of relevant entity names for the query. Many approaches to entity ranking have been proposed, and most of them were evaluated on the INEX Wikipedia test collection. In this paper, we describe a system we developed for ranking Wikipedia entities in answer to a query. The entity ranking approach implemented in our system utilises the known categories, the link structure of Wikipedia, as well as the link co-occurrences with the entity examples (when provided) to retrieve relevant entities as answers to the query. We also extend our entity ranking approach by utilising the knowledge of predicted classes of topic difficulty. To predict the topic difficulty, we generate a classifier that uses features extracted from an INEX topic definition to classify the topic into an experimentally pre-determined class. This knowledge is then utilised to dynamically set the optimal values for the retrieval parameters of our entity ranking system. Our experiments demonstrate that the use of categories and the link structure of Wikipedia can significantly improve entity ranking effectiveness, and that topic difficulty prediction is a promising approach that could also be exploited to further improve the entity ranking performance.  相似文献   

15.
Content-only queries in hierarchically structured documents should retrieve the most specific document nodes which are exhaustive to the information need. For this problem, we investigate two methods of augmentation, which both yield high retrieval quality. As retrieval effectiveness, we consider the ratio of retrieval quality and response time; thus, fast approximations to the 'correct' retrieval result may yield higher effectiveness. We present a classification scheme for algorithms addressing this issue, and adopt known algorithms from standard document retrieval for XML retrieval. As a new strategy, we propose incremental-interruptible retrieval, which allows for instant presentation of the top ranking documents. We develop a new algorithm implementing this strategy and evaluate the different methods with the INEX collection.  相似文献   

16.
This paper investigates the impact of three approaches to XML retrieval: using Zettair, a full-text information retrieval system; using eXist, a native XML database; and using a hybrid system that takes full article answers from Zettair and uses eXist to extract elements from those articles. For the content-only topics, we undertake a preliminary analysis of the INEX 2003 relevance assessments in order to identify the types of highly relevant document components. Further analysis identifies two complementary sub-cases of relevance assessments (General and Specific) and two categories of topics (Broad and Narrow). We develop a novel retrieval module that for a content-only topic utilises the information from the resulting answer list of a native XML database and dynamically determines the preferable units of retrieval, which we call Coherent Retrieval Elements. The results of our experiments show that—when each of the three systems is evaluated against different retrieval scenarios (such as different cases of relevance assessments, different topic categories and different choices of evaluation metrics)—the XML retrieval systems exhibit varying behaviour and the best performance can be reached for different values of the retrieval parameters. In the case of INEX 2003 relevance assessments for the content-only topics, our newly developed hybrid XML retrieval system is substantially more effective than either Zettair or eXist, and yields a robust and a very effective XML retrieval.  相似文献   

17.
Most recent document standards like XML rely on structured representations. On the other hand, current information retrieval systems have been developed for flat document representations and cannot be easily extended to cope with more complex document types. The design of such systems is still an open problem. We present a new model for structured document retrieval which allows computing scores of document parts. This model is based on Bayesian networks whose conditional probabilities are learnt from a labelled collection of structured documents—which is composed of documents, queries and their associated assessments. Training these models is a complex machine learning task and is not standard. This is the focus of the paper: we propose here to train the structured Bayesian Network model using a cross-entropy training criterion. Results are presented on the INEX corpus of XML documents.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号