首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Lexical and Syntactic knowledge for Information Retrieval
Authors:Antonio Ferrández
Institution:Dept. Languages and Information Systems, Carretera San Vicente S/N, University of Alicante, 03080 Alicante, Spain
Abstract:Traditional Information Retrieval (IR) models assume that the index terms of queries and documents are statistically independent of each other, which is intuitively wrong. This paper proposes the incorporation of the lexical and syntactic knowledge generated by a POS-tagger and a syntactic Chunker into traditional IR similarity measures for including this dependency information between terms. Our proposal is based on theories of discourse structure by means of the segmentation of documents and queries into sentences and entities. Therefore, we measure dependencies between entities instead of between terms. Moreover, we handle discourse references for each entity. It has been evaluated on Spanish and English corpora as well as on Question Answering tasks obtaining significant increases.
Keywords:Information Retrieval  Natural Language Processing  Term Proximity  Question Answering  Lexical and syntactic relationships
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号