首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Summarisation of the logical structure of XML documents
Authors:Zoltán Szlávik  Anastasios Tombros  Mounia Lalmas
Institution:1. Department of Computer Science, VU University Amsterdam, 1081 HV Amsterdam, The Netherlands;2. School of Electronic Engineering and Computer Science, Queen Mary University of London, E1 4NS London, United Kingdom;3. Yahoo! Research Barcelona, Avinguda Diagonal 177, 08018 Barcelona, Spain
Abstract:Summarisation is traditionally used to produce summaries of the textual contents of documents. In this paper, it is argued that summarisation methods can also be applied to the logical structure of XML documents. Structure summarisation selects the most important elements of the logical structure and ensures that the user’s attention is focused towards sections, subsections, etc. that are believed to be of particular interest. Structure summaries are shown to users as hierarchical tables of contents. This paper discusses methods for structure summarisation that use various features of XML elements in order to select document portions that a user’s attention should be focused to. An evaluation methodology for structure summarisation is also introduced and summarisation results using various summariser versions are presented and compared to one another. We show that data sets used in information retrieval evaluation can be used effectively in order to produce high quality (query independent) structure summaries. We also discuss the choice and effectiveness of particular summariser features with respect to several evaluation measures.
Keywords:Structure summarisation  XML retrieval
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号