Compression of double array structures for fixed length keywords
Authors: Masao Fuketa, Hiroya Kitagawa, Takuki Ogawa, Kazuhiro Morita, Jun-ichi Aoe
Institution: Department of Information Science and Intelligent Systems, Faculty of Engineering, University of Tokushima, 2-1 Minami josanjima, Tokushima 770-8506, Japan
Abstract: A trie is a data structure for keyword matching, used in natural language processing, IP address routing, and similar applications. It can be represented in the matrix form, the list form, the double array, or LOUDS. The double array representation combines the retrieval speed of the matrix form with the compactness of the list form. LOUDS is a succinct data structure based on a bit string; its retrieval is no faster than that of the double array, but its space usage is smaller. This paper proposes a compressed version of the double array that divides the trie into multiple levels and removes the BASE array from the double array. A retrieval algorithm and a construction algorithm are also proposed. In the presented experiments on pseudo and real data sets, the retrieval speed of the proposed method is almost the same as that of the double array, and its space usage is compressed to 66% compared with LOUDS for a large set of fixed-length keywords.
Keywords: Trie; Double array; Fixed length keyword; Compression method
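
For readers unfamiliar with the double array mentioned in the abstract, the following is a minimal sketch of the conventional BASE/CHECK representation. It is not the compressed method proposed in the paper, which removes the BASE array. The function names (`build_double_array`, `step`, `contains`), the end-of-key marker with code 0, and the naive first-fit placement are all assumptions made for illustration. A transition from state `s` on character code `c` goes to `t = BASE[s] + c` and is valid only if `CHECK[t] == s`.

```python
from collections import deque


def build_double_array(keys):
    """Build BASE/CHECK arrays for a set of keys (naive first-fit placement)."""
    # Illustrative assumption: characters get codes 1..k; code 0 marks end of key.
    alphabet = sorted({ch for key in keys for ch in key})
    code = {ch: i + 1 for i, ch in enumerate(alphabet)}

    # Intermediate pointer-based trie: each node maps a code to a child dict.
    root = {}
    for key in keys:
        node = root
        for ch in key:
            node = node.setdefault(code[ch], {})
        node.setdefault(0, {})  # end-of-key marker

    base, check = [0], [-1]  # index 0 is the root state; -1 marks a free slot

    def ensure(n):
        # Grow both arrays so that index n exists.
        while len(base) <= n:
            base.append(0)
            check.append(-1)

    queue = deque([(0, root)])
    while queue:
        state, node = queue.popleft()
        if not node:
            continue  # end-marker state: no outgoing transitions
        codes = sorted(node)
        b = 1
        while True:  # first offset where every child slot is still free
            ensure(b + codes[-1])
            if all(check[b + c] == -1 for c in codes):
                break
            b += 1
        base[state] = b
        for c in codes:
            check[b + c] = state  # slot b + c now belongs to `state`
            queue.append((b + c, node[c]))
    return base, check, code


def step(base, check, state, c):
    """Follow one transition; return the next state, or -1 if there is none."""
    t = base[state] + c
    return t if t < len(check) and check[t] == state else -1


def contains(base, check, code, key):
    """Return True if `key` was one of the keys used to build the arrays."""
    state = 0
    for ch in key:
        c = code.get(ch)
        if c is None:
            return False
        state = step(base, check, state, c)
        if state < 0:
            return False
    return step(base, check, state, 0) >= 0  # must end on the end marker


# Small fixed-length key set, chosen only for illustration.
keys = ["ab12", "ab34", "cd12"]
base, check, code = build_double_array(keys)
```

Each lookup step costs one addition and one comparison, which is where the matrix-like retrieval speed comes from; sharing one pair of arrays among all states is what keeps the structure compact.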
This article is indexed in ScienceDirect and other databases.