Journal of Beijing University of Posts and Telecommunications

  • EI核心期刊

JOURNAL OF BEIJING UNIVERSITY OF POSTS AND TELECOM ›› 2013, Vol. 36 ›› Issue (2): 84-88.doi: 10.13190/jbupt.201302.84.232

• Reports • Previous Articles     Next Articles

A Fast Trie Tree Index Construction Algorithm Using Frequency Characteristic

ZHANG Qi-fei1, WU Ji-yi1,2, LI Wen-juan1,3, LV Hong-bing1, PAN Xue-zeng1   

  1. 1. College of Computer Science and Technology, Zhejiang University, Hangzhou 310027, China;<br>2. Key Laboratory of E-Business and Information Security, Hangzhou Normal University, Hangzhou 310036, China;<br>3. College of Qianjiang, Hangzhou Normal University, Hangzhou 310036, China
  • Received:2012-09-29 Revised:2013-01-18 Online:2013-04-30 Published:2013-03-25

Abstract:

With maturity of "the Internet of things" and establishment of cloud computing national standards, kinds of terminals appear quickly, and huge amounts of datas generation exponentially increase, so its crucial to construct index for the data. A fast Trie-tree index construction algorithm is proposed. All the strings are sorted and then the sorted strings are preprocessed, and after that a matrix with the element of triple is generated, consisting of the character, the horizontal and vertical offset of the character. The fast algorithm scans each column in turn and skips the repeated rows and columns with the same prefix according to offset value in triple array. The experimental results show that the fast algorithm significantly reduces the construction time compared with traditional algorithm and the performance is better than Aoes double-array Trie construction algorithm.

Key words: index construction, fast algorithm, Trie-tree, character frequency, double-array Trie

CLC Number: