An Efficient Trie Construction for Natural Language Dictionaries
Titel:
An Efficient Trie Construction for Natural Language Dictionaries
Auteur:
Sumitomo, T. Morita, K. Fuketa, M. Tokunaga, H. Aoe, J. -I.
Verschenen in:
International journal of computer mathematics
Paginering:
Jaargang 79 (2002) nr. 6 pagina's 703-713
Jaar:
2002
Inhoud:
A trie structure is frequently used for various applications, such as natural language dictionaries, database systems and compilers. However, the total number of nodes of the trie becomes large and it takes a lot of spaces for a huge set of keys. The space cost becomes a serious problem if long strings, or compound words, are stored in the trie. In order to resolve this disadvantage, this paper presents a compression method for these long strings by using trie arc for single words. The concept of the compression scheme to be presented is to replace long strings into the corresponding leaf node numbers of the trie. The double array structure is introduced because a fast backward tracing of the trie is required in this approach. The theoretical and experimental observations show that the method presented is more practical than existing ones.