Click here to Skip to main content
Click here to Skip to main content

String Tree Dictionary

By , 23 Sep 2008
 

Introduction

A StringTreeDictionary<T> class implements a IDictionary<string, T> interface and is 57% faster than a Dictionary<string, T> at a task described below, 50% slower at adding items and much slower at accessing them. Even so it can be useful if you have to translate a sequence of characters.

Using the Code

A method, MatchKey, provided in a class tries to find a key in a dictionary which matches the beginning of a given string. If the key was found, then it returns a KeyValuePair<string, TKey> object. Otherwise a KeyNotFoundException is thrown. For example, if a dictionary would contain keys aaba, ac, ccbc and abcd, then the MatchKey("accdba") would retaurn a pair {"ac", dict["ac"]}. Note that we do not know a length of a key we are searching for. The method can be used to extract a sequence of meaningful strings (keys) from a continuous stream of characters.

How It Works

Internally, the StringTreeDictionary object represents a node of a tree structure. There are the following fields inside:

public class StringTreeDictionary<T> : IDictionary<string, T>
{
	private T value;
	private bool hasValue = false;
	private Dictionary<char, StringTreeDictionary<T>> childs = null;

A single StringTreeDictionary object contains just one value and a dictionary with child nodes as values and chars as keys. Let's see how the items are stored.

StringTreeDictionary_diagram1.jpg

Such a dictionary could be instantinated this way:

StringTreeDictionary<string> std = new StringTreeDictionary<string> {
	{"bear", "sleuth"},
	{"bee", "grist"},
	{"bird", "flock"},
	{"cat", "clowder"}
};

The diagram clearly shows that the dictionary does not like long keys.

Remarks

  • A StringTreeDictionary<T> is good only for very specific cases; do not use it instead of a standard .NET dictionary.
  • Accessing items with keys longer than 3 characters is terribly slow and is not recommended.
  • The dictionary does not support a true item removal. A faked Remove method only disallows accessing deleted items and sets value to default(TValue), but does not detach unused nodes.
  • The MatchKey method searches for the shortest key matching a string's beginning. If you would add keys acx and ac, then the acx would never be found.

History

  • 2008-09-20 - First version posted
  • 2008-09-20 - Download files updated

License

This article, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)

About the Author

Jacek Gajek
Poland Poland
Member
My name is Jacek Gajek. I study computer science in the University of Technology in Wrocław. I like C#. I like cats. I like Monthy Python's sense of humour.

Sign Up to vote   Poor Excellent
Add a reason or comment to your vote: x
Votes of 3 or less require a comment

Comments and Discussions

 
Hint: For improved responsiveness ensure Javascript is enabled and choose 'Normal' from the Layout dropdown and hit 'Update'.
You must Sign In to use this message board.
Search this forum  
    Spacing  Noise  Layout  Per page   
QuestionIs it Ternary Search Trie implementation?memberc-smile23 Sep '08 - 17:17 
Am I correct in my understanding that this is Ternary Search Trie implementation?
 
Dr. Dobb's | Ternary Search Trees | April 1, 1998
AnswerRe: Is it Ternary Search Trie implementation?membergajatko23 Sep '08 - 23:04 
GeneralThoughtsmemberPIEBALDconsult23 Sep '08 - 10:09 
GeneralRe: Thoughtsmembergajatko23 Sep '08 - 10:59 
GeneralRe: ThoughtsmemberPIEBALDconsult23 Sep '08 - 12:56 
GeneralRe: Thoughtsmembergajatko23 Sep '08 - 22:28 
GeneralRe: ThoughtsmemberPIEBALDconsult24 Sep '08 - 3:47 
GeneralRe: Thoughtsmembergajatko24 Sep '08 - 9:57 
GeneralRe: ThoughtsmemberPIEBALDconsult24 Sep '08 - 10:20 
GeneralRe: Thoughtsmembergajatko25 Sep '08 - 1:00 

General General    News News    Suggestion Suggestion    Question Question    Bug Bug    Answer Answer    Joke Joke    Rant Rant    Admin Admin   

Permalink | Advertise | Privacy | Mobile
Web01 | 2.6.130523.1 | Last Updated 23 Sep 2008
Article Copyright 2008 by Jacek Gajek
Everything else Copyright © CodeProject, 1999-2013
Terms of Use
Layout: fixed | fluid