Click here to Skip to main content
Licence CPOL
First Posted 23 Sep 2008
Views 26,966
Downloads 283
Bookmarked 14 times

String Tree Dictionary

By | 23 Sep 2008 | Article
Good for decoding a stream of characters with a finite number of predefined tokens

Introduction

A StringTreeDictionary<T> class implements a IDictionary<string, T> interface and is 57% faster than a Dictionary<string, T> at a task described below, 50% slower at adding items and much slower at accessing them. Even so it can be useful if you have to translate a sequence of characters.

Using the Code

A method, MatchKey, provided in a class tries to find a key in a dictionary which matches the beginning of a given string. If the key was found, then it returns a KeyValuePair<string, TKey> object. Otherwise a KeyNotFoundException is thrown. For example, if a dictionary would contain keys aaba, ac, ccbc and abcd, then the MatchKey("accdba") would retaurn a pair {"ac", dict["ac"]}. Note that we do not know a length of a key we are searching for. The method can be used to extract a sequence of meaningful strings (keys) from a continuous stream of characters.

How It Works

Internally, the StringTreeDictionary object represents a node of a tree structure. There are the following fields inside:

public class StringTreeDictionary<T> : IDictionary<string, T>
{
	private T value;
	private bool hasValue = false;
	private Dictionary<char, StringTreeDictionary<T>> childs = null;

A single StringTreeDictionary object contains just one value and a dictionary with child nodes as values and chars as keys. Let's see how the items are stored.

StringTreeDictionary_diagram1.jpg

Such a dictionary could be instantinated this way:

StringTreeDictionary<string> std = new StringTreeDictionary<string> {
	{"bear", "sleuth"},
	{"bee", "grist"},
	{"bird", "flock"},
	{"cat", "clowder"}
};

The diagram clearly shows that the dictionary does not like long keys.

Remarks

  • A StringTreeDictionary<T> is good only for very specific cases; do not use it instead of a standard .NET dictionary.
  • Accessing items with keys longer than 3 characters is terribly slow and is not recommended.
  • The dictionary does not support a true item removal. A faked Remove method only disallows accessing deleted items and sets value to default(TValue), but does not detach unused nodes.
  • The MatchKey method searches for the shortest key matching a string's beginning. If you would add keys acx and ac, then the acx would never be found.

History

  • 2008-09-20 - First version posted
  • 2008-09-20 - Download files updated

License

This article, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)

About the Author

Jacek Gajek



Poland Poland

Member

My name is Jacek Gajek. I study computer science in the University of Technology in Wrocław. I like C#. I like cats. I like Monthy Python's sense of humour.

Sign Up to vote   Poor Excellent
Add a reason or comment to your vote: x
Votes of 3 or less require a comment

Comments and Discussions

 
You must Sign In to use this message board. (secure sign-in)
 
Search this forum  
 FAQ
    Noise  Layout  Per page   
  Refresh
QuestionIs it Ternary Search Trie implementation? Pinmemberc-smile17:17 23 Sep '08  
AnswerRe: Is it Ternary Search Trie implementation? Pinmembergajatko23:04 23 Sep '08  
GeneralThoughts PinmemberPIEBALDconsult10:09 23 Sep '08  
GeneralRe: Thoughts Pinmembergajatko10:59 23 Sep '08  
GeneralRe: Thoughts PinmemberPIEBALDconsult12:56 23 Sep '08  
GeneralRe: Thoughts Pinmembergajatko22:28 23 Sep '08  
PIEBALDconsult wrote:
at a task described below

Read the "Using the code" section.
PIEBALDconsult wrote:
. I'd also like to see how you came up with that number.

Download a test project with a benchmark. All numbers are real measured data on my machine (Quad 2.4 2GB RAM).
 
Greetings - Gajatko
 
Portable.NET is part of DotGNU, a project to build a complete Free Software replacement for .NET - a system that truly belongs to the developers.

GeneralRe: Thoughts PinmemberPIEBALDconsult3:47 24 Sep '08  
GeneralRe: Thoughts Pinmembergajatko9:57 24 Sep '08  
GeneralRe: Thoughts PinmemberPIEBALDconsult10:20 24 Sep '08  
GeneralRe: Thoughts Pinmembergajatko1:00 25 Sep '08  

General General    News News    Suggestion Suggestion    Question Question    Bug Bug    Answer Answer    Joke Joke    Rant Rant    Admin Admin   

Use Ctrl+Left/Right to switch messages, Ctrl+Up/Down to switch threads, Ctrl+Shift+Left/Right to switch pages.

Permalink | Advertise | Privacy | Mobile
Web02 | 2.5.120604.1 | Last Updated 23 Sep 2008
Article Copyright 2008 by Jacek Gajek
Everything else Copyright © CodeProject, 1999-2012
Terms of Use
Layout: fixed | fluid