SQL Parser

3 Feb 2009
This article describes an implementation of a simple SQL parser which can be used to add or modify "where" and "order by" parts of SQL queries (PL/SQL).
TestApp.gif

Introduction

Sometimes you need to apply a custom filter to an existing SQL query (to search by custom criteria) or to order the query results according to a user action (for example, when you work with large amounts of data and display only a small part of it, such as the first N records). I faced this problem while implementing a reusable search control. In this control, the user should be able to specify a filter and order the search results as needed. The SQL query is specified in the control’s data source and can contain any parts, including sub-queries. In this case, it is not enough to append a custom ‘where’ clause to the end of the query, because the query may already contain parts which must follow the ‘where’ clause according to the SQL syntax (an ‘order by’ clause, for example). This article describes a simple SQL parser which allows you to add or modify the ‘where’ and ‘order by’ clauses of a SQL query (PL/SQL).

Background

Any document can be separated into tags (special words or specially formatted character sequences which carry some extra meaning; which sequences of characters are treated as tags depends on the document format and the task), words, separators (periods, commas, braces, etc.), and white space. Tags can contain other elements such as sub-tags, words, etc. The parser presented in this article separates a text (a sequence of characters) into the elements mentioned above and builds a tree from them. Later this tree can be restructured, and some of its nodes can be changed or removed according to the task.

Every tag in the document is represented by a special class (every type of tag has its own class). While parsing a document, the parser reads it character by character and determines whether the current sequence of characters is a tag or plain text. If it is a tag, the parser creates an instance of the class which represents the tag.
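
To make the character-by-character scan more concrete, here is a minimal sketch that only separates words, separators, and white space. It is not the article's parser: the ElementKind, Element, and SimpleScanner names are hypothetical, and tag recognition is left out entirely.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical element kinds, for illustration only.
enum ElementKind { Word, Separator, WhiteSpace }

sealed class Element
{
    public readonly ElementKind Kind;
    public readonly string Text;
    public Element(ElementKind kind, string text) { Kind = kind; Text = text; }
}

static class SimpleScanner
{
    // A word character here is a letter, a digit, or an underscore (an assumption).
    static bool IsWordChar(char c)
    {
        return char.IsLetterOrDigit(c) || c == '_';
    }

    // Splits the text into words, single-character separators,
    // and runs of white space, in document order.
    public static List<Element> Scan(string text)
    {
        var result = new List<Element>();
        int i = 0;
        while (i < text.Length)
        {
            int start = i;
            ElementKind kind;
            if (char.IsWhiteSpace(text[i]))
            {
                kind = ElementKind.WhiteSpace;
                while (i < text.Length && char.IsWhiteSpace(text[i])) i++;
            }
            else if (IsWordChar(text[i]))
            {
                kind = ElementKind.Word;
                while (i < text.Length && IsWordChar(text[i])) i++;
            }
            else
            {
                kind = ElementKind.Separator;
                i++;
            }
            result.Add(new Element(kind, text.Substring(start, i - start)));
        }
        return result;
    }
}
```

A real parser would check for a tag at each position before falling back to these three element kinds.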

Every class which describes a tag is derived from an abstract class named TagBase and carries common information about the tag: whether it has contents, its type (a string identifier), its value, and whether it can be terminated by the end of the document (as opposed to requiring an explicit ending).

The parser has a static set of classes (types) which corresponds to the set of tags of the document format (the Tags property). When the parser needs to determine whether there is a tag at the current position of the document, it enumerates its collection of tag types. For every tag type (element of the collection), it requests a special attribute (derived from the MatchTagAttributeBase class). This attribute has a method named Match, which returns a value indicating whether this type of tag is located at the specified position in the document. If so, the parser creates an instance of that class. After an instance of a tag class is created, it is converted into an XML node and added to the XML tree which reflects the structure of the query.

Here is what a tag class declaration looks like:

C#
[TagType("STRING_LITERAL")]
[MatchStringLiteralTag]
internal class StringLiteralTag : TagBase
{
...
}
internal class MatchStringLiteralTagAttribute : MatchTagAttributeBase
{
  public override bool Match(string sql, int position)
  {
    ...
  }
}
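
The lookup described above (enumerate the tag types, ask each type's attribute whether it matches, then create an instance) might be sketched roughly as follows. This is an assumption-laden illustration, not the code from the download: the base classes are simplified stand-ins, the Match body is a guess that only checks for an opening single quote, and TagFinder with its FindTag method is a hypothetical name.

```csharp
using System;

// Simplified stand-ins for the article's base classes (assumptions).
abstract class MatchTagAttributeBase : Attribute
{
    public abstract bool Match(string sql, int position);
}

abstract class TagBase { }

// Illustrative matcher: reports a tag wherever a single quote starts.
class MatchStringLiteralTagAttribute : MatchTagAttributeBase
{
    public override bool Match(string sql, int position)
    {
        return position < sql.Length && sql[position] == '\'';
    }
}

[MatchStringLiteralTag]
class StringLiteralTag : TagBase { }

static class TagFinder
{
    // Assumed equivalent of the parser's static collection of tag types.
    static readonly Type[] Tags = { typeof(StringLiteralTag) };

    // Returns a new tag instance if some tag type matches at the
    // given position, or null if no tag starts there.
    public static TagBase FindTag(string sql, int position)
    {
        foreach (Type tagType in Tags)
        {
            object[] attributes =
                tagType.GetCustomAttributes(typeof(MatchTagAttributeBase), false);
            if (attributes.Length == 0)
                continue;

            var matcher = (MatchTagAttributeBase)attributes[0];
            if (matcher.Match(sql, position))
                return (TagBase)Activator.CreateInstance(tagType);
        }
        return null;
    }
}
```

Keeping the matching logic in an attribute means a new tag type can be supported by adding one class and one entry in the type collection, without touching the scanning loop.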

Here is the class diagram of the tags used in the SQL parser:

TagClassDiagram.gif

This list does not include all the tags which may be present in a SQL query; it includes only those tags which are necessary for modifying the ‘where’ and ‘order by’ clauses of a query.

Using the Code

To modify a SQL query, you should first create an instance of the SqlParser class and then invoke its Parse method:

C#
SqlParser myParser = new SqlParser();
myParser.Parse(mySqlQuery);

If you need to modify the ‘where’ clause, you should modify the WhereClause property of the parser:

C#
string myOriginalWhereClause = myParser.WhereClause;
if (string.IsNullOrEmpty(myOriginalWhereClause))
  myParser.WhereClause = myAdditionalWhereClause;
else
  // Parenthesize both conditions so that AND applies to each as a whole.
  myParser.WhereClause = string.Format("({0}) AND ({1})", myOriginalWhereClause,
    myAdditionalWhereClause);

If you need to modify the ‘order by’ clause, you should modify the OrderByClause property of the parser:

C#
string myOriginalOrderByClause = myParser.OrderByClause;
if (string.IsNullOrEmpty(myOriginalOrderByClause))
  myParser.OrderByClause = myAdditionalOrderByClause;
else
  // The existing sort columns keep priority; the new ones are appended.
  myParser.OrderByClause = string.Format("{0}, {1}", myOriginalOrderByClause,
    myAdditionalOrderByClause);

After all the necessary modifications, you can get the final SQL query by using the ToText method:

C#
string myModifiedSqlQuery = myParser.ToText();
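
If the same combination rules are needed in several places, they could be factored into small helpers. The sketch below is independent of the parser and works on plain strings; ClauseHelper, CombineWhere, and CombineOrderBy are hypothetical names, not part of the article's code.

```csharp
using System;

static class ClauseHelper
{
    // Joins two WHERE conditions with AND, parenthesizing each one.
    // If either side is empty, the other is returned unchanged.
    public static string CombineWhere(string original, string additional)
    {
        if (string.IsNullOrEmpty(original)) return additional;
        if (string.IsNullOrEmpty(additional)) return original;
        return string.Format("({0}) AND ({1})", original, additional);
    }

    // Appends extra sort columns after the existing ones.
    // If either side is empty, the other is returned unchanged.
    public static string CombineOrderBy(string original, string additional)
    {
        if (string.IsNullOrEmpty(original)) return additional;
        if (string.IsNullOrEmpty(additional)) return original;
        return string.Format("{0}, {1}", original, additional);
    }
}
```

With such helpers, the assignment would become something like myParser.WhereClause = ClauseHelper.CombineWhere(myParser.WhereClause, myAdditionalWhereClause);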

Ways to Make Code Faster

There are a few ways to make the code work faster. When the parser builds a tree, it uses the XmlDocument class. This is useful for debugging, as we can save the tree to a file and then look through it with a web browser or another tool. We can also search for certain nodes using XPath syntax. But the parser will work faster if we build our own tree-like data structure without the redundant functionality.
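
As a rough idea of what such a structure might look like, here is a minimal sketch of a tree node that stores only a type, a value, and children. The ParseNode class and its members are hypothetical; the depth-first FindFirst method stands in for the simplest kind of XPath lookup.

```csharp
using System;
using System.Collections.Generic;
using System.Text;

// Hypothetical lightweight replacement for an XmlDocument-based tree.
class ParseNode
{
    public string Type { get; private set; }
    public string Value { get; private set; }
    public List<ParseNode> Children { get; private set; }

    public ParseNode(string type, string value)
    {
        Type = type;
        Value = value;
        Children = new List<ParseNode>();
    }

    // Appends a child and returns this node so calls can be chained.
    public ParseNode Add(ParseNode child)
    {
        Children.Add(child);
        return this;
    }

    // Depth-first search for the first node of the given type,
    // or null if there is none.
    public ParseNode FindFirst(string type)
    {
        if (Type == type) return this;
        foreach (ParseNode child in Children)
        {
            ParseNode found = child.FindFirst(type);
            if (found != null) return found;
        }
        return null;
    }

    // Rebuilds the text by concatenating node values in document order.
    public string ToText()
    {
        var sb = new StringBuilder(Value ?? string.Empty);
        foreach (ParseNode child in Children)
            sb.Append(child.ToText());
        return sb.ToString();
    }
}
```

The trade-off is losing the debugging conveniences mentioned above, such as saving the tree to a file, unless they are reimplemented.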

Also, it may be preferable to avoid reflection, as it is quite slow compared to the other parser operations (the method which returns the attributes of a class may be called thousands of times when parsing large documents). Instead, we can build data structures which store the metadata about the tag classes. The gain is not very significant for the SQL parser, but it may be much more significant for parsers of large documents.
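
One possible form of such a metadata store is a dictionary which remembers the attribute found for each type, so that reflection runs only once per type. The sketch below is an assumption rather than the article's code; AttributeCache and Get are hypothetical names.

```csharp
using System;
using System.Collections.Generic;

// Caches the first attribute of type TAttribute declared on each type.
static class AttributeCache<TAttribute> where TAttribute : Attribute
{
    static readonly Dictionary<Type, TAttribute> cache =
        new Dictionary<Type, TAttribute>();
    static readonly object sync = new object();

    // Returns the cached attribute for the type, or null if the type
    // does not declare one. Reflection is used only on the first call.
    public static TAttribute Get(Type type)
    {
        lock (sync)
        {
            TAttribute attribute;
            if (!cache.TryGetValue(type, out attribute))
            {
                object[] found =
                    type.GetCustomAttributes(typeof(TAttribute), false);
                attribute = found.Length > 0 ? (TAttribute)found[0] : null;
                cache[type] = attribute; // null results are cached too
            }
            return attribute;
        }
    }
}
```

In the parser, the matching loop could then ask the cache for each tag type's MatchTagAttributeBase instead of calling GetCustomAttributes at every position in the document.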

Change Log

  • 1 Feb 2009

    • A new tag which parses quoted identifiers was added.

License

This article, along with any associated source code and files, is licensed under The Common Development and Distribution License (CDDL).


Written By
Software Developer (Senior)
Canada

Comments and Discussions

 
General: MS SQL Support for column names like [Header:X-Notice-ID] (Mike Lucas, 26-Aug-10 4:23)
General: Re: MS SQL Support for column names like [Header:X-Notice-ID] (Sergey Gorbenko, 28-Aug-10 10:50)
Question: c/c++ version please? (Martial Spirit, 11-Aug-10 2:09)
Answer: Re: c/c++ version please? (Sergey Gorbenko, 11-Aug-10 2:11)
General: Sql parser for Oracle (kiquenet.com, 2-Aug-10 23:54)
General: Re: Sql parser for Oracle (Sergey Gorbenko, 2-Aug-10 23:57)
General: Re: Sql parser for Oracle (kiquenet.com, 5-Aug-10 1:19)
General: Re: Sql parser for Oracle (Sergey Gorbenko, 5-Aug-10 5:58)
This parser does not check whether a SQL query is correct, it just allows you to modify correct queries.
Serge

General: SQL parser (ekhashab, 6-Jul-10 7:07)
General: Re: SQL parser (Sergey Gorbenko, 19-Jul-10 3:58)
General: Good work. But a small bug (N a v a n e e t h, 17-May-10 21:03)
General: Little Bug (cbapiaz, 26-Apr-10 6:01)
General: Re: Little Bug (Sergey Gorbenko, 29-Apr-10 23:11)
General: Small bug (martin.nedopil, 3-Feb-09 22:37)
General: Re: Small bug [modified] (Sergey Gorbenko, 3-Feb-09 23:03)
General: Re: Small bug (martin.nedopil, 5-Feb-09 2:27)
General: Re: Small bug (martin.nedopil, 5-Feb-09 4:00)
General: Re: Small bug (rhyu, 21-Jan-10 6:00)
General: Excellent (JoseMenendez, 15-Jan-09 3:22)
General: Re: Excellent (milkplus, 11-Dec-09 7:21)
