Click here to Skip to main content
15,861,168 members
Articles / Programming Languages / C#

SQL Parser

Rate me:
Please Sign up or sign in to vote.
4.86/5 (47 votes)
3 Feb 2009CDDL4 min read 306.1K   15.2K   165   60
This article describes an implementation of a simple SQL parser which can be used to add or modify "where" and "order by" parts of SQL queries (PL/SQL).
TestApp.gif

Introduction

Sometimes it is necessary to apply a custom filter to an existing SQL query (to perform search by a custom criteria) or to order the query results depending on the user action (when you work with large amounts of data and display only a small part of it to the user, such as displaying only N first records). I faced this problem while implementing a reusable control for searching. In this control the user should be able to specify a filter and order the search results as he needs. A SQL query is specified in the control’s data source and it can contain any parts including sub-queries. In this case it is not enough just to add a custom ‘where’ clause to the end of the query as it may already contain parts which should follow the where clause according to the SQL syntax. This article describes a simple SQL parser which allows you to add or modify the ‘where’ and ‘order by’ clauses of a SQL query (PL/SQL).

Background

Any document can be separated into tags (special words or specially formatted characters which have some extra meaning; it depends on the document format and task which sequence of character should be treated as a tag), words, separators (points, commas, braces, etc.), and white spaces. Tags can contain other elements such as sub-tags, words, etc. The parser presented in this article separates a text (sequence of characters) into elements mentioned above and builds a tree from them. Later this tree can be restructured, some its nodes can be changed or removed according to the task.

Every tag in the document is represented with a special class (every type of tag has its own class). While parsing a document, the parser reads the document symbol by symbol and determines whether the current sequence of characters is a tag or a simple text. If it is a tag, the parser creates an instance of the class which represents the tag.

Every class which describes a tag is derived from an abstract class named TagBase and has common information about the tag (whether it has contents, its type (string identifier), value, and whether it can be terminated by the end of the document (whether its ending should be specified explicitly)).The parser has a static set of classes (types) which correspond to the set of the tags of the document format (the Tags property). When the parser needs to determine whether there is a tag at the current position of the document, it enumerates through its collection of tag types. For every tag type (element in the collection) it requests a special attribute (derived from the MatchTagAttributeBase class). This attribute has a special method named Match, which returns a value indicating whether this type of tag is located at the specified position in the document. If so, the parser creates an instance of that class. After an instance of a tag class is created, it is converted into an XML node and then added into the XML tree which reflects the structure of the query.

Here is how a tag class declaration looks like:

C#
[TagType("STRING_LITERAL")]
[MatchStringLiteralTag]
internal class StringLiteralTag : TagBase
{
...
}
internal class MatchStringLiteralTagAttribute : MatchTagAttributeBase
{
  public override bool Match(string sql, int position)
  {
    ...
  }
}

Here is the class diagram of the tags used in the SQL parser:

TagClassDiagram.gif

This list of tags does not include all the tags which may be present in a SQL query, it includes only those tags which are necessary for modifying the ‘where’ and ‘order by’ clauses of a query.

Using the Code

To modify a SQL query, you should first create an instance of the SqlParser class and then invoke its Parse method:

SQL
SqlParser myParser = new SqlParser();
myParser.Parse(mySqlQuery);

If you need to modify the ‘where’ clause, you should modify the WhereClause property of the parser:

C#
string myOrginalWhereClause = myParser.WhereClause;
if (string.IsNullOrEmpty(myOrginalWhereClause))
  myParser.WhereClause = myAdditionalWhereClause;
else
  myParser.WhereClause = string.Format("({0}) AND ({1})", myOrginalWhereClause,
  myAdditionalWhereClause);

If you need to modify the ‘order by’ clause, you should modify the OrderByClause property of the parser:

C#
string myOrginalOrderByClause = myParser.OrderByClause;
if (string.IsNullOrEmpty(myOrginalOrderByClause))
  myParser.OrderByClause = myAdditionalOrderByClause;
else
  myParser.OrderByClause = string.Format("{0}, {1}", myOrginalOrderByClause,
     myAdditionalOrderByClause);

After all the necessary modifications, you can get the final SQL query by using the ToText method:

C#
myParser.ToText();

Ways to Make Code Faster

There are a few ways to make the code work faster. When the parser builds a tree, it uses the XmlDocument class. This is useful for debugging as we can save the tree to a file and then look through it with an internet browser or another tool. Also we can search certain nodes with the x-path syntax. But the parser will work faster if we make our own tree-like data structure without redundant functionality.

Also, it may be preferable not to use reflection as it is quite slow compared to other parser operations (the method which returns attributes of a class may be accessed thousands of times when parsing large documents). Instead we can make some data structures which will store metadata about tag classes. It is not so substantial for the SQL parser, but it may be much more substantial for parsers of large documents.

Change Log

  • 1 Feb 2009

    • A new tag which parses quoted identifiers was added.

License

This article, along with any associated source code and files, is licensed under The Common Development and Distribution License (CDDL)


Written By
Software Developer (Senior)
Canada Canada
This member has not yet provided a Biography. Assume it's interesting and varied, and probably something to do with programming.

Comments and Discussions

 
QuestionHas anyone seen dfmJSON app for Delphi 11.1 by Mason Wheeler ? Pin
Member 265291713-Apr-22 0:07
Member 265291713-Apr-22 0:07 
Questionis this project still active or sleeping the big sleep? Pin
AllenRogerMarshall3-Apr-20 4:42
professionalAllenRogerMarshall3-Apr-20 4:42 
AnswerRe: is this project still active or sleeping the big sleep? Pin
AllenRogerMarshall3-Apr-20 5:05
professionalAllenRogerMarshall3-Apr-20 5:05 
BugQuoted identifiers make MatchEnd working incorrectly Pin
Member 1453330717-Jul-19 2:47
Member 1453330717-Jul-19 2:47 
QuestionProblem with "From"-column Pin
Daryna22-Feb-18 0:41
Daryna22-Feb-18 0:41 
QuestionNeed to fix the issue when any column with [] is inclueded Pin
htk21-Dec-17 5:08
htk21-Dec-17 5:08 
AnswerRe: Need to fix the issue when any column with [] is inclueded Pin
vtsiatsos18-Feb-19 23:52
vtsiatsos18-Feb-19 23:52 
QuestionThanks! Pin
eyedia18-Dec-17 10:23
eyedia18-Dec-17 10:23 
QuestionOne bug one sample SQL Pin
Member 1011124722-Jun-15 16:24
Member 1011124722-Jun-15 16:24 
QuestionOnly OrderByClause and WhereClause? How about SelectClause? Pin
devvvy7-Nov-14 17:36
devvvy7-Nov-14 17:36 
QuestionSpaces in Tokens Pin
Steve Pinckney18-Jun-12 18:38
Steve Pinckney18-Jun-12 18:38 
AnswerRe: Spaces in Tokens Pin
yorikLC23-Jan-14 3:20
yorikLC23-Jan-14 3:20 
GeneralRe: Spaces in Tokens Pin
Steve Pinckney23-Jan-14 5:33
Steve Pinckney23-Jan-14 5:33 
GeneralRe: Spaces in Tokens Pin
yorikLC18-Feb-14 21:13
yorikLC18-Feb-14 21:13 
BugBug when columnname contains SQL keyword name or is between [ ] Pin
Member 807470823-Jan-12 2:46
Member 807470823-Jan-12 2:46 
QuestionVery well!! Pin
flavio19661-Dec-11 21:06
flavio19661-Dec-11 21:06 
GeneralMy vote of 5 Pin
Kanasz Robert18-Nov-11 5:42
professionalKanasz Robert18-Nov-11 5:42 
QuestionAsk At Runtime Pin
rlrook9-Nov-11 7:19
rlrook9-Nov-11 7:19 
AnswerRe: Ask At Runtime Pin
Sergey Gorbenko9-Nov-11 14:04
Sergey Gorbenko9-Nov-11 14:04 
QuestionAdding ColumnsClause and support for paging Pin
martin.nedopil8-Nov-11 2:52
martin.nedopil8-Nov-11 2:52 
AnswerRe: Adding ColumnsClause and support for paging Pin
Sergey Gorbenko9-Nov-11 14:24
Sergey Gorbenko9-Nov-11 14:24 
Questionsimple but still useful Pin
Tako.Lee8-Aug-11 22:57
Tako.Lee8-Aug-11 22:57 
AnswerRe: simple but still useful Pin
Sergey Gorbenko9-Aug-11 12:44
Sergey Gorbenko9-Aug-11 12:44 
GeneralMy vote of 5 Pin
aaroncampf19-Apr-11 7:52
aaroncampf19-Apr-11 7:52 
GeneralParserBase.ParsedDocument - idenfication of joined tables/sets names? [modified] Pin
devvvy27-Feb-11 15:26
devvvy27-Feb-11 15:26 

General General    News News    Suggestion Suggestion    Question Question    Bug Bug    Answer Answer    Joke Joke    Praise Praise    Rant Rant    Admin Admin   

Use Ctrl+Left/Right to switch messages, Ctrl+Up/Down to switch threads, Ctrl+Shift+Left/Right to switch pages.