Click here to Skip to main content
Click here to Skip to main content

ASP Parser

, 28 Apr 2004 CPOL
Rate this:
Please Sign up or sign in to vote.
This article presents a simple way to parse and analyze ASP document structure.

Introduction

During my labor with ASP+, I developed a few design-time tools. I early encountered a problem that there was no way to analyze ASPX or ASCX document structure. Microsoft does not support something like ASP Document Object Model (similar to XML DOM). Therefore, I was forced to create my own ASP parser which analyzes the ASPX / ASCX document tags and creates a tree of objects that represent the structure of the document. The ASP DOM was very useful in my automatic localization mechanism but it could be used in many other cases.

Example

When creating a new ASPX document, it usually looks similar to this one:

<%@ Page language="c#" Codebehind="Empty.aspx.cs" 
   AutoEventWireup="false" Inherits="PersistingObjects.Empty" %>
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN" > 
<html>
  <head>
    <title>Empty</title>
    <meta name="GENERATOR" Content="Microsoft Visual Studio .NET 7.1">
    <meta name="CODE_LANGUAGE" Content="C#">
    <meta name=vs_defaultClientScript content="JavaScript">
    <meta name=vs_targetSchema 
      content="http://schemas.microsoft.com/intellisense/ie5">
  </head>
  <body MS_POSITIONING="GridLayout">
    <form id="Form1" method="post" runat="server">
    </form>
  </body>
</html>

After parsing the document with the presented ASP DOM, it can be displayed as a tree of objects, e.g.:

You can find here the parsed tree of tags that occur in the ASPX document. Each of the tags (including <OPEN_TAG>, </CLOSE_TAG>, <EMPTY_TAG/>, and tags that do not require closing tag, e.g.: <META>, <BR>) is represented by a tree node with a collection of attributes and child tags beneath it.

Details

To create the ASP Document, use the following code:

string asp = 
  "<asp:linkbutton id=\"LinkButton1\" runat="\""server\">text</asp:linkbutton>";
ASP.Document root = new ASP.Document(asp);

After the ASP.Document is created, it can be traversed recursively, e.g.:

public void Traverse(ASP.Tag tag)
{
    foreach(ASP.Tag child in tag.ChildTags)
    {
        Console.WriteLine(child.Value);

        foreach(ASP.Attribute attribute in child.Attributes)
        {
            Console.WriteLine(attribute.Key + "=" + attribute.Value);
        }

        Traverse(tag);
    }
}

As the ASP.Document is inherited from ASP.Tag, the Traverse method can be called like this:

Traverse(root)

Tags

The core object of the parsed ASPX/ASCX is ASP.Tag. There are a few types of tags: Root, Open, Close, Text, Directive, Codeand Comment. The type of the ASP.Tag can be checked with Tag.TagType property.

  1. Root tag - the only tag that has the type of Root is ASP.Document. This tag type does not contain attributes.
  2. Open tag - e.g.: <title>. This tag type can contain attributes.
  3. Close tag - e.g.: </title>. This tag type does not contain attributes.
  4. Text tag - each text that occurs between other tags, e.g.: (<title>TEXT</title>, the TEXT will occur as a Text tag in ASP.Document tree). This tag type does not contain attributes.
  5. Directive tag - each tag that begins with %@, e.g.: <%@ Page language= "c#" ... %>. This tag type can contain attributes.
  6. Codetag - each tag that begins and ends with % (except the Directive which begins with %@, and Comment which begins with %--), e.g.: <% this.DoSomething() %>. This tag type does not contain attributes.
  7. Comment tag - XML comment, e.g.: <!-- COMMENTED -->, or server side comment, e.g.: <%-- COMMENTED --%>. This tag type does not contain attributes.

Attributes

Each of the ASP.Tag objects has Tag.Attributes property that contains a list of attributes of the tag. The list is empty if the tag does not contain attributes or its type does not support attributes. Therefore only Open or Directivetag can contain non empty attributes list.

Each of attributes contains key (Attribute.Key) and value (Attribute.Value). The value can be empty string ("") if the attribute does not contain it.

Limitations

The main limitation I found in the presented ASP DOM is that it gives read only access to the parsed ASPX / ASCX document. It does not allow to modify the ASP.Document structure and save it back to a string.

Summary

This article does not describe all features of the presented ASP DOM but only briefly shows how to use it. I am sure that in most development cases, the ASP DOM is not needed, but I wanted to share what I gained to make your life easier. For those who will use it: enjoy it.

Source code attached to this article contains sample application that displays parsed ASPX document as a tree.

License

This article, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)

Share

About the Author

Marcin Celej
Web Developer
Poland Poland
No Biography provided

Comments and Discussions

 
GeneralMy vote of 5 PinmemberManjeet Singh0211-Oct-11 1:00 
GeneralVery nice, but... PinmemberMember 77182519-Feb-09 12:40 
GeneralExcellent! Pinmemberartemgolubev28-Jan-09 15:15 
GeneralAmazing, Pinmemberlagartron123mailcl21-Aug-08 14:27 
QuestionHow can i do it? PinmemberMember 178404929-Jul-08 11:50 
AnswerRe: How can i do it? PinmemberMarcin Celej30-Jul-08 7:56 
GeneralRe: How can i do it? PinmemberMember 178404930-Jul-08 9:21 
GeneralRe: How can i do it? PinmemberMarcin Celej30-Jul-08 12:21 
GeneralGreat job! Pinmemberhkenuam28-Jul-08 13:48 
GeneralRe: Great job! PinmemberMarcin Celej30-Jul-08 7:57 
GeneralVery nice indeed! [modified] PinmemberAlexandru Matei21-Jun-08 3:09 
GeneralRe: Very nice indeed! PinmemberMarcin Celej23-Jun-08 7:12 
GeneralRe: Very nice indeed! PinmemberAlexandru Matei23-Jun-08 7:17 
GeneralRe: Very nice indeed! PinmemberMarcin Celej23-Jun-08 8:26 
QuestionHow can i use it? PinmemberWave004-Jun-08 4:54 
AnswerRe: How can i use it? PinmemberMarcin Celej4-Jun-08 11:25 
GeneralNice Work Pinmemberhabanero7-Jun-06 3:03 
QuestionInteresting! Pinmemberrobert204828-Mar-06 4:00 
AnswerRe: Interesting! PinmemberMarcin Celej28-Mar-06 6:54 
GeneralRe: Interesting! Pinmemberrobert204829-Mar-06 1:03 
GeneralCool ! PinmemberTrance Junkie24-Jun-05 1:10 

General General    News News    Suggestion Suggestion    Question Question    Bug Bug    Answer Answer    Joke Joke    Rant Rant    Admin Admin   

Use Ctrl+Left/Right to switch messages, Ctrl+Up/Down to switch threads, Ctrl+Shift+Left/Right to switch pages.

| Advertise | Privacy | Terms of Use | Mobile
Web04 | 2.8.1411023.1 | Last Updated 29 Apr 2004
Article Copyright 2004 by Marcin Celej
Everything else Copyright © CodeProject, 1999-2014
Layout: fixed | fluid