5,276,406 members and growing! (18,658 online)
Email Password   helpLost your password?
General Programming » String handling » Strings     Intermediate License: The GNU Lesser General Public License

NRTFTree - A class library for RTF processing in C#

By sgoliver

Class library to manage RTF files.
C#Windows, .NET, .NET 1.1, Win2K, WinXPVS.NET2003, VS, Dev

Posted: 13 Aug 2005
Updated: 7 Sep 2007
Views: 59,328
Announcements
Want a new Job?



Search    
Advanced Search
Sitemap
20 votes for this Article.
Popularity: 5.20 Rating: 4.00 out of 5
1 vote, 5.0%
1
0 votes, 0.0%
2
5 votes, 25.0%
3
2 votes, 10.0%
4
12 votes, 60.0%
5
NRtfTree Demo Screenshot

Introduction

NRtfTree Library (LGPL) is a set of classes written entirely in C# that may be used to manage RTF documents in your own applications. NRtfTree will help you:

  • Open and parse RTF files.
  • Analyze the content of RTF files.
  • Add, modify and remove document elements (i.e. text, control words, control symbols).
  • Create new RTF documents.

Background

RTF (Rich Text Format) is a method of encoding formatted text and graphics for easy transfer between applications. An RTF document can contain text, images, tables, lists, hyperlinks and many other text and graphic elements. In addition, RTF is the format used internally by the RichTextBox control included as part of .NET Framework. Nevertheless, its functionality is not enough to satisfy all aspects of RTF file management.

Using the Code

NRtfTree has two modes of operation:

  1. DOM-like mode: RTF documents are loaded in a tree structure and are provided several methods to traverse it, access tag contents and modify or create new nodes. This implementation requires the entire content of a document to be parsed and stored in memory.

    In this mode, the main classes are RtfTree and RtfTreeNode:

  2. SAX-like mode: RTF file parser is implemented as an event-driven model in which the programmer provides callback methods that are invoked by the parser as part of its traversal of the RTF document.

    In this mode, the main classes are RtfReader and SARParser:

Examples

The following lines show how you can use the class library in your own code.

  1. DOM-like mode

    This code loads an RTF document into an RtfTree object and inspects all the child nodes:

    public void doSomething()
    {
        //Create the RTF tree object
    
        RtfTree tree = new RtfTree();
    
        //Load and parse RTF document
    
        tree.LoadRtfFile("c:\rtfdoc.rtf");
        
        //Get root node
    
        RtfTreeNode root = tree.RootNode;
    
        RtfTreeNode node = new RtfTreeNode();
    
        for(int i = 0; i < root.ChildNodes.Count; i++)
        {
            node = root.ChildNodes[i];
    
            if(node.NodeType == RTF_NODE_TYPE.GROUP)
            {
                //...
    
            }
            else if(node.NodeType == RTF_NODE_TYPE.CONTROL)
            {
                //...
    
            }
            else if(node.NodeType == RTF_NODE_TYPE.KEYWORD)
            {
                switch(nodo.NodeKey)
                    {
                    case "f":  //Font type
    
                    //...
    
                    break;
                case "cf":  //Font color
    
                    //...
    
                    break;
                case "fs":  //Font size
    
                    //...
    
                    break;
                }
            }
            else if(node.NodeType == RTF_NODE_TYPE.TEXT)
            {
                //...
    
            }
        }
    }
  2. SAX-like mode

    This is an example of the implementation of a simple rft sax-parser:

    public class MyParser : SARParser
    {
        //...
    
    
        public override void StartRtfDocument()
        {
          doc += 
            "<?xml version=\"1.0\" encoding=\"ISO-8859-1\"?>\r\n";
    
          doc += "<DOCUMENT>\r\n";
        }
    
        public override void EndRtfDocument()
        {
            doc += "\r\n</DOCUMENT>";
        }
        
        public override void StartRtfGroup()
        {
            //...
    
        }
    
        public override void EndRtfGroup()
        {
            //...
    
        }
    
        public override void RtfControl(string key, 
                                bool hasParam, int param)
        {
            //..
    
        }
    
        public override void RtfKeyword(string key, 
                                bool hasParam, int param)
        {
            switch(key)
            {
               case "b":  //bold font
    
                    //...
    
                    break;
               case "i":  //Italic font
    
                    //...
    
                    break;
               //...
    
            }
        }
    
        public override void RtfText(string text)
        {
            doc += text;
        }
    }

    Once you have completed the parser, you can start parsing the RTF document by calling the function RtfReader.Parse(). Then the handlers for the configured events are automatically called as many times as necessary:

    //Create the parser
    
    MiParser parser = new MyParser(res);
    
    //Create the reader and associate the parser
    
    reader = new RtfReader(parser);
    
    //Load the RTF document
    
    reader.LoadRtfFile(rutaRTF);
    
    //Start parsing
    
    reader.Parse();
  3. RtfDocument class

    You can create new RTF documents using the new class RtfDocument (beta):

    RtfDocument doc = new RtfDocument("testdoc.rtf");
    
    RtfTextFormat format = new RtfTextFormat();
    format.size = 20;
    format.bold = true;
    format.underline = true;
    
    doc.AddText("Title", format);
    doc.AddNewLine();
    doc.AddNewLine();
    
    format.size = 12;
    format.bold = false;
    format.underline = false;
    
    doc.AddText("This is a test.", format); 
    doc.AddText("This is a text.");
    
    doc.AddNewLine();
    
    doc.AddImage("test.png", 50, 50);
    
    doc.Close();

Software License

NRtfTree Library is licensed under the GNU LGPL license.

More Information

You can find up-to-date information on my personal home page (Spanish) or NRtfTree SourceForge Project (English).

References

History

  • 2007/09/02 - v0.3.0 beta 1
    • New license: LGPL.
    • New classes to create RTF documents (basic support in beta): RtfDocument, RtfColorTable, RtfFontTable and RtfTextFormat.
    • RtfTree class:
      • New property MergeSpecialCharacters. When it is set to true, if special character is found ('\') it is converted to Text node and eventually merged to adjacent text nodes.
      • New property Text. Returns plain text from the RTF document.
      • New method GetEncoding(). Returns document encoding.
    • RtfTreeNode class:
      • New property Tree. Returns a reference to owner RTF tree.
      • New method To String().
      • New method InsertChild(). Inserts a new node at the specified location.
      • Methods SelectXXXByType() have been replaced by SelectXXX() overloads.
      • New methods SelectSibling() (3 overloads).
    • RtfNodeCollection class:
      • New method Insert(). Inserts a new node at the specified location.
      • New method RemoveRange(). Remove a range of nodes from the list.
    • InfoGroup class:
      • New method ToString().
    • Fixed Bugs:
      • Group and Root node types initialization with "ROOT" and "GROUP".
      • NRtfTree.Rtf property didn't include last '}' in a group node RTF code.
      • NRtfTree does not treat correctly special characters '\', '{' and '}' as part of the text.
      • Methods RtfTreeNode.AppendChild() and InsertChild() should update Root and Tree properties recursively.
  • 2006/12/10 - v0.2.1
    • Fixed - Bug in NRtfTree.SaveRtf() - Special character hex codes with one digit.
  • 2005/12/17 - v0.2.0
    • New namespaces: Net.Sgoliver.NRtfTree.Core and Net.Sgoliver.NRtfTree.Util
    • New classes: ImageNode, ObjectNode, InfoGroup.
    • RtfTreeNode class:
      • New properties: LastChild, NextSibling, PreviousSibling, Rtf.
      • New methods: CloneNode(), HasChildNodes(), SelectSingleNode(), SelectSingleChildNode(), SelectChildNodes(), SelectNodes(), SelectSingleChildNodeType(), SelectChildNodesByType(), SelectNodesByType(), SelectSingleNodeByType().
      • New indexer [equivalent to SelectSingleChildNode()].
      • Some optimization changes.
    • RtfTree class:
      • New methods: ToStringEx(), SaveRtf(), GetColorTable() y GetFontTable() y GetInfoGroup()
      • Some optimization changes.
      • Some bugs fixed.
    • RtfNodeCollection class:
      • New methods: IndexOf(), AddRange()
    • RtfLex class:
      • parseText() now ignores new line, tabs and null characters.
      • Some optimization changes.
  • 2005/08/13 - v0.1
    • First public release.

License

This article, along with any associated source code and files, is licensed under The GNU Lesser General Public License

About the Author

sgoliver


Currently, i work for a great consulting company as a software developer.

My homepage is:
http://www.sgoliver.net
Occupation: Web Developer
Location: Spain Spain

Other popular String handling articles:

Article Top
Sign Up to vote for this article
You must Sign In to use this message board.
FAQ FAQ Noise ToleranceSearch Search Messages 
 Layout  Per page   
 Msgs 1 to 25 of 39 (Total in Forum: 39) (Refresh)FirstPrevNext
Subject  Author Date 
GeneralMerging RTF documentsmemberOlli Nissinen0:32 18 Jun '08  
GeneralIssue with other charactersets!memberMember 35493141:04 28 May '08  
QuestionMerge Codesmemberbeachbeamer16:55 15 May '08  
QuestionHow I Can Find and Replace textmemberladakana22:11 3 Apr '08  
GeneralHTMLmemberPQSIK13:12 24 Nov '07  
AnswerRe: HTML [modified]membersgoliver23:11 25 Nov '07  
GeneralOut of memory exceptionmemberOrsol23:58 26 Sep '07  
AnswerRe: Out of memory exceptionmembersgoliver2:55 29 Sep '07  
GeneralRe: Out of memory exceptionmemberMember 3238379:00 4 Apr '08  
AnswerRe: Out of memory exceptionmembersgoliver5:42 5 Apr '08  
GeneralIts fantastic, but the table in rtf is not get parsed [modified]memberchand.m2:19 25 May '07  
GeneralC++ conversion of the Librarymemberalamgir mohammed18:25 16 Jan '07  
AnswerRe: C++ conversion of the Librarymembersgoliver9:32 19 Jan '07  
GeneralRe: C++ conversion of the Librarymembercode_discuss21:33 21 Mar '08  
GeneralRe: C++ conversion of the Librarymembersgoliver4:43 30 Mar '08  
General'Font TablememberHolpit0:35 16 Jan '07  
QuestionMissed codememberMTchary10:38 29 Dec '06  
AnswerRe: Missed codemembersgoliver8:24 4 Jan '07  
GeneralImages in Rtf documentmembertmak19:28 11 May '06  
AnswerRe: Images in Rtf document [modified]membersgoliver10:12 28 May '06  
GeneralRe: Images in Rtf documentmemberSuper Lloyd20:30 26 Jun '06  
AnswerRe: Images in Rtf documentmembersgoliver4:28 27 Jun '06  
GeneralRe: Images in Rtf documentmemberSuper Lloyd4:39 27 Jun '06  
GeneralHTML translation of RTF contentmemberdaluu14:02 3 Mar '06  
GeneralRe: HTML translation of RTF contentmemberFlying Jett1:34 4 Feb '07  

General General    News News    Question Question    Answer Answer    Joke Joke    Rant Rant    Admin Admin   

PermaLink | Privacy | Terms of Use
Last Updated: 7 Sep 2007
Editor: Genevieve Sovereign
Copyright 2005 by sgoliver
Everything else Copyright © CodeProject, 1999-2008
Web15 | Advertise on the Code Project