Click here to Skip to main content
15,867,308 members
Articles / Operating Systems / Windows
Article

A managed wrapper for the HTML Tidy library

Rate me:
Please Sign up or sign in to vote.
4.83/5 (17 votes)
12 Jan 2007CPOL2 min read 140.3K   2.6K   28   33
A managed C++ for a small part of the HTML Tidy C library

(For the latest changes, please see the history section at the end of the article)

Introduction

This is a small library in its initial creation state to provide a native .NET way in accessing the functions of the HTML Tidy library.

HTML Tidy is an open source C library for checking and generating clean XHTML/HTML. In other words: You can throw a misformatted HTML to the library and it will do its best to repair the errors and clean unnecessary items/tags from the HTML.

The Library

There already does exist a way to access the library from .NET, namely through the ATL wrapper of Charles Reitzel (SourceForge CVS repository of the sources here). But you need to register the COM ActiveX control first.

To get rid of this registration limitation, I created a C++/CLI wrapper of the original C library of HTML Tidy. This wrapper is a normal library that you can use in your .NET applications by simply adding a reference to the library.

Please note that my created library currently does not deserve to be called "library", because it really just consists of one single function until now.

The reason why I still do publish it here and now is that I want to provide the basic idea as early as possible to anyone being in the same situation than me (by needing a .NET wrapper for HTML Tidy). It's rather ease to take my library as a starting point and add the required functions you need. I did the core work, you simply add the functions you like.

Of course I gradually will add more functions to the library, as my requirements grow. And I also do encourage you to enhance it by yourself and send me your code so that I can include it.

The underlying C library

It was a pleasure to compile the original HTML Tidy C library. After first starting the provided Visual Studio .NET project file, compiled it for debug and release, received no errors, no warnings. Amazing! I never had such a seamless experience with compiling foreign C/C++ libraries.

Using my .NET library

The library currently has one function to call:

public string CleanHtml( string html );

Simply pass a string and get a cleaned up string back. Easy, isn't it?

An example usage could be:

using ( HtmlTidy tidy = new HtmlTidy() )
{
  string html =
    @"
    <html>
      <head>
        <meta http-equiv=""Content-Type"" content=""text/html; charset=utf-16"">
      </head>
      <body>
        <p>Hello, <b><i>With German</b></i>: ÄÖÜ. Some Chinese: 讪.</p>
      <body>
    </html>
    ";

 
  string s = tidy.CleanHtml(
    html,
    HtmlTidyOptions.ConvertToXhtml );

 
  Console.WriteLine( s );
}

As you see, simply pass the string to the function. There is an overload with one option (currently, will be enhanced in the future, too).

Redistributing

In order to redistribute the library, please ensure that the Microsoft CRT runtime DLLs "msvcr80.dll", "msvcm80.dll" and "msvcp80.dll" are also being distributed. The libraries are usually being found in the folder "C:\Program Files\Microsoft Visual Studio 8\VC\redist\x86\Microsoft.VC80.CRT".

History

  • 2007-01-14
    Added the section about redistributing the CRT library.
     
  • 2007-01-12
    First version published.

License

This article, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)


Written By
Chief Technology Officer Zeta Software GmbH
Germany Germany
Uwe does programming since 1989 with experiences in Assembler, C++, MFC and lots of web- and database stuff and now uses ASP.NET and C# extensively, too. He has also teached programming to students at the local university.

➡️ Give me a tip 🙂

In his free time, he does climbing, running and mountain biking. In 2012 he became a father of a cute boy and in 2014 of an awesome girl.

Some cool, free software from us:

Windows 10 Ereignisanzeige  
German Developer Community  
Free Test Management Software - Intuitive, competitive, Test Plans.  
Homepage erstellen - Intuitive, very easy to use.  
Offline-Homepage-Baukasten

Comments and Discussions

 
Questionfine Job Pin
yves3142-May-13 23:28
yves3142-May-13 23:28 
GeneralTargeting 64-bit Pin
Nick Higgs23-Aug-07 23:56
Nick Higgs23-Aug-07 23:56 
GeneralThe dll cannot run in ASP.NET Medium trust application Pin
tsandl13-Mar-07 10:37
tsandl13-Mar-07 10:37 
GeneralRe: The dll cannot run in ASP.NET Medium trust application Pin
Uwe Keim13-Mar-07 19:22
sitebuilderUwe Keim13-Mar-07 19:22 
GeneralRe: The dll cannot run in ASP.NET Medium trust application Pin
tsandl13-Mar-07 19:52
tsandl13-Mar-07 19:52 
GeneralRe: The dll cannot run in ASP.NET Medium trust application Pin
Uwe Keim13-Mar-07 20:28
sitebuilderUwe Keim13-Mar-07 20:28 
GeneralRe: The dll cannot run in ASP.NET Medium trust application Pin
tsandl14-Mar-07 9:28
tsandl14-Mar-07 9:28 
GeneralRe: The dll cannot run in ASP.NET Medium trust application Pin
Thomas Haller2-Jul-13 3:14
Thomas Haller2-Jul-13 3:14 
GeneralBug Pin
mike_mmmm1-Mar-07 3:22
mike_mmmm1-Mar-07 3:22 
GeneralGreat tool! Pin
Koen_V28-Feb-07 21:39
Koen_V28-Feb-07 21:39 
GeneralHave a look at tidyfornet Pin
Frederik198421-Feb-07 9:50
Frederik198421-Feb-07 9:50 
GeneralRe: Have a look at tidyfornet Pin
Uwe Keim21-Feb-07 18:17
sitebuilderUwe Keim21-Feb-07 18:17 
GeneralRe: Have a look at tidyfornet Pin
mike_mmmm1-Mar-07 2:39
mike_mmmm1-Mar-07 2:39 
Generalbug - with sample file Pin
mike_mmmm17-Feb-07 15:04
mike_mmmm17-Feb-07 15:04 
GeneralRe: bug - with sample file Pin
Uwe Keim17-Feb-07 19:30
sitebuilderUwe Keim17-Feb-07 19:30 
GeneralRe: bug - with sample file Pin
mike_mmmm17-Feb-07 21:38
mike_mmmm17-Feb-07 21:38 
GeneralRe: bug - with sample file Pin
mike_mmmm17-Feb-07 21:40
mike_mmmm17-Feb-07 21:40 
GeneralHe does it again... Pin
Chris Maunder13-Feb-07 18:40
cofounderChris Maunder13-Feb-07 18:40 
GeneralRe: He does it again... Pin
Uwe Keim13-Feb-07 19:05
sitebuilderUwe Keim13-Feb-07 19:05 
GeneralNice job Pin
Clickok13-Feb-07 15:03
Clickok13-Feb-07 15:03 
GeneralRe: Nice job Pin
Uwe Keim13-Feb-07 18:13
sitebuilderUwe Keim13-Feb-07 18:13 
Generalbig problem with tidy Pin
mike_mmmm10-Feb-07 19:11
mike_mmmm10-Feb-07 19:11 
GeneralRe: big problem with tidy Pin
mike_mmmm10-Feb-07 19:15
mike_mmmm10-Feb-07 19:15 
GeneralRe: big problem with tidy Pin
Uwe Keim10-Feb-07 20:17
sitebuilderUwe Keim10-Feb-07 20:17 
GeneralRe: big problem with tidy Pin
Michael Sandrock21-Jun-07 17:03
Michael Sandrock21-Jun-07 17:03 

General General    News News    Suggestion Suggestion    Question Question    Bug Bug    Answer Answer    Joke Joke    Praise Praise    Rant Rant    Admin Admin   

Use Ctrl+Left/Right to switch messages, Ctrl+Up/Down to switch threads, Ctrl+Shift+Left/Right to switch pages.