NHunspell - Hunspell for the .NET platform

Thomas Maierhofer (Tom), 21 Jul 2014, LGPL3
The spell checking and hyphenation features of OpenOffice for the .NET platform.

Introduction

I was looking for a good spell checker and hyphenation library for .NET and found the free (LGPL-licensed) Hunspell spell checker and Hyphen libraries used in OpenOffice. Hunspell wasn't available for the .NET platform, so I decided to write a wrapper/port. Conveniently, many of the OpenOffice dictionaries are LGPL-licensed too and can be used in proprietary applications.

Interop code to the native Hunspell functions

I used Managed C++ (C++/CLI) to write the wrapper/port because it let me use the original source code of Hunspell and Hyphen directly. Writing the interop code between the managed classes and the unmanaged Hunspell and Hyphen libraries was straightforward. The original source code is almost unchanged, so new versions of Hunspell or Hyphen can be adopted easily. Hunspell and Hyphen use unmanaged memory functions, so I implemented the IDisposable interface, which lets callers free the unmanaged memory deterministically instead of waiting for the garbage collector.

Because Hunspell uses UTF-8 encoding, I had to provide conversion functions to and from UTF-8:

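// Converts a managed String to a zero-terminated UTF-8 buffer in unmanaged memory.
// The caller owns the buffer and must release it with Marshal::FreeHGlobal.
char * NHunspell::MarshalHelper::AllocUTF8FromString(String ^value)
{
    array<Byte> ^ byteArray = Encoding::UTF8->GetBytes(value);
    // One extra byte for the terminating zero
    int size = Marshal::SizeOf(byteArray[0]) * (byteArray->Length + 1);
    IntPtr buffer = Marshal::AllocHGlobal(size);
    Marshal::Copy(byteArray, 0, buffer, byteArray->Length);
    Marshal::WriteByte(buffer, size - 1, 0);
    return (char *) buffer.ToPointer();
}

// Converts a zero-terminated UTF-8 buffer to a managed String.
String ^ NHunspell::MarshalHelper::AllocStringFromUTF8( char * value )
{
    // strlen counts bytes, not characters
    int size = (int) strlen(value);
    array<Byte> ^ byteArray = gcnew array<Byte>(size);
    Marshal::Copy(IntPtr(value), byteArray, 0, size);
    return Encoding::UTF8->GetString(byteArray);
}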
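
The helpers above size their buffers in bytes rather than characters, and they reserve one extra byte for the terminating zero. The following is a minimal native C++ sketch of the same idea, not the NHunspell code itself: the names AllocUtf8Copy and StringFromUtf8 are hypothetical, and std::malloc/std::free stand in for Marshal::AllocHGlobal and Marshal::FreeHGlobal.

```cpp
#include <cassert>
#include <cstdlib>
#include <cstring>
#include <string>

// Hypothetical helper (not part of NHunspell): copies UTF-8 bytes into a
// freshly allocated, zero-terminated buffer. The caller owns the buffer
// and must release it with std::free.
char* AllocUtf8Copy(const std::string& utf8)
{
    const std::size_t size = utf8.size() + 1;  // byte count plus terminator
    char* buffer = static_cast<char*>(std::malloc(size));
    if (buffer == nullptr)
        return nullptr;
    std::memcpy(buffer, utf8.data(), utf8.size());
    buffer[size - 1] = '\0';
    return buffer;
}

// Hypothetical helper: rebuilds a std::string from a zero-terminated buffer.
// std::strlen counts bytes up to the terminator, not characters.
std::string StringFromUtf8(const char* value)
{
    assert(value != nullptr);
    return std::string(value, std::strlen(value));
}
```

A word such as "Straße" has six characters but seven UTF-8 bytes, so the buffer needs eight bytes including the terminator. In the managed helper, the matching cleanup for a buffer obtained from Marshal::AllocHGlobal is Marshal::FreeHGlobal.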

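Another important task is handling the unmanaged memory. I implemented a destructor and a finalizer to deal with this. In C++/CLI, the destructor (~Hunspell) is what Dispose() calls, and the finalizer (!Hunspell) is the fallback run by the garbage collector: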

NHunspell::Hunspell::~Hunspell()
{
    this->!Hunspell();
}

NHunspell::Hunspell::!Hunspell()
{
    if( handle != 0 )
    {
        delete handle;
        handle = 0;
    }
}

bool NHunspell::Hunspell::IsDisposed::get()
{
    return handle == 0;
}
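
The null check and the reset of handle are what make the cleanup above safe to run more than once, for example when Dispose() is called explicitly and the object is cleaned up again later. The following is a minimal native C++ sketch of that idempotent release, under stated assumptions: ResourceOwner and NativeResource are hypothetical names, and Release() plays the role of the finalizer body.

```cpp
#include <cassert>

// Hypothetical stand-in for the native object a wrapper would own.
struct NativeResource
{
    int value = 42;
};

// Hypothetical owner class (not NHunspell's): Release() mirrors the
// finalizer body, and the destructor simply forwards to it.
class ResourceOwner
{
public:
    ResourceOwner() : handle(new NativeResource()) {}
    ~ResourceOwner() { Release(); }

    // Copying would lead to a double delete, so it is disabled.
    ResourceOwner(const ResourceOwner&) = delete;
    ResourceOwner& operator=(const ResourceOwner&) = delete;

    // Safe to call any number of times: after the first call the
    // handle is null and the body is skipped.
    void Release()
    {
        if (handle != nullptr)
        {
            delete handle;
            handle = nullptr;
        }
    }

    bool IsDisposed() const { return handle == nullptr; }

    // Using the resource after release is a programming error.
    int Value() const
    {
        assert(handle != nullptr);
        return handle->value;
    }

private:
    NativeResource* handle;
};
```

Calling Release() twice on the same object leaves it disposed and performs only one delete, which is the property the destructor/finalizer pair relies on.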

NHunspell spell checking and hyphenation sample

This is a short demo of how to use NHunspell for spell checking, suggestions, and hyphenation:

// Assumes: using System; using System.Collections.Generic; using NHunspell;
Console.WriteLine("NHunspell functions and classes demo");

Console.WriteLine("");
Console.WriteLine("Spell Check with Hunspell");

// Important: Hunspell uses unmanaged memory, so you have to
// honor the IDisposable pattern.
// Here this is done with a using block,
// but you can also call hunspell.Dispose() explicitly.
using (Hunspell hunspell = new Hunspell("en_us.aff", "en_us.dic"))
{
    Console.WriteLine("Check if the word 'Recommendation' is spelled correctly");
    bool correct = hunspell.Spell("Recommendation");
    Console.WriteLine("Recommendation is spelled " +
              (correct ? "correctly" : "incorrectly"));

    Console.WriteLine("");
    Console.WriteLine("Make suggestions for the word 'Recommendatio'");
    List<string> suggestions = hunspell.Suggest("Recommendatio");
    Console.WriteLine("There are " + suggestions.Count.ToString() + 
                      " suggestions" );
    foreach (string suggestion in suggestions)
    {
        Console.WriteLine("Suggestion is: " + suggestion );
    }
}

Console.WriteLine("");
Console.WriteLine("Hyphenation with Hyphen");

// Important: Hyphen uses unmanaged memory, so you have to
// honor the IDisposable pattern.
// Here this is done with a using block,
// but you can also call hyphen.Dispose() explicitly.
using (Hyphen hyphen = new Hyphen("hyph_en_us.dic"))
{
    Console.WriteLine("Get the hyphenation of the word 'Recommendation'"); 
    HyphenResult hyphenated = hyphen.Hyphenate("Recommendation");
    Console.WriteLine("'Recommendation' is hyphenated as: " + 
                      hyphenated.HyphenatedWord ); 
}

Console.WriteLine("");
Console.WriteLine("Press any key to continue...");
Console.ReadKey();

Because Hunspell is native C++ code, you must include the correct assembly for your platform. On x86 (32-bit) platforms, use the NHunspell.dll from the X86 folder. On x64 (64-bit) platforms, use the NHunspell.dll from the X64 folder.

License

This article, along with any associated source code and files, is licensed under The GNU Lesser General Public License (LGPLv3)

About the Author

I'm working on a new project called Crawler-Lib, a generalized back-end processing and hosting framework for Microsoft .NET and Mono.

Last Updated 21 Jul 2014
Article Copyright 2009 by Thomas Maierhofer (Tom)