Click here to Skip to main content
15,867,686 members
Articles / Programming Languages / C++
Article

OCR With MODI in Visual C++

Rate me:
Please Sign up or sign in to vote.
3.81/5 (20 votes)
23 Jan 20071 min read 233.3K   17K   61   55
An article on how to use Microsoft Office Document Imaging Library (MODI) for OCR in Visual C++

MODI VC Demo

Introduction

Microsoft Office Document Imaging Library (MODI) which comes with the Office 2003 package, allows us easily integrate OCR functionality into our own applications. Although there is a good C# sample: "OCR with Microsoft® Office" posted on this web site, I would need something in C++. After searching on the Internet and the Microsoft web site and can't find anything good regarding MODI's OCR for Visual C++. I decided to dig this thing out and write this sample demo program to show the basic thing of MODI's OCR feature. I believe that some people may be interested in this program, so, I post it on the codeproject web site to share the common interest.Project Background

This project was firstly started in Visual C++ 6.0 and then updated to Visual Studio .Net 2003 and I have included two project file in the demo program. To run it in Visual C++ 6.0, open MODIVCDemo.dsp manually.

Build Project and Use Code

Add MODI Active-X into the project

In visual C++ 6.0, click "Project->Add To Project->Components and Controls->Registered ActiveX Control" and select MODI ActiveX as shown below.

 MODI Active-X Control

Mapping Active-X into the project

MODI Active-X Control Mapping

Once map MODI Active-X control into the project, all Active-X control wrapped classes will be automatically added into the project.

HOW TO OCR it in Visual C++.

Following is the sample code showing how to use MODI for OCR.

C++
BOOL CMODIVC6Dlg::bReadOCRByMODIAXCtrl(CString csFilePath, 
                                       CString &csText)
{
   BOOL bRet = TRUE;
   HRESULT hr = 0;
   csText.Empty();

   IUnknown *pVal = NULL;
   IDocument *IDobj = NULL;
   ILayout *ILayout = NULL;
   IImages *IImages = NULL;
   IImage *IImage = NULL;
   IWords *IWords = NULL;
   IWord *IWord = NULL;

   pVal = (IUnknown *) m_MIDOCtrl.GetDocument(); 

   if ( pVal != NULL )
   {
      //Already has image in it, Don't need to create again
      //Just get IDocument interface
      pVal->QueryInterface(IID_IDocument,(void**) &IDobj);
      if ( SUCCEEDED(hr) )
      {
         hr = IDobj->OCR(miLANG_SYSDEFAULT,1,1);

         if ( SUCCEEDED(hr) )
         {
            IDobj->get_Images(&IImages);
            long iImageCount=0;
    
            Images->get_Count(&iImageCount);
            for ( int img =0; img<iImageCount;img++)
            {
               IImages->get_Item(img,(IDispatch**)&IImage);
               IImage->get_Layout(&ILayout);

               long numWord=0;
               ILayout->get_NumWords(&numWord);
               ILayout->get_Words(&IWords);

               IWords->get_Count(&numWord);

               for ( long i=0; i<numWord;i++)
               {
                  IWords->get_Item(i,(IDispatch**)&IWord);
                  CString csTemp;
                  BSTR result;
                  IWord->get_Text(&result);
                  char buf[256];
                  sprintf(buf,"%S",result);
                  csTemp.Format("%s",buf);

                  csText += csTemp;
                  csText +=" ";
               }

            //Release all objects
            IWord->Release();
            IWords->Release();
            ILayout->Release();
            IImage->Release();
         }
         IImages->Release();

      } else {
         bRet = FALSE;
      }
   } else {
      bRet = FALSE;
   }

   IDobj->Close(0);
   IDobj->Release();
   pVal->Release();

   } else {
      bRet = FALSE;
   }

   return bRet;
}

That is!

Version History

Version 1: No Active-X ctrl in the dialogue, use bReadOCRByMODI(...)

Version 2: Add Active-X ctrl in the dialogue,use bReadOCRByMODIAXCtrl(...)

and version 2 is the 1st demo program posted on the codeproject.

License

This article has no explicit license attached to it but may contain usage terms in the article text or the download files themselves. If in doubt please contact the author via the discussion board below.

A list of licenses authors might use can be found here


Written By
Software Developer (Senior)
United States United States

The author is a senior software engineer in US.


Comments and Discussions

 
Questioncan I deploy it without office2003 installed Pin
liberpike21-Jun-07 4:37
liberpike21-Jun-07 4:37 
QuestionUnable to add ref to MODI in VC++ 8.0. Pin
Purusothaman Chennai8-Jun-07 2:51
Purusothaman Chennai8-Jun-07 2:51 
AnswerRe: Unable to add ref to MODI in VC++ 8.0. Pin
$uresh $hanmugam11-Jun-07 1:18
$uresh $hanmugam11-Jun-07 1:18 
Questionhow 2 office 2007 Pin
Elaaber26-May-07 23:04
Elaaber26-May-07 23:04 
AnswerRe: how 2 office 2007 Pin
talbot18-Feb-09 4:28
talbot18-Feb-09 4:28 
GeneralOCR from bitmap Pin
Md. Mazharul Islam Khan26-Apr-07 18:52
Md. Mazharul Islam Khan26-Apr-07 18:52 
GeneralRe: OCR from bitmap Pin
B4stard26-Apr-07 20:35
B4stard26-Apr-07 20:35 
GeneralOnly works on second try... [modified] Pin
B4stard16-Apr-07 19:48
B4stard16-Apr-07 19:48 
Hi,

I've been playing around using the code from CMODIVCDemoDlg::bReadOCRByMODI (OCR without the activeX control in the dialog) in my project - it seems the first time it is executed, the IDobj->Create fails, and then it works on the second run...

Any ideas why this would be happening?

Thanks Smile | :)

-- modified at 2:27 Tuesday 17th April, 2007

It works every time when I do this -
<br />
hr = IDobj->Create(AsciiToBSTR(csFilePath));<br />
<br />
// following two lines added<br />
if (FAILED(hr))<br />
  hr = IDobj->Create(AsciiToBSTR(csFilePath));<br />
<br />
if (SUCCEEDED(hr))<br />
...<br />

It's a workaround, but I'm still curious as to why it doesn't succeed the first time...
GeneralRe: Only works on second try... Pin
byallop3-Jan-08 7:15
byallop3-Jan-08 7:15 
AnswerRe: Only works on second try... Pin
byallop9-Jan-08 10:02
byallop9-Jan-08 10:02 
GeneralProblem for long running applications that use MODI, 1500 document max! Pin
C-codist5-Apr-07 14:09
C-codist5-Apr-07 14:09 
GeneralThanks for referencing my article Pin
Martin Welker25-Jan-07 21:39
Martin Welker25-Jan-07 21:39 
GeneralVery good, but a little crash on close... Pin
Thierry Maurel24-Jan-07 0:10
Thierry Maurel24-Jan-07 0:10 
GeneralRe: Very good, but a little crash on close... Pin
donghuih26-Jan-07 11:40
donghuih26-Jan-07 11:40 
GeneralRe: Very good, but a little crash on close... Pin
sybase_20061-Feb-07 17:53
sybase_20061-Feb-07 17:53 
GeneralRe: Very good, but a little crash on close... Pin
xy_sun22-May-11 5:18
xy_sun22-May-11 5:18 

General General    News News    Suggestion Suggestion    Question Question    Bug Bug    Answer Answer    Joke Joke    Praise Praise    Rant Rant    Admin Admin   

Use Ctrl+Left/Right to switch messages, Ctrl+Up/Down to switch threads, Ctrl+Shift+Left/Right to switch pages.