Click here to Skip to main content
Click here to Skip to main content
Go to top

OCR With MODI in Visual C++

, 23 Jan 2007
Rate this:
Please Sign up or sign in to vote.
An article on how to use Microsoft Office Document Imaging Library (MODI) for OCR in Visual C++

MODI VC Demo

Introduction

Microsoft Office Document Imaging Library (MODI) which comes with the Office 2003 package, allows us easily integrate OCR functionality into our own applications. Although there is a good C# sample: "OCR with Microsoft® Office" posted on this web site, I would need something in C++. After searching on the Internet and the Microsoft web site and can't find anything good regarding MODI's OCR for Visual C++. I decided to dig this thing out and write this sample demo program to show the basic thing of MODI's OCR feature. I believe that some people may be interested in this program, so, I post it on the codeproject web site to share the common interest.

Project Background

This project was firstly started in Visual C++ 6.0 and then updated to Visual Studio .Net 2003 and I have included two project file in the demo program. To run it in Visual C++ 6.0, open MODIVCDemo.dsp manually.

Build Project and Use Code

Add MODI Active-X into the project

In visual C++ 6.0, click "Project->Add To Project->Components and Controls->Registered ActiveX Control" and select MODI ActiveX as shown below.

 MODI Active-X Control

Mapping Active-X into the project

MODI Active-X Control Mapping

Once map MODI Active-X control into the project, all Active-X control wrapped classes will be automatically added into the project.

HOW TO OCR it in Visual C++.

Following is the sample code showing how to use MODI for OCR.

BOOL CMODIVC6Dlg::bReadOCRByMODIAXCtrl(CString csFilePath, 
                                       CString &csText)
{
   BOOL bRet = TRUE;
   HRESULT hr = 0;
   csText.Empty();

   IUnknown *pVal = NULL;
   IDocument *IDobj = NULL;
   ILayout *ILayout = NULL;
   IImages *IImages = NULL;
   IImage *IImage = NULL;
   IWords *IWords = NULL;
   IWord *IWord = NULL;

   pVal = (IUnknown *) m_MIDOCtrl.GetDocument(); 

   if ( pVal != NULL )
   {
      //Already has image in it, Don't need to create again
      //Just get IDocument interface
      pVal->QueryInterface(IID_IDocument,(void**) &IDobj);
      if ( SUCCEEDED(hr) )
      {
         hr = IDobj->OCR(miLANG_SYSDEFAULT,1,1);

         if ( SUCCEEDED(hr) )
         {
            IDobj->get_Images(&IImages);
            long iImageCount=0;
    
            Images->get_Count(&iImageCount);
            for ( int img =0; img<iImageCount;img++)
            {
               IImages->get_Item(img,(IDispatch**)&IImage);
               IImage->get_Layout(&ILayout);

               long numWord=0;
               ILayout->get_NumWords(&numWord);
               ILayout->get_Words(&IWords);

               IWords->get_Count(&numWord);

               for ( long i=0; i<numWord;i++)
               {
                  IWords->get_Item(i,(IDispatch**)&IWord);
                  CString csTemp;
                  BSTR result;
                  IWord->get_Text(&result);
                  char buf[256];
                  sprintf(buf,"%S",result);
                  csTemp.Format("%s",buf);

                  csText += csTemp;
                  csText +=" ";
               }

            //Release all objects
            IWord->Release();
            IWords->Release();
            ILayout->Release();
            IImage->Release();
         }
         IImages->Release();

      } else {
         bRet = FALSE;
      }
   } else {
      bRet = FALSE;
   }

   IDobj->Close(0);
   IDobj->Release();
   pVal->Release();

   } else {
      bRet = FALSE;
   }

   return bRet;
}

That is!

Version History

Version 1: No Active-X ctrl in the dialogue, use bReadOCRByMODI(...)

Version 2: Add Active-X ctrl in the dialogue,use bReadOCRByMODIAXCtrl(...)

and version 2 is the 1st demo program posted on the codeproject.

License

This article has no explicit license attached to it but may contain usage terms in the article text or the download files themselves. If in doubt please contact the author via the discussion board below.

A list of licenses authors might use can be found here

Share

About the Author

donghuih
Software Developer (Senior)
United States United States

The author is a senior software engineer in US.


Comments and Discussions

 
QuestionA Simple C# Example PinmemberZamirF26-Sep-11 22:42 
BugCXPageException when OCR PinmemberAgentX300020-Jun-11 5:28 
GeneralPlease send me Source code Pinmembervenkadesh 201010-Mar-11 19:57 
QuestionExample doesn't work? PinmemberMember 442795525-Aug-10 7:10 
AnswerRe: Example doesn't work? Pinmemberdonghuih1-Feb-13 3:31 
QuestionPreffered Image resolution for better results Pinmembererappy18-May-09 0:03 
Questionc Pinmembersnawy20-Jan-09 18:54 
QuestionHow OCR method of MODI support Multi language? Pinmembersnawy20-Jan-09 18:22 
QuestionHow to include in a Visual C++ project? Pinmemberpaolopagano3-Aug-08 23:53 
QuestionOne Query...MS Office? PinmemberKBM7321-May-08 1:05 
Questiona question Pinmemberzhaoyanggolden30-Mar-08 15:54 
QuestionStill error after reinstalling Office 2003 PinmemberHooo26-Dec-07 18:39 
GeneralRe: Still error after reinstalling Office 2003 PinmemberYangchenlong20-Jan-08 20:04 
AnswerRe: Still error after reinstalling Office 2003 Pinmemberzhb_sh15-Mar-09 4:55 
Generalhi Pinmemberlulukuku20-Nov-07 5:41 
Generalthanks you! Pinmemberlulukuku20-Nov-07 5:40 
GeneralERROR!!! Pinmemberlulukuku12-Nov-07 6:08 
GeneralRe: ERROR!!! Pinmemberdonghuih19-Nov-07 10:03 
Questionneural network [modified] Pinmemberlulukuku10-Nov-07 5:27 
GeneralI use it but always Error for both two version,how to fix it out PinmemberThomas Shao9-Oct-07 20:57 
AnswerRe: I use it but always Error for both two version,how to fix it out PinmemberChristoph Reimmann9-Oct-07 23:57 
GeneralRe: I use it but always Error for both two version,how to fix it out PinmemberThomas Shao16-Oct-07 21:30 
GeneralRe: I use it but always Error for both two version,how to fix it out Pinmembervenkadesh 201010-Mar-11 19:52 
GeneralRe: I use it but always Error for both two version,how to fix it out PinmemberWiemAbdulAzis22-Jul-09 22:03 
QuestionHow to set a text selection ? PinmemberChristoph Reimmann27-Sep-07 1:11 

General General    News News    Suggestion Suggestion    Question Question    Bug Bug    Answer Answer    Joke Joke    Rant Rant    Admin Admin   

Use Ctrl+Left/Right to switch messages, Ctrl+Up/Down to switch threads, Ctrl+Shift+Left/Right to switch pages.

| Advertise | Privacy | Mobile
Web01 | 2.8.140916.1 | Last Updated 24 Jan 2007
Article Copyright 2007 by donghuih
Everything else Copyright © CodeProject, 1999-2014
Terms of Service
Layout: fixed | fluid