I have data which looks something like the following...
1/2 NAME 1/3--NAME -4----NAME
The data is fixed font and columns are strictly aligned (you might want to extract to a fixed font window and you'll see that all the NAMEs are lined up). The dashes are handwritten and therefore a better, or lesser, representation of a dash.
I've been trying to use TableExtractor to pull out just the number (including the slash) but, for example on the second and third portions, TableExtractor pulls out the NAME into the "uncorrected" column.
I thought that the columns on the overlap would limit OCR to that portion of the document. Am I wrong? If not, why does the OCR "bleed" into the next template column and pick out the name?
Papadeltasierra.
|
Hello. When I open a document image, select a table, and then press the OCR button, I get a dialog saying "OCR: bad language". What does this mean?
tt
|
I want to use your table extractor program to scan multiple pages using an ADF on my fax machine. Is this possible? It looks like it will do what I want, but I need it to scan the same position across multiple documents and then export that information into Excel so I can use it in another program as well. Any help pointing me in the right direction would be greatly appreciated.
|
Hello, I'm new to this place, what a great resource! I want to use this OCR app but it seems the company is out of biz? I downloaded the code but I do not have the ability to compile C#. I am trying to build a big website and could really use this (if it works well, that is!) and I'm really not interested in taking on a new project (getting the source up and running). Anybody have any idea how I can get ahold of a working package? Anyone have a password or something out there.....
Thanks everyone. Now to surf around and see what else is on this site!
Bob
bob
|
I've been trying to get ahold of them for a month now, half the emails are no good and the ones that seem to work, no reply. Frustrating. I have pages and pages (thousands) of data in tabular format that I am trying to integrate into a SQL dB and something like this, if it worked well, would just make my day.....ah well, if I wish hard enough.....
Thanks for your help! And aren't you the guy who built this app? It looks good!
Bob
bob
|
Hello.
I'm having trouble releasing the image document (.tif) after doing OCR. It doesn't seem to free the file handle after calling the MODI.Document.Close() method.
If you use TableExtractor to OCR a document and then open another one in the same window, it won't free the previous file. (You can check this by trying to delete it.)
Any ideas, please?
Thanks
- CEK
|
Try forcing garbage collection after closing your MODI object and/or setting the object to null.
GC.Collect(); GC.WaitForPendingFinalizers();
|
Yeah, using the GC.Collect() and GC.WaitForPendingFinalizers() doesn't free the file. Here's what I've tried:
document.Close(false);
Marshal.ReleaseComObject(document);
System.Runtime.InteropServices.Marshal.FinalReleaseComObject(document);
document = null;
GC.Collect();
GC.WaitForPendingFinalizers();
And it still won't release. However, doing this will allow you to over-write it, just not delete it.
Van
|
I would like to be able to extract elements from multiple tables on the same document.
Is this possible with your TableExtractor, and if so, how?
|
Hello. After reading this thread, I included MODI in my existing web application written in ASP.NET 2003. Everything works fine on the development machine. I installed MS Office 2003 on the web server and copied everything to it, including Interop.MODI.DLL in the bin folder. But it gave me the following error when the application tried to execute the OCR function: "System.Runtime.InteropServices.COMException: File is empty or corrupted". Please help!
|
Hello Martin! I want to drag and drop text into text boxes after the OCR process, but I don't understand how. Can you help me?
|
I have looked all through the code for Table Extractor trying to figure out how the red selection area or square comes up. I tried looking for mouse down, drag, mouse move, etc., but I cannot figure it out. Can someone explain how the red box works? Why does it turn blue under the red square?
|
Hi!
This is a hard coded feature of the MODI Viewer control. AFAIK you cannot change its properties. Sorry!
Martin
 Enlightened by voting 6 
|
Hi Martin.
How's life? I hope you remember me (MODI search). I just need a piece of advice. Our company is willing to buy third-party controls. The core requirements are annotations, bookmarks within a document, thumbnail view, text search and, obviously, OCR.
I would be thankful if you could spare some of your time and guide me on this.
Regards, Abbas
|
Hello,
We just started using the MODI OCR interface. Is there a way to pass the size of the ROI (region of interest) box to the MODI object so that its location and size can be defined without input from the user?
|
Hi!
Sure. It can be done via the Selection property. By the way, it's done in the example code.
Cheers
Martin
 Enlightened by voting 6 
|
Hi,
I've looked at the website http://www.tableextractor.com and downloaded the trial version of TableExtractor. It's working fine, but I can't understand the "AutoCorrection" feature described on the site. Also, the QRX manual is not detailed enough to be usable.
In "AutoCorrection", the color orange indicates that some modification has been made to the extracted content. But what do [0,3] and [1,0] stand for? Can you explain?
Thanks in advance for your reply.
Regards Andrew
|
Hi all, I am having a problem creating an instance of axMiDocView in VS 2005. It says the ActiveX control is not registered properly. I get this error only in VS 2005, not in VS 2003. Another issue: when I add a reference to "Microsoft Office Document Imaging" in VS 2005, I cannot see stdole in my references, but when I do the same in VS 2003 I can see both MODI and stdole.
Can anyone tell me what I am missing in VS 2005?
Thanks in advance
Vijay Kumar Raja.Grandhi Sr. Systems Analyst, CSSEurope Ltd.
|
I finally found a solution for finding the next occurrence of the search text: just keep calling the same procedure from the button click event.
Private Sub FindFirst()

    ' Static variables preserve the search state between calls
    Static pWordId As Integer
    Static pPageNo As Integer
    Static pSearchWord As String

    Dim nWordID As Integer

    If _MODIDocument Is Nothing Then
        Return
    End If

    ' Variable declarations
    Dim miWord As MODI.Word
    Dim strWordInfo As String
    Dim strTextSelInfo As String
    Dim miSelectRects As MODI.MiSelectRects
    Dim miSelectRect As MODI.MiSelectRect
    Dim mWordID As Integer

    If txtSearch.Text.Trim() <> "" Then

        Try

            Dim n1 As Object
            Dim n2 As Object

            ' Re-initialize the search state if the user entered different search criteria
            If pSearchWord = "" Then
                pSearchWord = txtSearch.Text
            ElseIf pSearchWord <> txtSearch.Text.Trim Then
                pPageNo = 0
                pWordId = 0
                pSearchWord = txtSearch.Text
            End If

            ' Here the search begins
            Dim sel As MODI.IMiSelectableItem
            Dim search As MODI.MiDocSearchClass = New MODI.MiDocSearchClass
            search.Initialize(_MODIDocument, txtSearch.Text.Trim, pPageNo, pWordId, False, False) ', False, False, False, True)
            search.Search(Nothing, sel)

            If Not sel Is Nothing Then

                ' The word that was found
                miWord = sel.Words.item(0)
                miSelectRects = sel.GetSelectRects

                For Each miSelectRect In miSelectRects
                    With miSelectRect

                        ' Two criteria for determining that the search has reached its end
                        '============================================================
                        If pPageNo = .PageNumber Then

                            ' If the page number is unchanged and the word index is lower
                            ' than that of the previously found item, this was the last word
                            If pWordId > miWord.Id Then
                                pPageNo = 0
                                pWordId = 0
                                MessageBox.Show("Doc Viewer has finished searching the document")
                                Return
                            End If

                        ElseIf pPageNo > .PageNumber Then

                            ' The previous page number is greater than the new one,
                            ' so the search has wrapped around to the beginning
                            pPageNo = 0
                            pWordId = 0
                            MessageBox.Show("Doc Viewer has finished searching the document")
                            Return

                        End If

                        ' Remember the page number
                        pPageNo = .PageNumber

                    End With
                Next

                ' Select the word that was found
                AxMiDocView1.TextSelection = sel

                ' Go to the page where the word was found
                AxMiDocView1.PageNum = miSelectRect.PageNumber

                ' Update the status bar
                ShowStatus()

                ' Increment the word index
                pWordId = miWord.Id + 1

                ' Check whether that word ID exists on this page
                If pWordId > _MODIDocument.Images(pPageNo).layout.words.count - 1 Then

                    ' If the word ID is out of range, move to the next page and restart the index
                    If pPageNo < AxMiDocView1.NumPages - 1 Then
                        pPageNo = pPageNo + 1
                        pWordId = 0
                    Else
                        pPageNo = 0
                        pWordId = 0
                    End If

                End If

            Else
                ' Self-explanatory
                MessageBox.Show("The search item was not found")
            End If

        Catch ee As Exception
            ' I hope it won't happen this time
            MessageBox.Show(ee.Message)
        End Try

    End If

End Sub
regards / Abbas
-- modified at 6:59 Tuesday 31st January, 2006
|