Click here to Skip to main content
15,883,901 members
Please Sign up or sign in to vote.
1.00/5 (1 vote)
See more:
I would like to convert the tiff file to text. I tried with OCR tool, MODI etc, but the text in the TIFF file is in "French Script MT" which is unable to convert. I have attached to this question. Please help me in the solving this issue. Kindly provide me C# file for this.
Posted
Comments
Sergey Alexandrovich Kryukov 10-Aug-14 15:04pm    
It's a matter of the quality of OCR, which is a problem. Now, isn't this off-topic? If you want to develop your own OCR, I could give you some useful references, but it's going to be very serious big work, if not, it would be another problem. If you are hoping to find out some ready-to-use OCR library of open source code, it's possibly, but most results will be more or less frustrating.
—SA
ersenthil_nkl 11-Aug-14 2:01am    
yes please give me the reference docs. So that I can start with my self
Sergey Alexandrovich Kryukov 11-Aug-14 2:41am    
I put some answer. Wait a second, I'll add some more...
—SA

1 solution

Please see my comment to the question. To start with, please try: http://www.codeproject.com/search.aspx?q=%28OCR+OR+%22Optical+Character+Recognition%22%29+%28%22C#%22+OR+%22.NET%22%29&doctypeid=1[^].

However, I cannot guarantee any final results. Even though some works are very impressive; with good quality of code, I haven't find anything ready to use so far (some other members may correct me if I'm just unaware of something, which would be quite possible), all I knew so far would require big amount of high-qualification work.

[EDIT]

See also my past answer and other answers and 7 articles referenced in it: i need an ocr for a camera image capture[^].

It's possible that my references are a bit obsolete, but then it would mean luck for you, because it could be because some new works have been added. But don't hold your breath: OCR is in fact a very difficult topic, a remarkable success in it in near past was due to extremely high demand, as then it was a pike in the recognition of literature and other legacy documents. Still, not much of good software was released to open source. One big problem is the quality: many works good from the standpoint of applied mathematics were wonderfully weak in computer science and programming technology, many authors did not really understand such a relatively simple thing as Unicode, and so on…

—SA
 
Share this answer
 
v2

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)



CodeProject, 20 Bay Street, 11th Floor Toronto, Ontario, Canada M5J 2N8 +1 (416) 849-8900