Click here to Skip to main content
Rate this: bad
good
Please Sign up or sign in to vote.
See more: C#3.5
Hello all;
 
I have some jpgs and i am creating pdf from it using pdfSharp. but it is not getting converted to searchable pdf rather it is just a image pdf. i want to extract text from it.
Please help me.i have tried many free ocr as well and all in vain
Posted 9-Aug-12 21:45pm
Rate this: bad
good
Please Sign up or sign in to vote.

Solution 1

The only way I could do this myself was to use Adobe Acrobat itself, but unfortunatly it's not free.
 
I couldn't find a free way of doing this.
  Permalink  
Rate this: bad
good
Please Sign up or sign in to vote.

Solution 2

I'm not sure if there is any free way to do that. We use a third-party toolkit named leadtools that gives us very good results when OCRing PDF documents. I know that they have an online demo, you can try it. Here is the link for the demo:
Online Recognition Demo
 
If the results match your requirements, you can check this tutorial:
Scanning to Searchable PDF
  Permalink  

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)

  Print Answers RSS
0 Zoltán Zörgő 370
1 Sergey Alexandrovich Kryukov 200
2 Peter Leow 179
3 OriginalGriff 160
4 88388132 128
0 Sergey Alexandrovich Kryukov 8,348
1 OriginalGriff 6,396
2 Peter Leow 3,557
3 Zoltán Zörgő 3,221
4 Richard MacCutchan 2,397


Advertise | Privacy | Mobile
Web04 | 2.8.150123.1 | Last Updated 2 Sep 2012
Copyright © CodeProject, 1999-2015
All Rights Reserved. Terms of Service
Layout: fixed | fluid

CodeProject, 503-250 Ferrand Drive Toronto Ontario, M3C 3G8 Canada +1 416-849-8900 x 100