Click here to Skip to main content
Rate this: bad
good
Please Sign up or sign in to vote.
See more: C#3.5
Hello all;
 
I have some jpgs and i am creating pdf from it using pdfSharp. but it is not getting converted to searchable pdf rather it is just a image pdf. i want to extract text from it.
Please help me.i have tried many free ocr as well and all in vain
Posted 9-Aug-12 20:45pm
Rate this: bad
good
Please Sign up or sign in to vote.

Solution 1

The only way I could do this myself was to use Adobe Acrobat itself, but unfortunatly it's not free.
 
I couldn't find a free way of doing this.
  Permalink  
Rate this: bad
good
Please Sign up or sign in to vote.

Solution 2

I'm not sure if there is any free way to do that. We use a third-party toolkit named leadtools that gives us very good results when OCRing PDF documents. I know that they have an online demo, you can try it. Here is the link for the demo:
Online Recognition Demo
 
If the results match your requirements, you can check this tutorial:
Scanning to Searchable PDF
  Permalink  

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)

  Print Answers RSS
Your Filters
Interested
Ignored
     
0 Sergey Alexandrovich Kryukov 530
1 OriginalGriff 379
2 Manas Bhardwaj 350
3 Abhinav S 295
4 Sampath Lokuge 255
0 Sergey Alexandrovich Kryukov 7,647
1 OriginalGriff 4,287
2 Peter Leow 3,681
3 Maciej Los 3,515
4 Er. Puneet Goel 3,107


Advertise | Privacy | Mobile
Web02 | 2.8.140415.2 | Last Updated 2 Sep 2012
Copyright © CodeProject, 1999-2014
All Rights Reserved. Terms of Use
Layout: fixed | fluid