Click here to Skip to main content
15,886,137 members
Please Sign up or sign in to vote.
0.00/5 (No votes)
See more:
Can anyone direct me to some c# code examples for extracting text from a scanned pdf document? I've went through with so many posts, but couldn't find a proper one where i can understand how to do this. Those libraries that were used are not free ones. Some libraries has restrictions like only able to extract first three pages from a pdf document. To extract whole document it asks me to download their full version of the library. So the full version is not for free.
Please direct me how to do this without spending money.
Posted
Comments
Kornfeld Eliyahu Peter 26-Jun-14 4:39am    
AS you talk about scanned text PDF is irrelevant. You have to search for OCR...

Hi,

Please refer the following URL

http://www.codeproject.com/Questions/243295/Is-this-possible-to-Extract-Text-from-Scanned-PDF
 
Share this answer
 
You can use tesseract OCR .net https://code.google.com/p/tesseractdotnet/[^]
 
Share this answer
 

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)



CodeProject, 20 Bay Street, 11th Floor Toronto, Ontario, Canada M5J 2N8 +1 (416) 849-8900