Click here to Skip to main content
15,885,546 members

read data from pdf files using c#

Revision 2
I have this question. I have 5 PDFs having around 38,000 objective questions. So i want to make an application which imports this questions and save it into database and then give interface to the user for choosing question with four objective. I used itextsharp to read from PDFs as a chunk and also line by line. The content after reading is scattered and i cannot figure out a sequence by which i can split or differentiate between the question and the four objectives. Is there any better way by which I can import data from PDFs?? The content in PDFs is in tabular format.

Here is the snapshots of the pdf and the resulting string.
Input Pdf file
resulting string in Window
Posted 11-Oct-12 0:43am by Pankaj Sinai Nagarsekar.