I have 5 PDFs containing around 38,000 objective questions. I want to build an application that imports these questions, saves them to a database, and then presents each question to the user with its four options to choose from. I used iTextSharp to read the PDFs, both as chunks and line by line. The extracted content comes out scattered, and I cannot find a sequence by which to split or differentiate between a question and its four options. Is there a better way to import data from PDFs? The content in the PDFs is in tabular format.

Here are snapshots of the PDF and the resulting string:
Input PDF file
Resulting string in window
Posted
Updated 11-Oct-12 2:16am
v3
Comments
Jörgen Andersson 11-Oct-12 7:45am    
Your links don't work.
Pankaj Sinai Nagarsekar 11-Oct-12 7:52am    
Please check the link. I've posted it as a comment. Thanks.
This is a bug in CodeProject: the OP posted the links correctly in the question (we can see that by clicking on "Improve question").
So I have reported the bug: image URLs posted in the question don't link to the actual path.

Thanks...
Pankaj Sinai Nagarsekar 11-Oct-12 8:16am    
The links are working now.
Yes, you added "http://", that's why. But the links were also correct without it; they just did not work on CodeProject, which is a bug.

1 solution

See my previous answer[^] to a similar question.
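
Independently of the linked answer, one general way to handle tabular PDF text that comes out scattered is to keep each text chunk's page position and rebuild the table rows from the coordinates, instead of relying on the order in which the text is emitted. The sketch below is only an illustration under stated assumptions: it is written in Python rather than against iTextSharp, the chunks are assumed to be available as (x, y, text) tuples, and each table row is assumed to hold exactly one question followed by its four options. The names `group_into_rows`, `rows_to_questions`, and `y_tolerance` are hypothetical.

```python
def group_into_rows(chunks, y_tolerance=2.0):
    """Group (x, y, text) chunks into rows, top to bottom, left to right.

    Assumption: chunks whose y values lie within y_tolerance of the first
    chunk of a row belong to that same table row.
    """
    rows = []  # each entry: (y of the row's first chunk, [(x, text), ...])
    # PDF y coordinates grow upward, so sort descending to read top-down.
    for x, y, text in sorted(chunks, key=lambda c: -c[1]):
        if rows and abs(rows[-1][0] - y) <= y_tolerance:
            rows[-1][1].append((x, text))
        else:
            rows.append((y, [(x, text)]))
    # Within each row, order the cells left to right by x.
    return [[text for _, text in sorted(cells)] for _, cells in rows]


def rows_to_questions(rows):
    """Turn 5-cell rows into question records; skip everything else.

    Assumption: a valid row is [question, option1, option2, option3, option4].
    Rows with any other cell count (headers, page numbers) are ignored.
    """
    questions = []
    for cells in rows:
        if len(cells) != 5:
            continue
        questions.append({"question": cells[0], "options": cells[1:]})
    return questions


if __name__ == "__main__":
    # Made-up sample data, deliberately out of order.
    sample = [
        (300.0, 700.0, "4"),
        (50.0, 700.0, "2 + 2 = ?"),
        (200.0, 700.5, "3"),
        (400.0, 699.0, "5"),
        (500.0, 700.0, "6"),
        (50.0, 660.0, "Page 1"),
    ]
    for q in rows_to_questions(group_into_rows(sample)):
        print(q["question"], q["options"])
```

With real data the row tolerance and the five-cell rule would need tuning, for example when a long question wraps onto several lines inside its cell. How the coordinates are obtained from iTextSharp is not shown here; a custom render listener that records each chunk's baseline position is one possibility, but that is an assumption about your setup.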
 
 

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)


