Click here to Skip to main content
15,608,181 members
Please Sign up or sign in to vote.
0.00/5 (No votes)
See more:

I work for a website in publishing dept and i work on PDF's with lot of content related to tax and accounting. Usually every PDF i receive will have a INDEX like below.

12230 Change in Accountants
12240 Change in Fiscal Year
12250 Auditor Issues

This text in INDEX will again repeat in the content as heading for a paragaraph like below.

12230 Change in Accountants
(Last updated: 6/30/2009)
12230.1 Unless the same accountant reported on the most recent financial statements of
both the registrant and the accounting acquirer, a reverse acquisition always

So whenever the text in INDEX is repeating in the content i need to add the heading levels Eg: "< hd1 >< name >12230 Change in Accountants< /name >"

Can someone help me with this.
ZurdoDev 20-Aug-12 11:00am    
What code are you using to parse the pdf file?
ridoy 20-Aug-12 13:52pm    
Do you want pdf to text conversion and then replace it in a textbox?

1 solution

There's no PDF support in .NET, you need a third party library
Share this answer
vamshivarma 20-Aug-12 12:23pm    
I will convert the PDF into text and place the content in textbox.
So is there a posibility of replacing in textbox?
Christian Graus 20-Aug-12 12:29pm    
yes, changing the text in a textbox is trivial, it has a Text property. Your hard part is parsing data from a PDF.

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)

CodeProject, 20 Bay Street, 11th Floor Toronto, Ontario, Canada M5J 2N8 +1 (416) 849-8900