Click here to Skip to main content
11,802,389 members (30,802 online)
Rate this: bad
Please Sign up or sign in to vote.
See more: C VB
1- hi for all, ineed to extract bold text from pdf documents and save it into DB
2- how can I extract italic text in pdf documents
Posted 21-Jan-11 6:24am
HARISHCHOWDHARY at 23-Jan-12 1:00am
Please suggest me a way to Extract all the BOLD and Italics Characters from the Pdf File on the Linux Box
Rate this: bad
Please Sign up or sign in to vote.

Solution 1

You need a pdf library such as iTextSharp[^] or commercial library. If you want to go with commericail application, there are many of them to choose. I am working with Syncufion's [^] library, which I find it very good.
SAKryukov at 21-Jan-11 18:14pm
Must be pretty good stuff, is that true?
My 5 anyway.
Yusuf at 21-Jan-11 18:20pm
It does pretty amazing stuff, well kinda of. I am doing a feature on out product that takes PDF form and annotates to create a new PDF file based on the PDF form, XML file and user selection. I thought writing an article, but we are using thrid party tool to accomplish the task. I am sure I can think of doing it using iTextSharp, but don't have the time right now. :-(
SAKryukov at 21-Jan-11 19:26pm

Would be quite nice if you share your experience and best techniques. If there are non-trivial moments (and they should be if you got non-trivial experience), it would be a very good candidate for an interesting article. Also, it may become very popular.
So, I would encourage you to press your personal schedule a bit and prepare the article. (I see you never posted one... how come? With such a wonderful logo :-) as the one you use -- like it! :-)

(By the way, I've been encourages to post another article as well -- preparing it now. You know, you never know what's going to appear more interesting for your colleague, so socializing like that is extremely valuable, don't you think so?)

Good luck,
Rate this: bad
Please Sign up or sign in to vote.

Solution 2

CPallini at 21-Jan-11 12:36pm
Hey man, reporting links to "The Competitors" is unfair! :-D
Nishant Sivakumar at 21-Jan-11 12:37pm
Well some of the most popular SO threads link back to CP articles. If they can link to us, let's show them that we are equally good at that game! *grin*
Yusuf at 21-Jan-11 18:21pm
Traitor. Bob does not have any Competitor. ducks!
CPallini at 22-Jan-11 8:00am
Wrong: Boobs have not. :-D
CPallini at 21-Jan-11 13:30pm
I was just kidding (I did the same so many times...).
Nishant Sivakumar at 21-Jan-11 13:31pm
I know :-)
SAKryukov at 21-Jan-11 18:16pm
Popular topic, isn't it? My 5.
Nishant Sivakumar at 21-Jan-11 18:16pm
Thank you.
Rate this: bad
Please Sign up or sign in to vote.

Solution 3

You have either to:
  • Use a PDF library.
  • Develop your self such a functionality, starting from PDF specifications, available here[^].
Smile | :)

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)

  Print Answers RSS
0 OriginalGriff 435
1 KrunalRohit 366
2 F-ES Sitecore 295
3 Afzaal Ahmad Zeeshan 230
4 CHill60 220
0 OriginalGriff 2,950
1 Maciej Los 1,910
2 KrunalRohit 1,862
3 CPallini 1,645
4 Richard MacCutchan 1,157

Advertise | Privacy | Mobile
Web02 | 2.8.151002.1 | Last Updated 21 Jan 2011
Copyright © CodeProject, 1999-2015
All Rights Reserved. Terms of Service
Layout: fixed | fluid

CodeProject, 503-250 Ferrand Drive Toronto Ontario, M3C 3G8 Canada +1 416-849-8900 x 100