See more: C VB
1- Hi all, I need to extract bold text from PDF documents and save it into a database.
2- How can I extract italic text from PDF documents?
Posted 21-Jan-11 7:24am
HARISHCHOWDHARY 23-Jan-12 1:00am
Please suggest a way to extract all the bold and italic characters from a PDF file on a Linux box.

Solution 1

You need a PDF library such as iTextSharp, or a commercial library. If you want to go with a commercial product, there are many to choose from. I am working with Syncfusion's library, which I find very good.
SAKryukov 21-Jan-11 18:14pm
Must be pretty good stuff, is that true?
My 5 anyway.
Yusuf 21-Jan-11 18:20pm
It does pretty amazing stuff, well, kind of. I am working on a feature in our product that takes a PDF form and annotates it to create a new PDF file based on the PDF form, an XML file and the user's selection. I thought about writing an article, but we are using a third-party tool to accomplish the task. I am sure I could do it using iTextSharp, but I don't have the time right now. :-(
SAKryukov 21-Jan-11 19:26pm

It would be quite nice if you shared your experience and best techniques. If there were non-trivial moments (and there should be, if you gained non-trivial experience), it would be a very good candidate for an interesting article. It might also become very popular.
So, I would encourage you to squeeze your personal schedule a bit and prepare the article. (I see you have never posted one... how come? With such a wonderful logo :-) as the one you use -- like it! :-)

(By the way, I've been encouraged to post another article as well -- preparing it now. You never know what is going to be more interesting to your colleagues, so socializing like that is extremely valuable, don't you think so?)

Good luck,

Solution 2

CPallini 21-Jan-11 12:36pm
Hey man, reporting links to "The Competitors" is unfair! :-D
Nishant Sivakumar 21-Jan-11 12:37pm
Well some of the most popular SO threads link back to CP articles. If they can link to us, let's show them that we are equally good at that game! *grin*
Yusuf 21-Jan-11 18:21pm
Traitor. Bob does not have any Competitor. ducks!
CPallini 22-Jan-11 8:00am
Wrong: Boobs have not. :-D
CPallini 21-Jan-11 13:30pm
I was just kidding (I did the same so many times...).
Nishant Sivakumar 21-Jan-11 13:31pm
I know :-)
SAKryukov 21-Jan-11 18:16pm
Popular topic, isn't it? My 5.
Nishant Sivakumar 21-Jan-11 18:16pm
Thank you.

Solution 3

You have to either:
  • Use a PDF library.
  • Develop such functionality yourself, starting from the PDF specification.
:)
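
Whichever route you take, it may help to know that PDF has no "bold" or "italic" attribute on text: the style lives in the font that a text run uses. Below is a minimal Python sketch of the usual heuristic, assuming your PDF library (or your own parser) already gives you each run's text and font name, and optionally the font descriptor's Flags, FontWeight and ItalicAngle values. The function names `classify_font` and `pick_styled` and the `(text, font_name)` run layout are hypothetical, made up for this illustration; they are not part of iTextSharp or any other library.

```python
import re

# Font descriptor flag bits as defined in the PDF specification.
FLAG_ITALIC = 1 << 6       # bit 7: glyphs have dominant slanted strokes
FLAG_FORCE_BOLD = 1 << 18  # bit 19: only a hint, usually set on bold fonts

# Subset fonts carry a tag of six uppercase letters and a plus sign.
_SUBSET_PREFIX = re.compile(r"^[A-Z]{6}\+")


def classify_font(base_font, flags=0, font_weight=None, italic_angle=0.0):
    """Guess (is_bold, is_italic) for a font. This is a heuristic, not a guarantee."""
    name = _SUBSET_PREFIX.sub("", base_font).lower()
    bold = (
        any(key in name for key in ("bold", "black", "heavy"))
        or (font_weight is not None and font_weight >= 700)
        or bool(flags & FLAG_FORCE_BOLD)
    )
    italic = (
        any(key in name for key in ("italic", "oblique"))
        or italic_angle != 0
        or bool(flags & FLAG_ITALIC)
    )
    return bold, italic


def pick_styled(runs, want_bold=False, want_italic=False):
    """Filter hypothetical (text, font_name) runs by the guessed style."""
    picked = []
    for text, font_name in runs:
        bold, italic = classify_font(font_name)
        if (want_bold and bold) or (want_italic and italic):
            picked.append(text)
    return picked


if __name__ == "__main__":
    runs = [("Hello", "ABCDEF+Arial-BoldMT"), ("world", "Arial"),
            ("note", "Arial-ItalicMT")]
    print(pick_styled(runs, want_bold=True))
    print(pick_styled(runs, want_italic=True))
```

Be aware that this can miss text: some documents fake bold by drawing the same text twice or by stroking the outline, and some fonts have names that say nothing about their style, so treat the result as a best guess. The strings returned by `pick_styled` are what you would then save to the DB.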

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)

Last Updated 21 Jan 2011
Copyright © CodeProject, 1999-2015