Click here to Skip to main content
Rate this: bad
good
Please Sign up or sign in to vote.
See more: C VB
1- hi for all, ineed to extract bold text from pdf documents and save it into DB
2- how can I extract italic text in pdf documents
Posted 21-Jan-11 7:24am
Comments
HARISHCHOWDHARY at 23-Jan-12 1:00am
   
Please suggest me a way to Extract all the BOLD and Italics Characters from the Pdf File on the Linux Box
Rate this: bad
good
Please Sign up or sign in to vote.

Solution 1

You need a pdf library such as iTextSharp[^] or commercial library. If you want to go with commericail application, there are many of them to choose. I am working with Syncufion's [^] library, which I find it very good.
  Permalink  
Comments
SAKryukov at 21-Jan-11 18:14pm
   
Must be pretty good stuff, is that true?
My 5 anyway.
Yusuf at 21-Jan-11 18:20pm
   
It does pretty amazing stuff, well kinda of. I am doing a feature on out product that takes PDF form and annotates to create a new PDF file based on the PDF form, XML file and user selection. I thought writing an article, but we are using thrid party tool to accomplish the task. I am sure I can think of doing it using iTextSharp, but don't have the time right now. :-(
SAKryukov at 21-Jan-11 19:26pm
   
Yusuf,
 
Would be quite nice if you share your experience and best techniques. If there are non-trivial moments (and they should be if you got non-trivial experience), it would be a very good candidate for an interesting article. Also, it may become very popular.
So, I would encourage you to press your personal schedule a bit and prepare the article. (I see you never posted one... how come? With such a wonderful logo :-) as the one you use -- like it! :-)
 
(By the way, I've been encourages to post another article as well -- preparing it now. You know, you never know what's going to appear more interesting for your colleague, so socializing like that is extremely valuable, don't you think so?)
 
Good luck,
--SA
Rate this: bad
good
Please Sign up or sign in to vote.

Solution 2

  Permalink  
Comments
CPallini at 21-Jan-11 12:36pm
   
Hey man, reporting links to "The Competitors" is unfair! :-D
Nishant Sivakumar at 21-Jan-11 12:37pm
   
Well some of the most popular SO threads link back to CP articles. If they can link to us, let's show them that we are equally good at that game! *grin*
Yusuf at 21-Jan-11 18:21pm
   
Traitor. Bob does not have any Competitor. ducks!
CPallini at 22-Jan-11 8:00am
   
Wrong: Boobs have not. :-D
CPallini at 21-Jan-11 13:30pm
   
I was just kidding (I did the same so many times...).
Nishant Sivakumar at 21-Jan-11 13:31pm
   
I know :-)
SAKryukov at 21-Jan-11 18:16pm
   
Popular topic, isn't it? My 5.
Nishant Sivakumar at 21-Jan-11 18:16pm
   
Thank you.
Rate this: bad
good
Please Sign up or sign in to vote.

Solution 3

You have either to:
  • Use a PDF library.
or
  • Develop your self such a functionality, starting from PDF specifications, available here[^].
Smile | :)
  Permalink  
v2

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)

  Print Answers RSS
0 OriginalGriff 350
1 Jochen Arndt 190
2 Richard MacCutchan 135
3 Sergey Alexandrovich Kryukov 120
4 DamithSL 95
0 OriginalGriff 6,045
1 DamithSL 4,611
2 Maciej Los 4,087
3 Kornfeld Eliyahu Peter 3,480
4 Sergey Alexandrovich Kryukov 3,310


Advertise | Privacy | Mobile
Web01 | 2.8.141220.1 | Last Updated 21 Jan 2011
Copyright © CodeProject, 1999-2014
All Rights Reserved. Terms of Service
Layout: fixed | fluid

CodeProject, 503-250 Ferrand Drive Toronto Ontario, M3C 3G8 Canada +1 416-849-8900 x 100