Click here to Skip to main content
15,886,518 members
Articles / Web Development / HTML

Converting PDF to Text in C#

Rate me:
Please Sign up or sign in to vote.
4.80/5 (144 votes)
19 Apr 2015CPOL3 min read 1.9M   31.8K   484  
Parsing PDF files in .NET using PDFBox and IKVM.NET (managed code).
<!DOCTYPE html>
<html>

<head>
<meta content="cs" http-equiv="Content-Language" />
<meta content="text/html; charset=utf-8" http-equiv="Content-Type" />
<title>How to convert PDF to text in</title>
</head>

<body>

<h1>How to convert PDF to text in .NET</h1>
<h2>Libraries</h2>
<p>You need to copy the following DLLs to the lib directory</p>
<p>lib/reference</p>
<ul>
	<li>IKVM.OpenJDK.Core.dll</li>
	<li>IKVM.OpenJDK.SwingAWT.dll</li>
	<li>pdfbox-1.8.9.dll</li>
</ul>
<p>lib/copy</p>
<ul>
	<li>commons-logging.dll</li>
	<li>fontbox-1.8.9.dll</li>
	<li>IKVM.OpenJDK.Text.dll</li>
	<li>IKVM.OpenJDK.Util.dll</li>
	<li>IKVM.Runtime.dll</li>
</ul>
<h2>Download</h2>
<p>These <strong>PDFBox DLLs</strong> can be downloaded here:</p>
<ul>
	<li><a href="http://www.squarepdf.net/pdfbox-in-net-download/">
	http://www.squarepdf.net/pdfbox-in-net-download/</a></li>
</ul>
<p>The <strong>full project </strong>including these libraries can be downloaded 
here:</p>
<ul>
	<li>
	<a href="http://www.squarepdf.net/how-to-convert-pdf-to-text-in-net-sample-project/">
	http://www.squarepdf.net/how-to-convert-pdf-to-text-in-net-sample-project/</a></li>
</ul>

</body>

</html>

By viewing downloads associated with this article you agree to the Terms of Service and the article's licence.

If a file you wish to view isn't highlighted, and is a text file (not binary), please let us know and we'll add colourisation support for it.

License

This article, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)


Written By
Czech Republic Czech Republic
My open-source event calendar/scheduling web UI components:

DayPilot for JavaScript, Angular, React and Vue

Comments and Discussions