C# Read PDF Walkthrough

Today, we'll walk through a simple method for reading PDF content and extracting text in its original format. This can be done for entire documents or specific pages, all within your C# project.

Csharp Parse Pdf1 related to C# Read PDF Walkthrough
Csharp Parse Pdf2 related to C# Read PDF Walkthrough
Csharp Parse Pdf3 related to C# Read PDF Walkthrough

Get started with IronPDF

Start using IronPDF in your project today with a free trial.

First Step:
green arrow pointer


Read PDF File in C#

Using this C# library, we can read PDF files, extract content, and even extract high-quality and original images. See the examples below for the many ways we can use different functions to achieve our PDF reading needs in a .NET environment.

:path=/static-assets/pdf/content-code-examples/how-to/csharp-read-pdf-read-pdf.cs
using IronPdf;
using IronSoftware.Drawing;
using System.Collections.Generic;

// Load the desired PDF file
PdfDocument pdf = PdfDocument.FromFile("sample.pdf");

// Extract all text from the PDF
string allText = pdf.ExtractAllText();

// Get all images embedded in the PDF
IEnumerable<AnyBitmap> allImages = pdf.ExtractAllImages();

// Iterate over each page to extract text and images individually
for (int index = 0; index < pdf.PageCount; index++)
{
    // Extract text from the current page
    string text = pdf.ExtractTextFromPage(index);

    // Extract images from the current page
    IEnumerable<AnyBitmap> images = pdf.ExtractImagesFromPage(index);
    
    // At this point, 'text' contains the text from the current page
    // and 'images' contains any images found on the current page.
    // Additional processing can be performed here if needed.
}
Imports IronPdf
Imports IronSoftware.Drawing
Imports System.Collections.Generic

' Load the desired PDF file
Private pdf As PdfDocument = PdfDocument.FromFile("sample.pdf")

' Extract all text from the PDF
Private allText As String = pdf.ExtractAllText()

' Get all images embedded in the PDF
Private allImages As IEnumerable(Of AnyBitmap) = pdf.ExtractAllImages()

' Iterate over each page to extract text and images individually
For index As Integer = 0 To pdf.PageCount - 1
	' Extract text from the current page
	Dim text As String = pdf.ExtractTextFromPage(index)

	' Extract images from the current page
	Dim images As IEnumerable(Of AnyBitmap) = pdf.ExtractImagesFromPage(index)

	' At this point, 'text' contains the text from the current page
	' and 'images' contains any images found on the current page.
	' Additional processing can be performed here if needed.
Next index
$vbLabelText   $csharpLabel

Output

We have used a C# Form to show you the perfect output of reading the PDF content. With this approach, it's all about simplicity and using as little code as possible to achieve your project needs.

~ PDF ~

Csharp Read Pdf4 related to ~ PDF ~

~ C# Form ~

Csharp Read Pdf5 related to ~ C# Form ~

Library Quick Access

Documentation related to Library Quick Access

Library Documentation

Documentation for the IronPDF library is available in the handy API Reference for you to explore and share.

IronPDF API Reference Documentation

Frequently Asked Questions

What is IronPDF?

IronPDF is a C# library that allows developers to read, edit, and create PDF files within their C# projects.

How do I install IronPDF?

You can install IronPDF via NuGet by downloading it from the NuGet website or using the NuGet package manager in your development environment.

How can I extract text from a PDF using IronPDF?

You can extract text from a PDF by loading the document in IronPDF and using the 'ExtractAllText' method to retrieve the content.

Can IronPDF extract images from PDF files?

Yes, IronPDF can extract high-quality and original images from PDF files.

Is there documentation available for IronPDF?

Yes, the IronPDF library has comprehensive documentation available in the API Reference, which can be accessed online.

What are the basic steps to read a PDF file using IronPDF in C#?

To read a PDF file using IronPDF, download and install the library via NuGet, load the PDF document, and use the ExtractAllText method to read the content.

Can I use IronPDF to display PDF content in a C# Form?

Yes, IronPDF can be used in conjunction with a C# Form to display PDF content, ensuring that the output is presented perfectly.

Is IronPDF suitable for .NET projects?

IronPDF is designed for use within .NET projects, providing a seamless integration for PDF reading and manipulation.

Are there example codes available for using IronPDF?

Yes, there are example codes available that demonstrate how to use IronPDF for reading and extracting text from PDF files in a C# environment.

Where can I view the PDF output when using IronPDF?

The PDF output can be viewed directly in your C# application or through the console, depending on how you implement IronPDF in your project.

Chaknith Bin
Software Engineer
Chaknith works on IronXL and IronBarcode. He has deep expertise in C# and .NET, helping improve the software and support customers. His insights from user interactions contribute to better products, documentation, and overall experience.