How to manipulate and edit PDFs in .NET with PdfSharpCore

Hulya Masharipov

August 12, 2023

How to manipulate and edit PDFs in .NET with PdfSharpCore

Summary

Learn how to use PdfSharpCore, a powerful open source library, to manipulate and edit PDF documents in .NET applications. This comprehensive tutorial demonstrates essential PDF operations, including adding text and images, merging multiple documents, extracting specific pages, and deleting unwanted content. Discover how to set up a .NET project with PdfSharpCore, and explore advanced features available through Nutrient .NET SDK for complex enterprise applications.

PdfSharpCore(opens in a new tab) is a powerful open source library that allows you to generate, modify, edit, and manipulate PDF documents in .NET applications. In this tutorial, you’ll learn how to set up a .NET project with PdfSharpCore, along with how to perform common PDF manipulations, such as adding text and images, and merging, extracting, and even deleting pages from a PDF. Additionally, you’ll learn about Nutrient .NET SDK, which offers more advanced features for complex .NET projects.

Prerequisites

.NET SDK(opens in a new tab) installed on your system.
A code editor, such as Visual Studio Code or Visual Studio.

Step 1 — Setting up a .NET core project

Open your terminal and create a new .NET console application(opens in a new tab) using the .NET command-line interface (CLI):

dotnet new console --name MyPDFEditorApp
cd MyPDFEditorApp

Step 2 — Installing the PdfSharpCore package

Now, install the PdfSharpCore package using the .NET CLI:

dotnet add package PdfSharpCore

Step 3 — Adding text to existing PDFs

Begin by adding text to an existing PDF document. For this tutorial, make sure you have an existing PDF file, input.pdf, in your project folder.

Open the Program.cs file in your code editor and add the following code:

using PdfSharpCore.Pdf;
using PdfSharpCore.Drawing;
using PdfSharpCore.Pdf.IO;

class Program
{
    static void Main()
    {
        // Open an existing PDF document.
        PdfDocument pdfDocument = PdfReader.Open("input.pdf", PdfDocumentOpenMode.Modify);

        // Get the first page of the document.
        PdfPage page = pdfDocument.Pages[0];

        // Create a graphics object to draw on the page.
        XGraphics gfx = XGraphics.FromPdfPage(page);

        // Define the font and size.
        XFont font = new XFont("Arial", 12);

        // Define the text to be added.
        string text = "Hello, PDFSharpCore!";

        // Draw the text on the page.
        gfx.DrawString(text, font, XBrushes.Black, new XPoint(100, 100));

        // Save the changes to the PDF.
        pdfDocument.Save("output.pdf");
        pdfDocument.Close();
    }
}

Run the application with dotnet run and open the output.pdf file to see the result.

adding text

This will add the text “Hello, PDFSharpCore!” to the first page of the PDF document.

Step 4 — Adding images to PDFs

Next, add an image to a PDF document. Make sure you have an image.png file in your project folder. This is the image that will be added to the PDF document:

using PdfSharpCore.Pdf;
using PdfSharpCore.Drawing;
using PdfSharpCore.Pdf.IO;

class Program
{
    static void Main()
    {
        // Open an existing PDF document.
        PdfDocument pdfDocument = PdfReader.Open("input.pdf", PdfDocumentOpenMode.Modify);

        // Get the first page of the document.
        PdfPage page = pdfDocument.Pages[0];

        // Create a graphics object to draw on the page.
        XGraphics gfx = XGraphics.FromPdfPage(page);

        // Load the image to be added.
        XImage image = XImage.FromFile("image.png");

        // Define the position and size of the image.
        double x = 100;
        double y = 100;
        double width = 200;
        double height = 100;

        // Draw the image on the page.
        gfx.DrawImage(image, x, y, width, height);

        // Save the changes to the PDF.
        pdfDocument.Save("output.pdf");
        pdfDocument.Close();
    }
}

In this code, you load the image.png file using XImage.FromFile, and you specify the position and size where you want the image to be drawn on the PDF page using the x, y, width, and height variables. The gfx.DrawImage method is then used to add the image to the PDF page.

adding image

Step 5 — Merging PDFs

Now, you’ll merge multiple PDF documents into a single PDF. Make sure you have two PDF files, named document1.pdf and document2.pdf, in your project folder:

using PdfSharpCore.Pdf;
using PdfSharpCore.Pdf.IO;

class Program
{
    static void Main()
    {
        // Open the first PDF document.
        PdfDocument pdfDocument1 = PdfReader.Open("document1.pdf", PdfDocumentOpenMode.Import);

        // Open the second PDF document.
        PdfDocument pdfDocument2 = PdfReader.Open("document2.pdf", PdfDocumentOpenMode.Import);

        // Create a new PDF document
        PdfDocument mergedDocument = new PdfDocument();

        // Copy pages from the first document to the merged document.
        foreach (PdfPage page in pdfDocument1.Pages)
        {
            mergedDocument.AddPage(page);
        }

        // Copy pages from the second document to the merged document.
        foreach (PdfPage page in pdfDocument2.Pages)
        {
            mergedDocument.AddPage(page);
        }

        // Save the merged document.
        mergedDocument.Save("merged_document.pdf");
        mergedDocument.Close();
    }
}

The code demonstrates the process of combining two PDF documents, document1.pdf and document2.pdf, into a new PDF named merged_document.pdf.

Step 6 — Extracting pages from a PDF

Next, you’ll extract specific pages from an existing PDF and save them as a new PDF:

using PdfSharpCore.Pdf.IO;

class Program
{
    static void Main()
    {
        // Open the existing PDF document.
        PdfDocument pdfDocument = PdfReader.Open("input.pdf", PdfDocumentOpenMode.Import);

        // Create a new PDF document.
        PdfDocument newDocument = new PdfDocument();

        // Specify the page numbers to be extracted (e.g. 1 and 3).
        int[] pageNumbersToExtract = { 1, 3 };

        // Copy the specified pages from the existing document to the new document.
        foreach (int pageNumber in pageNumbersToExtract)
        {
            PdfPage page = pdfDocument.Pages[pageNumber - 1]; // Subtract 1 as pages are zero-based
            newDocument.AddPage(page);
        }

        // Save the new document with extracted pages.
        newDocument.Save("extracted_pages.pdf");
        newDocument.Close();
    }
}

This code extracts specific pages from an existing PDF document and creates a new PDF document containing only the extracted pages.

Step 7 — Deleting pages from a PDF

Finally, you’ll delete specific pages from a PDF document. In this example, you’re deleting pages 2 and 4 and creating a new PDF document without those pages:

using PdfSharpCore.Pdf.IO;

class Program
{
    static void Main()
    {
        // Open the existing PDF document.
        PdfDocument pdfDocument = PdfReader.Open("input.pdf", PdfDocumentOpenMode.Import);

        // Create a new PDF document.
        PdfDocument newDocument = new PdfDocument();

        // Specify the page numbers to be deleted (e.g. 2 and 4).
        int[] pageNumbersToDelete = { 2, 4 };

        // Copy all pages from the existing document to the new document, excluding the ones to be deleted.
        for (int pageNumber = 0; pageNumber < pdfDocument.PageCount; pageNumber++)
        {
            if (!pageNumbersToDelete.Contains(pageNumber + 1))
            {
                PdfPage page = pdfDocument.Pages[pageNumber];
                newDocument.AddPage(page);
            }
        }

        // Save the new document with deleted pages.
        newDocument.Save("modified_document.pdf");
        newDocument.Close();
    }
}

Nutrient .NET SDK

While both PdfSharpCore and Nutrient .NET SDK offer PDF manipulation capabilities for .NET applications, Nutrient .NET SDK stands out as a more comprehensive and feature-rich solution. Here are some reasons why Nutrient .NET SDK can be considered better:

Extensive capabilities — Nutrient .NET SDK provides a wide range of functionalities, including OCR, barcode processing, PDF editing, document conversion, scanning, and image processing. It outperforms many other .NET document imaging technologies available on the market.
OCR (optical character recognition) — Nutrient .NET SDK supports OCR, which enables the extraction of text from scanned documents and images. This feature is essential for applications that require text recognition and data extraction from PDFs.
Advanced PDF editing — Nutrient .NET SDK offers more advanced PDF editing capabilities, such as editing text, images, and annotations, as well as adding watermarks, digital signatures, and form fields to PDF documents.
Comprehensive support — Nutrient has a reputation for providing excellent customer support and regular updates to address any issues or introduce new features. This ensures that your application remains up to date and well-maintained.
Integration-friendly — Nutrient .NET SDK is designed to seamlessly integrate into your existing .NET applications, making it easier to adopt and implement in your projects.
Documentation and resources — Nutrient offers comprehensive documentation, tutorials, and sample code, making it easier for developers to get started and leverage the library’s full potential.

To get started with Nutrient .NET SDK, follow our getting started guide.

Conclusion

In this tutorial, you explored the PdfSharpCore library for manipulating and editing PDFs in .NET applications. You learned how to add text and images, and merge, extract, and even delete pages from existing PDF documents. While PdfSharpCore is a valuable open source option for basic PDF manipulation needs, consider using Nutrient .NET SDK for more advanced features, such as OCR, PDF editing, barcode processing, and document conversion, making it a comprehensive solution for demanding .NET projects.

To see a list of all web frameworks, you can contact our Sales team. Or, launch our demo to see our viewer in action.

FAQ

What is PdfSharpCore, and how can it help with PDF manipulation in .NET?

PdfSharpCore is an open source library for .NET that allows developers to generate, modify, and manipulate PDF documents easily.

Can I add images to existing PDFs using PdfSharpCore?

Yes, PdfSharpCore allows you to add images to existing PDF documents by using the appropriate methods to load and position images on PDF pages.

How can I merge multiple PDFs into a single document?

You can merge multiple PDF documents in PdfSharpCore by opening each PDF and copying its pages into a new PDF document.

Is there a way to extract specific pages from a PDF using PdfSharpCore?

Yes, PdfSharpCore provides functionality to extract specific pages from a PDF and save them into a new document.

What are the advantages of using the Nutrient .NET SDK over PdfSharpCore?

The Nutrient .NET SDK offers advanced features like OCR, barcode processing, and comprehensive PDF editing capabilities, making it a better choice for complex PDF manipulation tasks.