Can PDFluent read PDFs created by PDFBox?

Yes. PDFluent reads standard PDF files regardless of which library produced them.

PDFBox is free (Apache 2.0). Does switching to PDFluent cost money?

PDFluent has a 30-day free trial. After that, a commercial license is required. If budget is the primary constraint and you are on JVM, PDFBox remains a viable option.

How does PDFBox PDF/A support compare?

PDFBox has partial PDF/A support via the preflight module. PDFluent validates PDF/A-1, PDF/A-2, and PDF/A-3 and can convert documents to those standards.

Can I call PDFluent from an existing Java service?

PDFluent exposes a C API that can be called from Java via JNI or JNA. This allows incremental migration without rewriting everything at once.

PDFluentSDK

← Editor Download

Migration guides/Apache PDFBox

Migrate from Apache PDFBox to PDFluent

A step-by-step guide for replacing Apache PDFBox with PDFluent. Covers dependency setup, document loading, text extraction, form filling, and saving.

Migrating from Apache PDFBox to PDFluent. Install with cargo add pdfluent@1.0.0-beta.17

Migration steps

Replace the dependency

Remove PDFBox from pom.xml or build.gradle and add pdfluent to Cargo.toml.

Apache PDFBox (before)

<!-- pom.xml -->
<dependency>
    <groupId>org.apache.pdfbox</groupId>
    <artifactId>pdfbox</artifactId>
    <version>3.0.1</version>
</dependency>

PDFluent (after)

# Cargo.toml
[dependencies]
pdfluent = "0.9"

Open a document

PDFBox uses PDDocument.load() with a File or byte array. PDFluent uses Document::open which returns a Result.

Apache PDFBox (before)

import org.apache.pdfbox.pdmodel.PDDocument;
import java.io.File;

PDDocument doc = PDDocument.load(new File("contract.pdf"));

PDFluent (after)

use pdfluent::PdfDocument;

let doc = PdfDocument::open("contract.pdf")?;

Extract text

PDFBox requires a PDFTextStripper instance and produces a single string for the whole document. PDFluent extracts per-page.

Apache PDFBox (before)

import org.apache.pdfbox.text.PDFTextStripper;

PDFTextStripper stripper = new PDFTextStripper();
stripper.setStartPage(1);
stripper.setEndPage(1);
String text = stripper.getText(doc);

PDFluent (after)

let text = doc.page(1)?.text()?;

// All pages
for i in 0..doc.page_count() {
    let text = doc.page(i)?.text()?;
    println!("{}", text);
}

Fill AcroForm fields

PDFBox accesses fields through PDDocumentCatalog and PDAcroForm. PDFluent uses a direct acroform() handle.

Apache PDFBox (before)

import org.apache.pdfbox.pdmodel.interactive.form.PDAcroForm;

PDAcroForm acroForm = doc.getDocumentCatalog().getAcroForm();
acroForm.getField("first_name").setValue("Jane");
acroForm.getField("last_name").setValue("Smith");
acroForm.flatten();

PDFluent (after)

let mut form = doc.acroform()?;
form.set_field("first_name", "Jane")?;
form.set_field("last_name", "Smith")?;
form.flatten()?;

Save and close

PDFBox requires explicit close(). PDFluent drops the document when it goes out of scope; call save() to write.

Apache PDFBox (before)

doc.save("output.pdf");
doc.close(); // must call close() to release file handles

PDFluent (after)

doc.save("output.pdf")?;
// doc drops automatically at end of scope

Things to watch out for

!PDFBox page numbers are 1-indexed in PDFTextStripper but 0-indexed in PDPageTree. PDFluent always uses 0-indexed.
!PDFBox does not support XFA forms. If your PDFs use XFA, PDFluent handles them natively.
!PDFBox PDDocument must be closed explicitly or resource leaks occur. PDFluent documents drop cleanly with Rust ownership.

Frequently asked questions

Related guides

PDFluent vs PDFBox Migrate from iText

Download PDFluent PDFluent vs PDFBox