
Over the years, I’ve been using a variety of open-source software tools for solving all sorts of issues with PDF documents. This post is an attempt to (finally) bring together my go-to PDF analysis and processing tools and commands for a variety of common tasks in one single place. It was guided to a great degree by the PDF-related issues I’ve encountered myself in my day-to-day work. Some of these tasks could be done using other tools (including ones that are not mentioned here), and in some cases these other tools may well be better choices. So there’s probably a fair amount of selection bias here, and I don’t want to make any claims of presenting the “best” way to do any of these tasks. Also, many of the example commands in this post can be further refined to particular needs (e.g. using additional options or alternative output formats), and they are probably best seen as (hopefully useful) starting points for the reader’s own explorations.

All of the tools presented here are published as open source, and most of them have a command-line interface. They all work under Linux (which is the main OS I’m using these days), but most of them are available for other platforms (including Windows) as well.

PDF multi-tools

Before diving into any specific tasks, let’s start with some general-purpose PDF tools and toolkits. Each of these is capable of a wide range of tasks (including some I won’t explicitly address here), and they can be seen as “Swiss Army knives” of PDF processing. Whenever I need to get some PDF processing or analysis done and I’m not sure what tool to use, these are usually my starting points. In the majority of cases, at least one of them turns out to have the functionality I’m looking for, so it’s a good idea to check them out if you’re not familiar with them already.

Xpdf is a PDF viewer and Poppler is a PDF rendering library, and both come with a collection of command-line tools for processing and manipulating PDF files. Poppler is a fork of Xpdf, and it adds a number of unique tools that are not part of the original Xpdf package.

pdfdetach: lists or extracts embedded files (attachments).
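
As a quick illustration of the pdfdetach tool mentioned above, here is a minimal sketch of how listing and extracting attachments could look. The function names are my own placeholders and not part of any tool. The `-list`, `-saveall` and `-o` options are the ones I know from the pdfdetach man page, but it is worth checking them against the version you have installed.

```shell
#!/bin/sh
# Hypothetical helper functions wrapping pdfdetach.
# Assumes pdfdetach (from Xpdf or Poppler) is on the PATH.

# List the files embedded in the PDF given as the first argument.
list_attachments() {
    pdfdetach -list "$1"
}

# Extract all embedded files from the PDF (first argument)
# into an output directory (second argument), creating it if needed.
extract_attachments() {
    mkdir -p "$2" && pdfdetach -saveall -o "$2" "$1"
}
```

With these defined, something like `list_attachments report.pdf` followed by `extract_attachments report.pdf attachments` would first show what is embedded and then write it out, where `report.pdf` and `attachments` are just example names.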
