Pdf-parser is a command-line program that parses and analyses PDF documents. It provides features to extract raw data from PDF documents, like compressed images. pdf-parser can deal with malicious PDF documents that use obfuscation features of the PDF language.[1] The tool can also be used to extract data from damaged or corrupt PDF documents.

pdf-parser
Original author(s)Didier Stevens
Initial releaseMay 2, 2008 (2008-05-02)
Written inPython programming language
Operating systemMultiplatform, including smart phones
TypePDF software
LicensePublic domain
Websiteblog.didierstevens.com/programs/pdf-tools/

References

edit
  1. ^ PDF Babushka by Bojan Zdrnja, Internet Storm Center, January 14, 2010