Renders possibly unsafe PDF files and outputs harmless PDF files
To do this, each page of the PDF is rendered to an image using PyMuPDF.
The images are then written to a new PDF file using img2pdf. This preserves the visual content of every page,
but removes any scripts, external references, and embedded Flash files.
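
The render-then-rebuild approach described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the function names `sanitize_pdf` and `sanitized_path`, the `_sanitized` output suffix, and the 150 DPI default are all assumptions made for the example.

```python
from pathlib import Path


def sanitized_path(src: str) -> Path:
    """Hypothetical naming scheme: report.pdf -> report_sanitized.pdf."""
    p = Path(src)
    return p.with_name(f"{p.stem}_sanitized.pdf")


def sanitize_pdf(src: str, dpi: int = 150) -> Path:
    """Rasterize every page of src and rebuild a PDF from the images."""
    # Third-party imports live here so the path helper works without them.
    import fitz  # PyMuPDF
    import img2pdf

    pages = []
    with fitz.open(src) as doc:
        for page in doc:
            # Render the page to pixels; scripts and links do not survive this.
            pix = page.get_pixmap(dpi=dpi)
            pages.append(pix.tobytes("png"))

    dst = sanitized_path(src)
    # img2pdf wraps the PNG images in a fresh PDF, one image per page.
    dst.write_bytes(img2pdf.convert(pages))
    return dst
```

Because the output contains only raster images, a higher `dpi` value gives sharper pages at the cost of a larger file.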
Installation:
git clone https://github.com/lacioffi/PDFSanitizer
cd PDFSanitizer
pip install -r requirements.txt
Usage:
Credits:
PyMuPDF - By Jorj X. McKie (@JorjMcKie)
img2pdf - By Johannes Schauer