October 22, 2024
Chicago 12, Melborne City, USA
pdf

Delete content from PDF using Python


I need to cleanse a large number of PDFs from PDF content and leave only an image inside (the structure of the PDFs is always the same).

Here is a screenshot of the PDF content:

enter image description here

The image marked in yellow is the one I want to keep, all those Paths and Texts and the other smaller image are to be deleted. I have checked out some Python libraries for PDF such as PyPDF but it seems to me like it does not allow me to access that content, only comments and annotations and such stuff.

Does anyone have a solution?



You need to sign in to view this answers

Leave feedback about this

  • Quality
  • Price
  • Service

PROS

+
Add Field

CONS

+
Add Field
Choose Image
Choose Video