python

Extract Text and Table – Maintain PDF structure

By admin
October 17, 2024
0 Comments
Less than a minute
3 Views
5 days ago

I am looking for a Python code to extract text and table as the PDF structure, tried using PDFPlumber but it extracts text separately and table separately?

I tried page wise -> line by line text extraction

The table texts are getting extracting along with raw text

I am expecting raw text and table format as markdown and append to text[] following pdf structure

You need to sign in to view this answers

Related Post