Extract Text and Table – Maintain PDF structure


I am looking for Python code that extracts both text and tables from a PDF while preserving the order in which they appear on the page. I tried pdfplumber, but it extracts the text and the tables separately.

I tried extracting the text page by page, line by line.

The problem is that the table cell text is extracted along with the raw text.

I expect the raw text as-is and each table formatted as Markdown, all appended to a single list, text[], in the same order as in the PDF.
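
One possible direction, as a minimal sketch rather than a tested solution: collect the text lines and the table bounding boxes for each page, drop the lines that fall inside a table, and sort what remains by vertical position. The helper names `table_to_markdown`, `merge_in_reading_order`, and `extract_pdf` are made up for illustration. The sketch assumes a single-column layout (it only compares vertical positions) and a recent pdfplumber release that provides `page.extract_text_lines()` alongside `page.find_tables()`.

```python
def table_to_markdown(rows):
    """Render rows (lists of cell strings or None) as a Markdown table.

    The first row is treated as the header, which is an assumption.
    """
    if not rows:
        return ""
    width = max(len(row) for row in rows)

    def clean(cell):
        # Empty cells come back as None; newlines and pipes would break the table.
        if cell is None:
            return ""
        return str(cell).replace("\n", " ").replace("|", "\\|").strip()

    norm = [[clean(c) for c in row] + [""] * (width - len(row)) for row in rows]
    out = ["| " + " | ".join(norm[0]) + " |",
           "| " + " | ".join(["---"] * width) + " |"]
    out += ["| " + " | ".join(row) + " |" for row in norm[1:]]
    return "\n".join(out)


def merge_in_reading_order(lines, tables):
    """Interleave text lines and tables by their top coordinate.

    lines:  list of (top, bottom, text)
    tables: list of (top, bottom, rows)
    Lines whose vertical midpoint lies inside a table are dropped,
    because their content is already present in the table rows.
    """
    def inside_table(top, bottom):
        mid = (top + bottom) / 2
        return any(t_top <= mid <= t_bottom for t_top, t_bottom, _ in tables)

    items = [(top, text) for top, bottom, text in lines
             if not inside_table(top, bottom)]
    items += [(t_top, table_to_markdown(rows)) for t_top, _, rows in tables]
    items.sort(key=lambda item: item[0])
    return [content for _, content in items]


def extract_pdf(path):
    """Return text[] for the whole PDF, page by page, in reading order."""
    import pdfplumber  # third-party: pip install pdfplumber

    text = []
    with pdfplumber.open(path) as pdf:
        for page in pdf.pages:
            # Table.bbox is (x0, top, x1, bottom).
            tables = [(t.bbox[1], t.bbox[3], t.extract())
                      for t in page.find_tables()]
            lines = [(ln["top"], ln["bottom"], ln["text"])
                     for ln in page.extract_text_lines()]
            text.extend(merge_in_reading_order(lines, tables))
    return text
```

Calling `extract_pdf("report.pdf")` (the file name is a placeholder) would return a list in which plain lines and Markdown tables alternate as they do on the page. For multi-column pages or text that sits beside a table, the vertical-only check would need to be extended to compare horizontal positions as well.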


