I installed the latest PDFBox on my Mac via pip.
I imported it and called the extract_text() method, but the call never returns, even for a 196 KB file.
Please help.
>>> import pdfbox as p, os
>>> os.path.exists(f)  # f is the file path
True
>>> pp = p.PDFBox()
>>> pp.extract_text(f)
The extract_text(f) call never returns; it runs indefinitely.
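
To tell a genuine hang from a slow run, one thing I could try is running the same call in a child process with a time limit. This is only a minimal sketch: run_with_timeout and check_extract_text are hypothetical helper names, the 60-second limit is arbitrary, and I am assuming extract_text() behaves the same when called from a `python -c` one-liner as it does in the REPL.

```python
import subprocess
import sys


def run_with_timeout(code, timeout_s, *args):
    """Run a Python snippet in a child process.

    Returns the child's exit code, or None if it was killed
    after exceeding timeout_s seconds.
    """
    cmd = [sys.executable, "-c", code, *args]
    try:
        return subprocess.run(cmd, timeout=timeout_s).returncode
    except subprocess.TimeoutExpired:
        return None


def check_extract_text(pdf_path, timeout_s=60):
    """Hypothetical helper: repeat the call from the transcript with a time limit."""
    # Same calls as in the REPL session; the file path arrives as sys.argv[1].
    code = "import sys, pdfbox; pdfbox.PDFBox().extract_text(sys.argv[1])"
    return run_with_timeout(code, timeout_s, pdf_path)
```

A result of None from check_extract_text(f) would mean the call was still running when the limit was reached; a non-zero exit code would mean it failed, with any error message printed to the terminal.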