0

How can i extract data from pdf to excel using python3 ?

29th Jul 2018, 5:39 AM
Radhia MISSOUM
4 Antworten
+ 3
there are moduls for pdf and excel on pypi. The pdf module only copes with easy pdfs.
29th Jul 2018, 6:55 AM
Oma Falk
Oma Falk - avatar
0
Good luck. Scraping PDFs is a hard problem. Even LibreOffice has a hard time with it. I'm mostly posting just to be notified if someone posts a solution.
29th Jul 2018, 6:03 AM
Janning⭐
Janning⭐ - avatar
0
You can use the pdfquery library but dont expect it to be 100% accurate. Better off using a professional converter then scraping off the text file if you care about accuracy
29th Jul 2018, 6:28 AM
JME
0
thank you
29th Jul 2018, 8:18 AM
Radhia MISSOUM