(Note: Oct 7th, 2019)
As of Oct. 2019, I launched a documentation site and Google Colab notebook for tabula-py. The FAQ would be good place to execute accurate extraction.
Today, I released tabula-py 0.3.0, which extracts table from PDF into Python pandas’s DataFrame.
chezou/tabula-py
_tabula-py - Simple wrapper of tabula-java: extract table from PDF into pandas DataFrame_github.com[](https://github.com/chezou/tabula-py)
It is simple wrapper of tabula-java and it enables you to extract table into DataFrame or JSON with Python.