Western Libraries

Tabula: a tool to extract data from PDF documents

September 10, 2015

Written by:  Vince Gray

A free and open source browser-driven utility, Tabula, is available to extract data (text or numeric) from tables stored in PDF documents. It will work with text-based PDF files only, not with image-based PDF documents. Output may be saved as CSV or Excel.