Use Case
Extract tabular data from an unstructured PDF document into CSV format.
Resources
Download the original PDF document
Integration
The SDK can be integrated into your project in two ways: through the Command Line Utility or programmatically.
Click here to create your free trial license key.
> Command Line
PDFix provides simple, fast, automated PDF processing from the command line. The PDFix Command Line Utility is the easiest way to integrate SDK functionality into your solution and is available for Windows, macOS and Linux. Learn more about the Command Line Utility.
$ cd /pdfix_mac/bin
$ ./pdfix_app support@pdfix.net 3bE31NaixzFE58ir -pdf2table /Users/admin/Documents/input.pdf output_csv
Output:
PDF to TABLE
2 tables found
Success
This command extracts the tables detected in the PDF into CSV files. The output argument should point to the folder where the separate CSV files will be saved.
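When the utility is called from a script rather than typed by hand, the same command can be assembled and run from Python. The sketch below is not part of the SDK: the function names are hypothetical, and it assumes that the utility exits with a non-zero status on failure, that the output folder may need to be created beforehand, and that the extracted files carry a `.csv` extension. The argument order follows the command shown above.

```python
import subprocess
from pathlib import Path


def build_pdf2table_command(app, email, license_key, input_pdf, output_dir):
    """Return the argument list for the -pdf2table call shown above."""
    return [str(app), email, license_key, "-pdf2table", str(input_pdf), str(output_dir)]


def extract_tables(app, email, license_key, input_pdf, output_dir):
    """Run the utility and return the CSV files found in output_dir afterwards."""
    output_dir = Path(output_dir)
    # Assumption: the folder is created up front in case the utility does not do so.
    output_dir.mkdir(parents=True, exist_ok=True)
    cmd = build_pdf2table_command(app, email, license_key, input_pdf, output_dir)
    # Assumption: a failed run is reported through a non-zero exit status.
    subprocess.run(cmd, check=True)
    # Assumption: each extracted table is written with a .csv extension.
    return sorted(output_dir.glob("*.csv"))


# Example call (placeholder credentials, requires the utility to be installed):
# extract_tables("./pdfix_app", "you@example.com", "YOUR_LICENSE_KEY",
#                "/Users/admin/Documents/input.pdf", "output_csv")
```

Passing the arguments as a list avoids shell quoting problems when a path contains spaces.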
The SDK can also be called directly from code to extract tables from a PDF document and save them to CSV output. Integrating at code level gives you full control over the PDF data processing.
Result
Each table found is extracted to a separate CSV file, as displayed below:
RAW CSV data:
,"Great Britain","The Netherlands",,,
"1086","754",,,,
"1270","759",,,"957",
"1300","755",,"1,482","957",
"1348","777","876","1,376","1,030",
"1400","1,090","1,245","1,601","885",
"1450","1,055","1,432","1,668","889",
"1500","1,114","1,483","1,403","889",
"1570","1,143","1,783","1,337","990",
"1600","1,123","2,372","1,244","944",
"1650","1,100","2,171","1,271","820",
"1700","1,630","2,403","1,350","880",
,"1,563",,,,
"1750","1,710","2,440","1,403","910",
"1800","2,080","2,617","1,244","962",
,,"1,752",,,
"1820","2,133","1,953","1,376","1,087",
"1850","2,997","2,397","1,350","1,144",
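The rows above contain quoted values with thousands separators, empty cells and a trailing comma, so a little clean-up helps before the numbers are used. A minimal Python sketch of that post-processing, using only the standard `csv` module, might look like this. The helper names `parse_cell` and `load_table` are hypothetical, and the sketch assumes the file uses plain ASCII double quotes.

```python
import csv
import io


def parse_cell(cell):
    """Convert '1,482' to 1482, return None for an empty cell, keep other text as-is."""
    text = cell.strip()
    if not text:
        return None
    digits = text.replace(",", "")
    return int(digits) if digits.isdigit() else text


def load_table(csv_text):
    """Parse CSV text into a list of rows with cleaned cells."""
    reader = csv.reader(io.StringIO(csv_text))
    return [[parse_cell(cell) for cell in row] for row in reader]


# Two rows taken from the output above.
sample = '"1300","755",,"1,482","957",\n"1348","777","876","1,376","1,030",\n'
rows = load_table(sample)
print(rows[0])  # [1300, 755, None, 1482, 957, None]
```

The trailing comma on each row produces one extra empty cell, which appears as the final `None`.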
Data Imported to Excel:

Customizing the output
PDFix SDK lets you customize the output through configuration files that affect both the table detection process and the output structure. To learn more about the configuration files, please follow the Documentation. When using the SDK programmatically, you can shape the output to fit your needs without these limits.
Contact us if you need help with integration.