PDFix Command-Line


The essential syntax is:

pdfix_app {email} {key} {operation} {input} {output} {options}

PDFix provides simple, fast, automated PDF processing from the command line. PDFix CLI is the easiest way to integrate PDFix SDK functionality into your solutions. It is available on Mac, Windows and Linux.

Parameters:

email      required   registered email address
key        required   license key
operation  required   operation to run, for example -pdf2html
input      required   path to the PDF file to process
output     required   path to the output file or directory*
options    optional   operation-specific options

* Please follow the instructions in the operation details.
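
As a minimal sketch of the syntax above, the shell wrapper below fills in the email and key so that only the operation, input, output and options are typed on each call. The function name `pdfix` and the environment variables `PDFIX_EMAIL` and `PDFIX_KEY` are assumptions made for this example; they are not part of PDFix.

```shell
# Hypothetical wrapper: pdfix, PDFIX_EMAIL and PDFIX_KEY are names chosen for
# this sketch. Only pdfix_app and its argument order come from the syntax above.
pdfix() {
    # Refuse to run without credentials.
    if [ -z "${PDFIX_EMAIL:-}" ] || [ -z "${PDFIX_KEY:-}" ]; then
        echo "pdfix: set PDFIX_EMAIL and PDFIX_KEY first" >&2
        return 1
    fi
    # An operation, an input and an output are always required.
    if [ "$#" -lt 3 ]; then
        echo "usage: pdfix {operation} {input} {output} [options]" >&2
        return 2
    fi
    # Pass everything else through unchanged.
    pdfix_app "$PDFIX_EMAIL" "$PDFIX_KEY" "$@"
}

# Usage: pdfix -pdf2txt input.pdf output.txt
```

Keeping the credentials in the environment also keeps them out of scripts that are shared or committed to version control.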

PDF to HTML -pdf2html

pdfix_app {EMAIL} {LICENSEKEY} -pdf2html input.pdf html/index.html -responsive
 

Converts a PDF to HTML. The output is the HTML file created during conversion. All other files generated during the conversion are saved in the same folder as the output file. The following options are available:

-responsive creates responsive HTML; a fixed layout is created if not set
-js exports document JavaScript into the HTML
-fonts exports embedded TrueType fonts into the HTML using CSS3
-textsize retains the original text size in the created HTML
-noexternalcss uses inline CSS instead of an external file
-noexternaljs uses inline JavaScript instead of an external file
-noexternalimg embeds images as base64-encoded data instead of external files
-noexternalfont embeds fonts as base64-encoded data instead of external files
-graybg uses a gray background and page padding
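
Because the converter saves its generated files next to the output file, giving each document its own folder may help keep batch conversions from mixing their files. The sketch below assumes that layout and assumes the output folder should exist before the call. The function name `pdfs_to_html` and the `PDFIX_EMAIL` and `PDFIX_KEY` variables are hypothetical; only `-pdf2html` and `-responsive` come from the text above.

```shell
# Convert every PDF in a directory to responsive HTML, one folder per document.
# pdfs_to_html, PDFIX_EMAIL and PDFIX_KEY are hypothetical names for this sketch.
pdfs_to_html() (
    src_dir=$1
    out_dir=$2
    for pdf in "$src_dir"/*.pdf; do
        [ -e "$pdf" ] || continue          # the pattern matched no files
        name=$(basename "$pdf" .pdf)
        # Assumption: create the target folder up front.
        mkdir -p "$out_dir/$name" || exit 1
        pdfix_app "$PDFIX_EMAIL" "$PDFIX_KEY" -pdf2html \
            "$pdf" "$out_dir/$name/index.html" -responsive
    done
)

# Usage: pdfs_to_html ./pdfs ./html
```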

Add Tags -addtags

pdfix_app {EMAIL} {LICENSEKEY} -addtags input.pdf output.pdf
 

Adds tags to the PDF. No options are currently available in the CLI.

Extract Text -pdf2txt

pdfix_app {EMAIL} {LICENSEKEY} -pdf2txt input.pdf output.txt
 

Extracts text from the PDF into a plain text file. No options are currently available in the CLI.

Extract Tables -pdf2table

pdfix_app {EMAIL} {LICENSEKEY} -pdf2table input.pdf output
 

Extracts tables detected in the PDF into CSV files. The output should point to the folder where the separate CSV files will be saved. No options are currently available in the CLI.
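
Since the output of `-pdf2table` is a folder, not a single file, a script would typically prepare that folder and then pick up whatever CSV files appear in it. A minimal sketch, assuming the folder may need to be created beforehand and that the generated files carry a `.csv` extension (the file naming is not specified here). `extract_tables`, `PDFIX_EMAIL` and `PDFIX_KEY` are hypothetical names.

```shell
# Extract tables into a folder and list the CSV files that were produced.
# extract_tables, PDFIX_EMAIL and PDFIX_KEY are hypothetical names.
extract_tables() (
    pdf=$1
    out_dir=$2
    # Assumption: create the output folder up front.
    mkdir -p "$out_dir" || exit 1
    pdfix_app "$PDFIX_EMAIL" "$PDFIX_KEY" -pdf2table "$pdf" "$out_dir" || exit 1
    # Assumption: results end in .csv; their exact names are not documented here.
    for csv in "$out_dir"/*.csv; do
        [ -e "$csv" ] && echo "table: $csv"
    done
    exit 0
)

# Usage: extract_tables input.pdf output
```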

Extract Images -pdf2image

pdfix_app {EMAIL} {LICENSEKEY} -pdf2image input.pdf output -page_width 1200 -image_format 1 -image_quality 75
 

Extracts images from the PDF. The output should point to the folder where the images will be saved. The following options are available:

-page_width width of the rendered page in pixels, used for scaling the images
-image_format image format (0 for PNG, 1 for JPG)
-image_quality image quality; for JPG this sets the compression level, for other formats it is ignored
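
The numeric codes for `-image_format` are easy to mix up, so a thin wrapper can translate a format name into its code and check the numeric values before calling the tool. This is only a sketch: `extract_images`, `PDFIX_EMAIL` and `PDFIX_KEY` are hypothetical names, the default quality of 75 is simply the value used in the example command, and creating the output folder beforehand is an assumption.

```shell
# Extract images, accepting "png" or "jpg" instead of the numeric format code.
# extract_images, PDFIX_EMAIL and PDFIX_KEY are hypothetical names.
extract_images() (
    pdf=$1
    out_dir=$2
    width=$3
    format=$4
    quality=${5:-75}                      # example value from the command above
    case $format in
        png) code=0 ;;
        jpg) code=1 ;;
        *) echo "extract_images: format must be png or jpg" >&2; exit 2 ;;
    esac
    # Width and quality must be plain non-negative integers.
    for n in "$width" "$quality"; do
        case $n in
            ''|*[!0-9]*) echo "extract_images: width and quality must be integers" >&2; exit 2 ;;
        esac
    done
    mkdir -p "$out_dir" || exit 1
    if [ "$code" -eq 1 ]; then
        pdfix_app "$PDFIX_EMAIL" "$PDFIX_KEY" -pdf2image "$pdf" "$out_dir" \
            -page_width "$width" -image_format "$code" -image_quality "$quality"
    else
        # Quality is ignored for PNG, so it is left out.
        pdfix_app "$PDFIX_EMAIL" "$PDFIX_KEY" -pdf2image "$pdf" "$out_dir" \
            -page_width "$width" -image_format "$code"
    fi
)

# Usage: extract_images input.pdf output 1200 jpg 75
```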

Flatten Annotations -flatten

pdfix_app {EMAIL} {LICENSEKEY} -flatten input.pdf output.pdf
 

Flattens all annotations into the PDF content. No options are currently available in the CLI.

OCR -ocr

pdfix_app {EMAIL} {LICENSEKEY} -ocr input.pdf output.pdf -lang eng -data ocr/tesseract
 

Converts scanned or image-only PDF documents into searchable, editable PDF files. The following options are available:

-lang OCR language; the matching language traineddata and special data files must be downloaded first
-data path to the Tesseract OCR data, which must be downloaded first

Make Accessible -makeaccessible

pdfix_app {EMAIL} {LICENSEKEY} -makeaccessible input.pdf output.pdf -lang en -title Title
 

Makes the PDF accessible. If you have an image-only PDF, please run the OCR command first. The following options are available:

-lang document language
-title document title
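
For image-only PDFs, the OCR and Make Accessible steps can be chained: run `-ocr` first, then pass its result to `-makeaccessible`. The sketch below reuses the option values from the two example commands (`-lang eng` for OCR, `-lang en` for the document language, and `ocr/tesseract` as the default data path). The function name `ocr_and_make_accessible` and the `PDFIX_EMAIL` and `PDFIX_KEY` variables are hypothetical.

```shell
# Run OCR on a scanned PDF, then make the result accessible.
# ocr_and_make_accessible, PDFIX_EMAIL and PDFIX_KEY are hypothetical names.
ocr_and_make_accessible() (
    input=$1
    output=$2
    title=$3
    data_dir=${4:-ocr/tesseract}          # path used in the OCR example
    # Keep the intermediate OCR result in a temporary folder.
    tmp=$(mktemp -d) || exit 1
    trap 'rm -rf "$tmp"' EXIT
    ocr_pdf=$tmp/ocr.pdf
    pdfix_app "$PDFIX_EMAIL" "$PDFIX_KEY" -ocr "$input" "$ocr_pdf" \
        -lang eng -data "$data_dir" || exit 1
    pdfix_app "$PDFIX_EMAIL" "$PDFIX_KEY" -makeaccessible "$ocr_pdf" "$output" \
        -lang en -title "$title"
)

# Usage: ocr_and_make_accessible scan.pdf accessible.pdf "Title"
```

Stopping after a failed OCR step avoids tagging a document that still has no text layer.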