PDFix SDK

Powerful PDF Accessibility API for Seamless Workflow Integration

For Windows, Linux and macOS

PDFix Command-Line Interface

The Ultimate PDF Accessibility & Automation Solution

PDFix SDK is a high-performance C++ PDF library that enables developers to automate PDF accessibility and document processing at scale. Easily integrate our API to:

  • Auto-tag PDFs for full compliance with PDF/UA and WCAG
  • Generate accessible PDFs with a logical reading order for screen readers
  • Detect structure with AI: headings, lists, and tables
  • Run high-speed batch processing for enterprise-scale accessibility workflows
  • Integrate simply: REST API for C++, Java, Python, and .NET

“There are numerous excellent SDKs for the various processes that are digitized using PDF. The PDFix library focuses on and excels at automatically recognizing logical structures, which are essential for Liquid Mode in Acrobat Reader (responsive content rendering) and accessibility.”

– Michael Karbe, Managing Director of Actino Software GmbH


Explore PDFix SDK Solutions

PDF Accessibility

PDF Conversion

Data Extraction


Automated PDF Accessibility

Transform unstructured PDFs into accessible, compliant documents effortlessly with PDFix SDK’s automated PDF accessibility solutions. Our AI-powered technology ensures WCAG & PDF/UA compliance while streamlining remediation workflows for businesses, developers, and document specialists.

Why Choose PDFix SDK

AI-Powered Auto-Tagging for Faster Remediation

PDFix SDK uses advanced layout recognition to automatically detect and tag:

  • Headings & paragraphs
  • Tables & lists
  • Images
  • Logical reading order

This automated PDF tagging drastically reduces manual work, enabling large-scale document accessibility with precision.

No-Code Batch Actions for Effortless Document Workflows

  • A no-code framework for automating any type of PDF processing
  • Uses a JSON configuration file to define a sequence of actions – editing content, modifying structure, and more
  • Ideal for custom workflows (e.g., bulk editing, document preparation)
  • Streamlines document remediation workflows
  • Highly flexible and scalable for various PDF operations
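
To make the idea of a JSON-defined action sequence concrete, here is a minimal Python sketch of the general pattern. It is not the PDFix batch-action schema: the action names (`set_title`, `set_language`, `remove_empty_pages`), the config layout, and the plain dictionary standing in for a document are all hypothetical, chosen only to show how a list of actions can be read from JSON and applied in order.

```python
import json

# Hypothetical configuration: the action names and fields below are invented
# for illustration and are NOT the PDFix batch-action schema.
CONFIG = """
{
  "actions": [
    {"name": "set_title", "value": "Annual Report"},
    {"name": "set_language", "value": "en-US"},
    {"name": "remove_empty_pages"}
  ]
}
"""


def set_title(doc, params):
    doc["title"] = params["value"]


def set_language(doc, params):
    doc["lang"] = params["value"]


def remove_empty_pages(doc, params):
    # Keep only pages that contain something other than whitespace.
    doc["pages"] = [page for page in doc["pages"] if page.strip()]


# Maps each (hypothetical) action name to the function that performs it.
HANDLERS = {
    "set_title": set_title,
    "set_language": set_language,
    "remove_empty_pages": remove_empty_pages,
}


def run_actions(doc, config_text):
    """Apply each configured action to doc, in the order listed."""
    config = json.loads(config_text)
    for action in config["actions"]:
        handler = HANDLERS.get(action["name"])
        if handler is None:
            raise ValueError(f"unknown action: {action['name']}")
        handler(doc, action)
    return doc


if __name__ == "__main__":
    # A plain dict stands in for a real PDF document object.
    document = {"title": "", "lang": "", "pages": ["Intro", "   ", "Summary"]}
    print(run_actions(document, CONFIG))
```

Keeping the sequence in data rather than code means a workflow can be reordered or extended by editing the JSON file alone, which is the appeal of a no-code approach.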

Accessibility Action Commands

  • Set of Batch Actions focused on automated PDF accessibility remediation
  • Automates accessibility fixes without breaking the document structure
  • Run accessibility actions from a JSON file – no coding needed
  • Includes built-in tools like the Make Accessible Command to fix common issues in non-tagged PDFs
  • Supports fully custom workflows
  • Helps achieve PDF/UA and WCAG conformance with minimal effort

PDFix SDK Command-Line Interface (CLI)

  • Automate PDF processing with a powerful and scriptable CLI tool
  • Easy integration into any workflow
  • Ideal for accessibility automation – no coding required
  • Access multiple subcommands to handle PDF accessibility and structure
  • Stable and compatible across platforms and scripting environments
  • Seamlessly integrate accessibility into automated projects or pipelines
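
As a rough illustration of wiring a command-line tool into a pipeline, the Python sketch below runs a caller-supplied command once per PDF in a folder and records each exit code. It deliberately does not assume any PDFix subcommand or flag: `run_batch` is a hypothetical helper, and the command to run must be taken from the vendor's CLI documentation. A small Python one-liner stands in for it here.

```python
import subprocess
import sys
from pathlib import Path


def run_batch(command, input_dir, pattern="*.pdf"):
    """Run `command + [file]` for each matching file.

    Returns a dict mapping file name to the process exit code.
    """
    results = {}
    for path in sorted(Path(input_dir).glob(pattern)):
        completed = subprocess.run(
            [*command, str(path)],
            capture_output=True,  # keep tool output out of the pipeline log
            text=True,
        )
        results[path.name] = completed.returncode
    return results


if __name__ == "__main__":
    # Placeholder command: replace it with the real CLI invocation from the
    # vendor documentation. This one-liner only echoes the file path.
    placeholder = [sys.executable, "-c", "import sys; print(sys.argv[1])"]
    for name, code in run_batch(placeholder, ".").items():
        print(name, "ok" if code == 0 else f"failed ({code})")
```

Collecting exit codes per file lets a CI job or scheduler fail the run, or retry only the documents that did not process cleanly.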

PDF Conversion

PDF to HTML

  • Convert PDF to HTML, XML, CSV, and JSON
  • Fixed-layout or responsive HTML with content reflow
  • Tagged PDF to HTML powered by the Derivation algorithm
  • HTML5, CSS3, and JavaScript compatibility
  • Embed PDF content directly into web pages

PDF Forms to HTML Forms

  • Convert PDF forms into fully functional HTML forms
  • Fill out, flatten, and sign form fields in HTML
  • Native support for HTML form elements: text inputs, drop-down lists, checkboxes, radio buttons

PDF to XML

  • Extract and convert PDF data to XML
  • Customize XML output for structured data

PDF Data Scraping & Extraction

Text & Data Extraction

  • Search and extract text from PDFs
  • Recognize and export tables, annotations, and structured data
  • Use regular expressions (RegEx) and pattern matching for precise data capture
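
The following Python sketch shows the kind of pattern matching described above, applied to text that has already been extracted from a PDF. It uses only the standard `re` module, not the PDFix API. The sample text, the field names, and the patterns themselves are assumptions made up for the example, and real documents would need patterns tuned to their own layout.

```python
import re

# Sample text standing in for the output of a PDF text-extraction step.
TEXT = """Invoice No: INV-2024-0117
Date: 2024-03-15
Total due: EUR 1,249.50
Contact: billing@example.com
"""

# Illustrative patterns; each one targets a single kind of field.
PATTERNS = {
    "invoice": re.compile(r"INV-\d{4}-\d{4}"),
    "date": re.compile(r"\b\d{4}-\d{2}-\d{2}\b"),
    "amount": re.compile(r"\b[A-Z]{3} \d{1,3}(?:,\d{3})*\.\d{2}\b"),
    "email": re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+"),
}


def extract_fields(text):
    """Return every match for each named pattern as a list of strings."""
    return {name: pattern.findall(text) for name, pattern in PATTERNS.items()}


if __name__ == "__main__":
    for field, matches in extract_fields(TEXT).items():
        print(field, matches)
```

Compiling the patterns once and naming them keeps the extraction rules in one place, so adding a new field is a one-line change.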

Advanced Data Scraping

  • Extract data for big data analytics, AI, and machine learning
  • Support for data mining, indexing, and automation

Structured Content Extraction

  • Extract text, images, tables, charts, and lists with layout preservation
  • Intelligent document structure recognition for accurate data parsing
  • PDF data mining for structured datasets
  • Convert extracted data into multiple formats:
    • HTML, HTML5, JSON, XML, Word, Excel, CSV
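
Once tabular data has been extracted, writing it out in several formats is usually a small step. The sketch below is plain Python using the standard `csv` and `json` modules and does not call the PDFix API. The sample rows and the helper names `to_csv` and `to_json` are hypothetical and stand in for whatever an extraction step would produce.

```python
import csv
import io
import json

# Hypothetical rows, standing in for a table extracted from a PDF.
ROWS = [
    {"item": "Widget", "qty": 2, "price": 9.5},
    {"item": "Gadget", "qty": 1, "price": 24.0},
]


def to_csv(rows):
    """Serialize a list of dicts to CSV text, using the first row's keys as the header."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=list(rows[0].keys()), lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


def to_json(rows):
    """Serialize the same rows to indented JSON text."""
    return json.dumps(rows, indent=2)


if __name__ == "__main__":
    print(to_csv(ROWS))
    print(to_json(ROWS))
```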

Your PDF Problems Solved in a Few Hours!

Describe your project and our experts will craft a custom solution to save you time, money, and headaches. Get in touch today!


Disclaimer: PDFix provides tools and technology to assist in making your documents accessible, but we do not guarantee 100% document accessibility. Achieving full compliance requires human checks and intervention. Please note that PDFix is a technology provider, not a service provider. The responsibility for document compliance rests with the user.