Automate PDF Accessibility with Smart External Actions

Integrate these PDF accessibility actions into PDFix Desktop, or create a custom Action Pipeline to batch-process PDF files and work toward accessibility compliance at scale. Automation saves significant processing time and reduces manual errors across large volumes of documents.

Explore our extensive library of PDF actions, and check back regularly for updates – new tools and actions are added frequently, reflecting the evolving needs of PDF remediators and ensuring access to the latest PDF accessibility solutions.

How to Add an External Action in PDFix Desktop

Integrating external actions into PDFix Desktop is a fast and flexible way to customize your automated PDF accessibility workflows. You can choose from two action types:

  • Docker-based actions – ideal for scalable, containerized processing
  • Local actions – run via command-line applications on your system

To add an external action, simply select the desired action type and import it into PDFix Desktop. This enables the creation of powerful, reusable automation pipelines for PDF/UA remediation, validation, and batch processing.

For a step-by-step walkthrough, see our guide: How to add the external action.
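For readers who want to try a Docker-based action outside PDFix Desktop first, the following is a minimal sketch of how the Docker commands could be assembled. It only builds the argument lists and does not run Docker. The image name comes from the Docker Hub links on this page; the /data mount point and the placeholder action arguments are assumptions made for illustration, since each action documents its own arguments in its repository.

```python
from pathlib import Path


def docker_pull_command(image: str) -> list[str]:
    """Return the argument list for `docker pull <image>`."""
    return ["docker", "pull", image]


def docker_run_command(image: str, work_dir: Path, action_args: list[str]) -> list[str]:
    """Build a `docker run` call that mounts work_dir into the container.

    /data is a mount point chosen for this sketch, and action_args is passed
    through untouched: the real arguments are defined by each action's README.
    """
    return [
        "docker", "run", "--rm",
        "-v", f"{work_dir}:/data",
        image,
        *action_args,
    ]


if __name__ == "__main__":
    image = "pdfix/ocr-tesseract"  # image name taken from the Docker Hub link on this page
    print(" ".join(docker_pull_command(image)))
    # Placeholder arguments only; replace them with the ones the action documents.
    print(" ".join(docker_run_command(image, Path("/tmp/pdfs"), ["<action-arguments>"])))
```

The resulting lists can be passed to subprocess.run once the real action arguments are filled in.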

PDFix Actions

A Dockerized solution for automated PDF tagging using Paddle and PDFix SDK. Supports PDF tagging, PDFix layout template generation, MathML extraction from images, and adding MathML associated files to PDF Formula tags.

https://hub.docker.com/r/pdfix/pdf-accessibility-paddle

https://github.com/pdfix/action-pdf-accessibility-paddle-docker

AutoTag (Paddle) – Automatically tags a PDF using Paddle.
Create Layout Template JSON (Paddle) – Automatically creates a layout template using Paddle, saving it as a JSON file.
Generate Formula MathML from Image to XML (Paddle) – Automatically generates MathML from an image file using Paddle, saving it as an XML file.
Generate Formula MathML (Paddle) – Automatically generates MathML for Formula tags using Paddle, attaching it as an associated file to each tag.

A Docker image that enhances the accessibility of PDF documents using OpenAI and PDFix SDK. Automatically generates alternate text for images and formulas, summarizes tables, and produces MathML associated files for formulas.

This action requires you to enter your own OpenAI API key as an action parameter; the key is available from your OpenAI account page.

https://hub.docker.com/r/pdfix/alt-text-openai

https://github.com/pdfix/action-pdf-accessibility-openai-docker

Generate Alternate Text (OpenAI) – Automatically generates alternate text for Figure and Formula tags using OpenAI, embedding it into each tag’s Alt attribute.
Generate Alternate Text from Image to Text (OpenAI) – Automatically generates alternate text from an image file using OpenAI, saving the description to a text file.
Generate Alternate Text from XML to Text (OpenAI) – Automatically generates alternate text from an XML file using OpenAI, saving the description to a text file.
Generate Table Summary (OpenAI) – Automatically generates table summaries for Table tags using OpenAI, inserting them into each tag’s Summary attribute.
Generate Table Summary from Image to Text (OpenAI) – Automatically generates a table summary from an image file using OpenAI, saving the description to a text file.
Generate Formula MathML (OpenAI) – Automatically generates MathML for Formula tags using OpenAI, attaching it as an associated file to each tag.
Generate Formula MathML from Image to XML (OpenAI) – Automatically generates MathML from an image file using OpenAI, saving it as an XML file.

How to Get an OpenAI API Key

To begin, go to OpenAI’s official platform website. If you do not have an account yet, create one by following the steps on the site. Otherwise, sign in with the email address and password linked to your OpenAI account, or log in with an existing Google or Microsoft account.
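Once you have a key, it is generally safer to keep it out of scripts and shared configuration files. The sketch below shows one way to read the key from an environment variable and mask it before logging. The variable name OPENAI_API_KEY is an assumption based on the convention used by OpenAI's client libraries, and both helper functions are hypothetical; how the key is then passed to the action's parameter is not covered here.

```python
import os


def load_openai_api_key(env_var: str = "OPENAI_API_KEY") -> str:
    """Read the API key from the environment and fail early if it is missing."""
    key = os.environ.get(env_var, "").strip()
    if not key:
        raise RuntimeError(f"Set the {env_var} environment variable to your OpenAI API key.")
    return key


def mask_key(key: str, visible: int = 4) -> str:
    """Hide all but the last few characters so the key can appear in logs."""
    if len(key) <= visible:
        return "*" * len(key)
    return "*" * (len(key) - visible) + key[-visible:]
```

Failing early on a missing key avoids starting a long batch run that would stop at the first OpenAI request.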

A Docker image that adds an OCR text layer to scanned PDF files using PDFix SDK and Tesseract OCR.

https://hub.docker.com/r/pdfix/ocr-tesseract

https://github.com/pdfix/action-ocr-tesseract-docker

OCR (Tesseract) – Automatically adds an OCR text layer to scanned PDF files using PDFix SDK and Tesseract OCR.

A Docker image that automatically detects the language of a PDF file. It uses a configuration file for customizable options and can be run with various command-line arguments.

https://hub.docker.com/r/pdfix/lang-detect

https://github.com/pdfix/action-detect-language-docker

Detect PDF Language (LangDetect) – Automatically detects the language of a PDF and updates its metadata.
Detect PDF Language to Text (LangDetect) – Automatically detects the language of a PDF and saves the detected language code to a TXT file.

A Docker image that converts HTML pages to PDF/UA files using PDFix SDK and Google Headless Chrome.

https://hub.docker.com/r/pdfix/html-to-pdf

https://github.com/pdfix/action-convert-html-to-pdf-docker

Convert HTML to PDF (Chrome) – Converts HTML to PDF.
Convert URL to PDF (Chrome) – Converts URL to PDF.

A Docker image that automatically generates and applies alternative image descriptions to PDF files.

https://hub.docker.com/r/pdfix/alt-text-vision

https://github.com/pdfix/action-generate-alternate-text-vision-docker

Generate Alternate Text (Vision) – Automatically generates alternate text for Figure tags using Vision, embedding it into each figure’s Alt attribute.
Generate Alternate Text from Image to Text (Vision) – Automatically generates alternate text from an image file using Vision, saving the description to a text file.

A Docker image that validates PDFs using veraPDF, the industry-supported open-source validator that covers all PDF/A and PDF/UA parts and conformance levels. Results can be saved as HTML, XML, or JSON reports. This PDF Accessibility Checker lets users perform PDF validation with customizable validation profiles.

https://hub.docker.com/r/pdfix/verapdf-validation

https://github.com/pdfix/action-validate-pdf-verapdf-docker

Generate PDF Validation Report to HTML (veraPDF) – Generates a PDF validation report in HTML format.
Generate PDF Validation Report to XML (veraPDF) – Generates a PDF validation report in XML format.

A Docker image that automatically checks the grammar of a PDF file using the Arlington PDF Model. The output is a report listing all grammatical errors found in the PDF.

https://hub.docker.com/r/pdfix/arlington-pdf-model

https://github.com/pdfix/action-validate-pdf-arlington-docker

Validate PDF Grammar to HTML (Arlington) – Automatically checks grammar in a PDF using the Arlington PDF Model and saves the report in HTML format.
Validate PDF Grammar to XML (Arlington) – Automatically checks grammar in a PDF using the Arlington PDF Model and saves the report in XML format.

A Java CLI tool for validating the compliance of PDF documents with accessibility standards such as WCAG or PDF/UA.

https://github.com/pdfix/action-validate-pdf-pdfix-java

Generate Duplicate MCID Report to TXT – Validates the PDF content and reports duplicate MCIDs.
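To illustrate what a duplicate MCID check looks for, here is a minimal sketch. It is not the Java tool's implementation, which this page does not describe. It assumes the marked-content identifiers have already been extracted from the PDF as (page_index, mcid) pairs and that an MCID should appear only once per page; the function name and the sample data are hypothetical.

```python
from collections import Counter


def find_duplicate_mcids(marked_content: list[tuple[int, int]]) -> dict[int, list[int]]:
    """Return {page_index: [mcid, ...]} for every MCID used more than once on a page."""
    counts = Counter(marked_content)
    duplicates: dict[int, list[int]] = {}
    for (page, mcid), n in sorted(counts.items()):
        if n > 1:
            duplicates.setdefault(page, []).append(mcid)
    return duplicates


if __name__ == "__main__":
    # Hypothetical (page_index, mcid) pairs, not taken from a real file.
    sample = [(0, 0), (0, 1), (0, 1), (1, 0), (1, 2), (1, 2), (1, 2)]
    print(find_duplicate_mcids(sample))  # {0: [1], 1: [2]}
```

An empty result means no MCID was repeated on any page in the extracted data.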

Disclaimer: PDFix provides tools and technology to assist in making your documents accessible, but we do not guarantee 100% accessibility. Achieving full compliance requires human checks and intervention, as no tool can ensure complete accessibility. Please note that PDFix is a technology provider, not a service provider. The responsibility for document compliance rests with the user.