Auto-Tag PDFs Using IBM Docling AI

The problem this solves

An untagged PDF has no machine-readable structure. Screen readers cannot determine reading order, identify headings, navigate tables, or distinguish figures from body text. This makes the document inaccessible to users with visual impairments and causes it to fail PDF/UA (ISO 14289), WCAG 2.1, and Section 508 audits.

Manually tagging a complex document can take hours – and for organizations remediating large document libraries, manual tagging is not a viable path.

a graphic of computer screen labeled with PDFix and Docling Logo

What this action does

The Auto-Tag PDF action integrates IBM Research’s open-source Docling AI toolkit directly into PDFix Desktop. Docling analyzes the full layout of each PDF page – detecting paragraphs, headings, tables, figures, lists, and captions – and generates a complete, structured tag tree.

It runs entirely on your local machine via Docker, with no data sent to any external service. The model was trained on 81,000 manually labeled pages from diverse document types, delivering enterprise-grade layout detection accuracy at zero cost.

It ships as two variants inside PDFix Desktop:

  • AutoTag (Docling): Processes the PDF and applies a full accessibility tag structure automatically. Use this for straightforward documents or high-volume batch workflows where immediate output is the priority.
  • Create Layout Template (Docling): Generates a layout template JSON file from the document structure detected by Docling. Use this when you need to review, adjust, or reuse the detected layout before committing tags to the PDF – ideal for complex or non-standard document formats.

Actions

Free · LocalAutoTag (Docling)Automatically tags PDF using Docling
Free · LocalCreate Layout Template (Docling)Automatically creates layout template json using Docling, saving it as JSON file

Frequently Asked Questions

What is IBM Docling?

Docling is an open-source AI toolkit developed by IBM Research for document understanding and analysis. It uses the RT-DETR object detection architecture to analyze PDF page layouts, identifying structural elements such as headings, paragraphs, tables, figures, and lists. The PDFix integration uses the Docling model to drive automatic PDF accessibility tagging.

What does auto-tagging mean for PDF accessibility?

PDF tagging is the process of adding a logical structure tree to a PDF document that describes the type and reading order of each content element. Tags allow screen readers, braille displays, and other assistive technologies to correctly interpret document content. Without tags, a PDF is effectively inaccessible. PDF/UA (ISO 14289) requires that all PDFs intended for public use be fully tagged.

Which accessibility standards does this action support?

The AutoTag Docling action produces tag structures that support PDF/UA-1 (ISO 14289-1), WCAG 2.1 Success Criteria 1.3.1 (Info and Relationships) and 1.3.2 (Meaningful Sequence), and Section 508. After tagging, use the VeraPDF Validation action to confirm compliance.

Does this action require an internet connection?

No. After the initial Docker image download, the action runs entirely offline on your local machine. No document data is transmitted to IBM, PDFix, or any third party.

What is the difference between Auto-Tag (Docling) and Create Layout Template (Docling)?

AutoTag (Docling) processes the PDF and writes a complete accessibility tag structure directly to the file in a single step. Use this for one-off documents or when immediate output is the priority.

Create Layout Template (Docling) takes a different approach: instead of tagging the PDF directly, it analyzes the document layout and generates a reusable JSON template file describing the page structure – heading regions, table boundaries, body text areas, footer zones. You can review and adjust this template in PDFix Desktop before applying it. The key advantage is reusability: once a template exists for a given document format, the same template can be applied to every future version of that document automatically, with no repeated work.

This makes Create Layout Template the right choice for recurring documents such as monthly reports, standardized forms, or any document series where the layout is consistent across versions. Learn more about PDFix Layout Templates →

How does this compare to the AutoTag PDF (Amazon Textract) action?

Both actions auto-tag PDFs, but they differ in cost, privacy, and use case. Docling is 100% free, open-source, and runs entirely locally – making it suitable for sensitive or regulated documents with no per-document cost. Amazon Textract is a cloud-based service that requires an AWS account, sends documents to Amazon’s servers for processing, and is billed per page. Textract can offer advantages for scanned or low-quality documents where cloud OCR accuracy is a priority.