Catch PDF structural errors and syntax violations with the specification-derived Arlington PDF Model.
Stop guessing if your PDF follows ISO 32000-2 specifications – validate the PDF with machine-readable precision. The Arlington PDF Model automatically checks every PDF object against comprehensive syntax rules, identifying structural errors that could cause accessibility issues or violate PDF standards.
PDFix integrates the Arlington PDF Model directly through Docker, bringing validation technology to your desktop. Validate PDF grammar locally with complete privacy, zero costs, and detailed reports showing exactly where your PDF deviates from the specification.
Meet PDFix Desktop + Arlington PDF Model: Specification-Level Validation
- 100% Free & Open Source: Funded validation technology with zero licensing costs
- Specification-Derived: Based directly on ISO 32000-2 (PDF 2.0)
- Comprehensive Coverage: Validates entire PDF Document Object Model
- Grammar Checking: Verifies PDF syntax, object relationships, and data integrity
- Local Processing: Private, secure validation on your machine – no cloud required
- Developer-Friendly: Machine-readable output for automated workflows
- Vendor-Neutral: Independent verification not tied to any PDF software vendor

How It Works: Choose Your Path
For PDFix Desktop Users
Perfect for accessibility professionals, document engineers, and QA teams.
- Download PDFix Desktop
- 🐳 Install Docker Desktop First!
- Open PDFix Desktop and pull the docker via Action Manager (one-time set up) → Validate Arlington PDF Model
- Upload a PDF to PDFix Desktop → Run action
💡 Tip: HTML reports provide human-readable error descriptions, while XML format enables automated processing and integration with custom tools.
For PDFix SDK Users & Developers
Perfect for automated workflows, CI/CD pipelines, and research applications.
- Automated Pipeline Integration
- Integrate Arlington validation into your document processing systems or quality assurance workflows using PDFix SDK.
- Resources for SDK Integration:
- 📦 Docker Hub: https://hub.docker.com/r/pdfix/validate-pdf-arlington
- 💻 GitHub Repository: github.com/pdfix/action-validate-pdf-arlington-docker
- SDK Benefits
- Pre-validation before remediation workflows
- Automated structure checking in CI/CD
- Batch validation of document collections
- Machine-readable XML output for analysis
- Custom validation pipelines
Local Processing Benefits
- 100% Local Processing
- No Cloud Dependencies
- No internet required
- No costs
- Complete privacy
- Fast processing
- Open Source Transparency
The Technology Behind It
- Arlington PDF Model
- A machine-readable definition representing the complete PDF document object model derived directly from ISO 32000-2
- The model uses formal grammar rules and predicates that define valid PDF object structures
- PDFix Integration
- The Dockerized solution integrates Arlington’s validation engine with PDFix’s processing capabilities:
- Object Analysis
- Grammar Validation
- Detailed Reporting
- The Dockerized solution integrates Arlington’s validation engine with PDFix’s processing capabilities:
Resources
- Getting Started Guide: https://pdfix.net/user-guide-external-actions/
- GitHub Repository: https://github.com/pdfix/action-validate-pdf-arlington-docker
- Docker Hub: https://hub.docker.com/r/pdfix/validate-pdf-arlington
- Arlington PDF Model: https://github.com/pdf-association/arlington-pdf-model
Actions
| 🆓 🖥️ [Free][Local] | Arlington PDF Model in HTML | Automatically checks grammar in a PDF using the Arlington PDF Model [Local] |
| 🆓 🖥️ [Free][Local] | Arlington PDF Model in XML | Automatically checks grammar in a PDF using the Arlington PDF Model [Local] |









