Generate Formula MathML (Paddle)

Transform inaccessible mathematical formulas into screen reader-compatible MathML – 100% Free, 100% Local

PDFix now integrates with Paddle’s open-source technology to automatically convert mathematical formulas into accessible MathML (Mathematical Markup Language). This free, locally deployed solution supports PDF/UA compliance by ensuring that mathematical expressions are not just visible images but structured, navigable content for assistive technologies.

Meet PDFix Desktop + Paddle

  • 100% Free: Completely free formula recognition powered by PaddleOCR
  • Private & Secure: All processing happens locally, no data sent to external servers
  • Advanced Recognition: Recognizes printed formulas and complex equations
  • Automated MathML Generation: Converts formula images into LaTeX and MathML markup
  • PDF/UA Compliance: Attaches MathML as associated files to Formula tags
  • Batch Processing: Process entire folders of technical documents with identical workflows

Why MathML Matters for Accessibility

MathML was created by the World Wide Web Consortium (W3C) as a universal format for mathematical expressions, and it is supported by many assistive technologies, including screen readers. MathML stores equations as structured text that can be reformatted and searched. Most importantly, it lets blind users navigate through mathematical expressions interactively.
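To make "structured text" concrete, here is a minimal sketch using only the Python standard library. The sample MathML is hand-written for the LaTeX expression \frac{a+b}{2} and is not output from PDFix or PaddleOCR; the helper names `outline` and `find_identifiers` are hypothetical and exist only for this illustration.

```python
import xml.etree.ElementTree as ET

MATHML_NS = "http://www.w3.org/1998/Math/MathML"

# Hand-written MathML for the LaTeX expression \frac{a+b}{2} (illustrative only).
SAMPLE = """
<math xmlns="http://www.w3.org/1998/Math/MathML">
  <mfrac>
    <mrow><mi>a</mi><mo>+</mo><mi>b</mi></mrow>
    <mn>2</mn>
  </mfrac>
</math>
"""


def outline(element, depth=0):
    """Return one indented line per MathML element, in document order."""
    tag = element.tag.replace("{" + MATHML_NS + "}", "")
    text = (element.text or "").strip()
    lines = ["  " * depth + tag + (f": {text}" if text else "")]
    for child in element:
        lines.extend(outline(child, depth + 1))
    return lines


def find_identifiers(root):
    """Search the expression for identifier (<mi>) elements."""
    return [mi.text for mi in root.iter("{" + MATHML_NS + "}mi")]


root = ET.fromstring(SAMPLE.strip())
print("\n".join(outline(root)))   # numerator and denominator appear as separate branches
print(find_identifiers(root))
```

Because the fraction, its numerator, and its denominator are separate elements, a screen reader can step into each part, which a flat image of the same formula does not allow.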

How It Works: Choose Your Path

For PDFix Desktop Users

Perfect for accessibility professionals, document specialists, and teams who prefer visual workflows.

For PDFix SDK Users

Perfect for software developers, enterprises with custom workflows, and organizations needing programmatic integration.

How to Get Started with Paddle

🐳 Don’t Forget to Install Docker Desktop First!

No API keys, no cloud accounts – just install Docker and you’re ready to process formulas locally:

  1. Install Docker Desktop
    • Download from docker.com
    • Follow the installation guide for your operating system
    • Start Docker Desktop
  2. Open PDFix Desktop: Navigate to Action Manager
  3. Pull the Paddle Container: Search for the Formula MathML (Paddle) action and click to pull the container (one-time setup)
  4. Start Processing
    • Open your PDF containing mathematical formulas
    • Select External Actions → Generate Formula MathML (Paddle)
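Before pulling the container, it can help to confirm that Docker is installed and its engine is running. The check below is a small sketch, not part of PDFix: `docker_status` is a hypothetical helper, and it assumes that `docker info` exits with a non-zero status when the Docker engine cannot be reached.

```python
import shutil
import subprocess


def docker_status(timeout=15):
    """Return 'missing', 'not running', or 'ready' for the local Docker setup."""
    # Step 1: is the docker command-line client on the PATH at all?
    if shutil.which("docker") is None:
        return "missing"
    # Step 2: can the client reach the engine? (Assumption: non-zero exit if not.)
    try:
        result = subprocess.run(
            ["docker", "info"],
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
            timeout=timeout,
        )
    except (subprocess.TimeoutExpired, OSError):
        return "not running"
    return "ready" if result.returncode == 0 else "not running"


print(docker_status())
```

A result of "missing" points back to step 1 (install Docker Desktop), while "not running" usually means Docker Desktop is installed but has not been started.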

🖥️ Local & Free → PaddleOCR Engine

  • No internet required after initial model download
  • Completely free
  • Complete privacy
  • Unlimited processing
  • Fast inference – optimized for local hardware
  • Open source

The Technology Behind It

  • PaddleOCR Formula Recognition
    • PaddleOCR’s models analyze formula images and generate LaTeX representations, which are then converted into MathML markup. PaddleOCR supports 109 languages and excels at recognizing complex elements such as mathematical formulas.
  • PDFix SDK Integration
    • The Dockerized solution seamlessly connects Paddle’s open-source OCR engine with PDFix’s remediation capabilities, creating:
      • Formula Recognition
      • MathML Conversion
      • Associated File Embedding
      • Batch Automation
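The batch automation idea can be sketched as a loop that applies one identical step to every PDF in a folder. This is an illustration under stated assumptions, not the PDFix SDK API: `collect_pdfs` and `run_batch` are hypothetical names, and `process_pdf` is a caller-supplied stand-in for whatever actually runs the formula action on a single file.

```python
from pathlib import Path


def collect_pdfs(folder):
    """Return every PDF under `folder` (recursively), sorted for a stable order."""
    root = Path(folder)
    return sorted(
        p for p in root.rglob("*")
        if p.is_file() and p.suffix.lower() == ".pdf"
    )


def run_batch(folder, process_pdf):
    """Apply the same callable to each PDF and record 'ok' or the failure reason."""
    root = Path(folder)
    results = {}
    for pdf in collect_pdfs(root):
        key = pdf.relative_to(root).as_posix()
        try:
            process_pdf(pdf)  # hypothetical per-file step supplied by the caller
            results[key] = "ok"
        except Exception as exc:
            # Keep going so one problematic document does not stop the batch.
            results[key] = f"failed: {exc}"
    return results
```

Calling `run_batch("papers", my_step)` returns a dictionary keyed by relative path, which makes it easy to list the documents that need a second look after a long run.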

Real-World Applications

  • Academic Publishing
  • Educational Materials
  • Technical Documentation
  • Corporate
  • Government & Healthcare

Resources

Actions

🆓 🖥️ [Free] [Local] Set Formula MathML (Paddle): Automatically generates MathML from an image file using Paddle, saving it as an XML file
🆓 🖥️ [Free] [Local] Set Formula MathML (Paddle): Automatically generates MathML for all Formula tags using Paddle, attaching it as an associated file to each tag