PDFix Release Notes

Version 4.1.0 (October 3rd 2018)

New Features:

  • PdsObject API – access to PDF low level objects
  • PdsStructTree API – access to PDF document structure tree (Tags)
  • PDF to XML conversion

Updates:

  • PDFix SDK Core
    • PdfDoc::GetInfo returns empty string if Info dict not present
    • PdfPage::GetMediaBox returns bounding box 0,0,612,792 if not present
    • PdeContentWriter
      • content stream optimization for TJ and Tj operators
      • type3 font support
      • inline image support
    • PdfFlattenAnnotsParams – added flat_annot member to select annotation type to flatten
    • PdfDocTemplate
      • added kTrDetectRotation (automatic detection of page rotation flag)
      • added kMinCharClipRatio (clipping area intersection threshold)
      • added kTrMaxLineSpacing (maximum line spacing in paragraph threshold)
      • removed kTemplateFlatAnnots (use flat_annot in PdfFlattenAnnotsParams instead)
      • removed kTrLineSpacingTextSplit (use kTrMaxLineSpacing instead)
      • removed kTrLineSpacing (use kTrMaxLineSpacing instead)
      • removed kTemplateAcceptTags
    • PdfPageMap
      • applied kTextFlagStrikeout flag on text when Cross-Out annotation is in place
      • text outside of clip area treated as artifact
      • split filling words into separate PdeTexts
    • AddTags
      • set generic Form Field tooltip when missing TU key
      • extra spaces in text handling
    • Metadata – support of  in dc:title update
    • ErrorCode – renamed kErrorPdfCosObjInvalid to kErrorPdsObjectInvalid
    • Xref stream support when saving files
  • PDF to HTML
    • text decoration under text markup annotations (highlight, underline, cross-out, squiggly)
    • css & html validation fixes
    • removed css white-space: -pre-wrap, -o-pre-wrap, -moz-pre-wrap as unsupported properties
    • fixed selector .pdf-page h1 h2 h3 h4 to .pdf-page h1, .pdf-page h2, … so the style applies inside .pdf-page only; same for .pdf-page.responsive table h1 .. h4
    • fixed <input> maxLength property – missing quotes
    • changed <input> defaultValue property to data-default-value
    • PdfToHtmlDoc::SaveDocHtml, SavePageHtml memory allocation fix
    • removed unwanted “)” from HTML output
    • textarea CSS – scrollbar was missing in case of long text
  • Command Line Interface
    • PDF to HTML – added options: -page_width, -image_format, -image_quality
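
The PdfPage::GetMediaBox change above (falling back to 0,0,612,792 when the page has no MediaBox) can be illustrated with a minimal sketch. This is not the SDK's implementation: Rect, kDefaultMediaBox and EffectiveMediaBox are hypothetical names used only for illustration. The default box corresponds to US Letter, 8.5 x 11 inches at 72 points per inch.

```cpp
#include <optional>

// Hypothetical stand-in for a page rectangle in PDF user-space units (points).
struct Rect {
    double left, bottom, right, top;
};

// US Letter at 72 points per inch: 8.5 in x 11 in = 612 x 792 points.
constexpr Rect kDefaultMediaBox{0.0, 0.0, 612.0, 792.0};

// Returns the parsed MediaBox if the page had one, otherwise the default box.
// `parsed` models a /MediaBox entry that may be missing from the page dictionary.
inline Rect EffectiveMediaBox(const std::optional<Rect>& parsed) {
    return parsed.value_or(kDefaultMediaBox);
}
```

A caller that relies on this behavior no longer has to special-case pages without a MediaBox, although it cannot tell a real Letter-sized page from the fallback by the returned box alone.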

Version 4.0.5 (August 20th 2018)

Updates:

  • Bug fixes in PDF Auto Tag conversion:
    • Text tagging fix
    • Text content writer fix

Version 3.2.5 (August 1st 2018)

Updates:

  • Bug fixes in PDF to HTML conversion
    • removed css properties white-space: -pre-wrap, -o-pre-wrap, -moz-pre-wrap 
    • fixed CSS selector .pdf-page h1 h2 h3 h4 to .pdf-page h1, .pdf-page h2, … so the style is not applied outside .pdf-page (same for .pdf-page.responsive table h1 .. h4)
    • fixed missing quotes in maxLength property in input element
    • changed defaultValue property to data-default-value in input element

Version 4.0.1 (June 20th 2018)

Updates:

  • Updated C# wrapper
  • Stability improvements
  • New sample on Github for C#
  • OcrTesseract::SetLanguage parameters update
  • Font subsetting update

Bug fixes:

  • Table recognition fix

Version 3.2.4 (May 18th 2018)

Updates:

  • PdfFormField::GetFontName returns user friendly font name
  • Bug fixes in PDF to HTML conversion
    • form field font style support (font family, size)
    • form field NoScroll flag support for textarea
    • form field MaxLen support

Version 3.2.3 (May 8th 2018)

Updates:

  • Thread-safe support for JNI interface

Version 4.0.0 (May 7th 2018)

New Features:

  • Make PDF Accessible – allows you to convert PDF to PDF/UA
  • OCR module – allows you to convert scanned PDF into searchable PDF
  • EmbedFonts – allows you to embed/subset fonts in PDF document

Updates:

  • Github repository samples
  • Online documentation
  • Trial key authorization update
  • PdfixPlugin::GetID was removed
  • PsImage::Save renamed to SaveToStream, new PsImage::Save()
  • PsMetadata object
  • PdfDoc::SetLang, PdfDoc::GetLang
  • Read/write document metadata (title, author, creator, description) and sync. with Info dict
  • PdfDoc::GetMetadata
  • PdsStructElement renamed to PdfStructElement

Bug fixes:

  • PdfToHtml – Link annotations with URI action in a fixed view
  • File size issue after saving file
  • Incremental save fix
  • Digital signature fix
  • Content writer – double, float value format – no exponent
  • PdePageMap android crash fix
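
The content writer fix above concerns how real numbers are written: PDF syntax does not allow exponential notation, so a value such as 0.00001 must not be emitted as 1e-05. A minimal sketch of exponent-free formatting follows. FormatPdfReal is a hypothetical helper, not part of the PDFix API, and the 6-digit precision is an assumption chosen for illustration; the input is assumed to be finite.

```cpp
#include <cstdio>
#include <string>

// Formats a finite real number in fixed-point form, never with an exponent.
// Hypothetical helper; the 6 fractional digits are an assumed precision.
std::string FormatPdfReal(double value) {
    char buf[512];  // large enough for any finite double printed with "%.6f"
    std::snprintf(buf, sizeof(buf), "%.6f", value);
    std::string s(buf);
    // Trim trailing zeros of the fraction: "12.500000" -> "12.5".
    s.erase(s.find_last_not_of('0') + 1);
    // Drop a dangling decimal point: "100." -> "100".
    if (!s.empty() && s.back() == '.') s.pop_back();
    // Values that round to zero from below would otherwise print as "-0".
    if (s == "-0") s = "0";
    return s;
}
```

Because "%.6f" always prints a decimal point, the trimming step only ever removes zeros from the fractional part, so integers such as 100 keep their significant zeros.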

Version 3.2.2 (February 22nd 2018)

Updates:

  • Command line application now supports relative path names for input and output files
  • Ability to reset previously modified Document templates
  • Content detection enhancements
    • bullet detection
    • rotated text handling
  • Bug fixes in PDF to HTML conversion
    • table column alignment
    • duplicate form field elements in fixed layout
    • incorrect url data format in embedded image streams

Version 3.2.0 (February 8th 2018)

New features

  • New PDF to HTML conversions
    • Support for JPEG image format and image quality
    • Simplified CSS in the output

Updates

  • Improved content extraction algorithms
    • Tables, charts, bullets, lists, TOC, headers/footers recognition update
    • Reading order detection improvements
  • Customizable document templates
    • Ability to configure content extraction and Autotag process for a specific document type or document set (please contact support for more information)
  • General bug fixes

Version 3.0.0 (October 3rd 2017)

New features

  • Autotag – Add tags to PDF automatically
    • Make PDF accessible
    • PDF/UA compliant

Updates

  • New PDF to HTML conversions
    • PDF conversion to embeddable <div> html element
    • Embedded resources
  • Streams support
  • Improved content extraction algorithms
    • Table detection
    • Paragraph recognition
    • List detection
    • Table of contents detection
  • Customizable document templates

Version 2.0.0 (February 16th 2017)

  • PDF to HTML conversion SDK
  • Fixed or Responsive layout output
  • Multiple HTML files output or one continuous HTML page
  • TrueType Font Extraction
  • Text size output control
  • AcroForm support

Version 2.0.0 (February 16th 2017)

  • More accurate elements detection
  • Text Tables detection improvement
  • Colspan and Rowspan bug fixes
  • Text Paragraphs detection improvement
  • Bullets recognition
  • Repeated pattern detection (patterns of characters, such as a series of dots or dashes, between a tab and the following text)
  • Background image separation
  • Drop Cap text detection
  • Form Field flattening
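
The repeated pattern detection listed above (leader dots or dashes between an entry and the text that follows) can be sketched on plain strings. This is a simplification under stated assumptions: the real detection works on positioned page content, not on strings, and FindLeaderRun, LeaderRun, the candidate character set and the minimum run length are all hypothetical choices for this example.

```cpp
#include <cstddef>
#include <optional>
#include <string_view>

// Position and size of a run of identical leader characters in a line.
struct LeaderRun {
    std::size_t start;   // index of the first repeated character
    std::size_t length;  // number of consecutive repeats
};

// Finds the first run of at least `min_length` identical leader characters.
// The candidate set (".", "-", "_") and the default threshold are assumptions.
std::optional<LeaderRun> FindLeaderRun(std::string_view line,
                                       std::size_t min_length = 4) {
    constexpr std::string_view kLeaderChars = ".-_";
    std::size_t i = 0;
    while (i < line.size()) {
        if (kLeaderChars.find(line[i]) == std::string_view::npos) {
            ++i;  // not a leader candidate, keep scanning
            continue;
        }
        std::size_t j = i;
        while (j < line.size() && line[j] == line[i]) ++j;  // extend the run
        if (j - i >= min_length) return LeaderRun{i, j - i};
        i = j;  // run too short (e.g. an ellipsis), skip past it
    }
    return std::nullopt;
}
```

With a minimum length of 4, ordinary punctuation such as "e.g." or a three-dot ellipsis is not reported, while a table-of-contents line like "Chapter 1 ........ 12" yields one run covering the dots.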

Version 1.0.2 (June 24th 2016)

  • Text tables detection
  • Lists and Table of contents detection
  • Image intersection improvement
  • Words detection improvement
  • Elements clipping improvement
  • Reading order detection improvement
  • Header/Footer detection

Version 1.0.1 (May 20th 2016)

  • Discrete elements detection
  • Text tables detection (output as an image)
  • Table cell background color detection
  • Words detection improvement
  • Headers/Footers detection
  • XForm object handling

Version 1.0.0 (May 1st 2016)

  • Initial release