Category: Blog

Scrape the PDF Data easily with the PDFix SDK

Scrape the PDF Data easily with the PDFix SDK


Have you ever tried to get any data from various PDF files? Then you know how panful it is. We have created an algorithm that allows you to extract data in an easily readable structured way. With PDFix we can recognize all logical structures and we can give you…

READ MORE >>
PDFix SDK Version 4.1.0

PDFix SDK Version 4.1.0


Version 4.1.0 (October 2nd 2018) – New Features: PdsObject API – access to PDF low level objects, PdsStructTree API – access to PDF document structure tree (Tags), Updates: PdfDoc::GetInfo returns empty string if Info dict not present, PdfPage::GetMediaBox returns bounding box 0,0,612,792 if not present, …

READ MORE >>
Attending ODSC Europe 2018 in London

Attending ODSC Europe 2018 in London


“Open Data Science Conference (OSDC) has become a must-go event in Data Science and Machine Learning, as it continuously strived to deliver a smooth event, while improving the content of its programs, and quality of its speakers to deliver value to a diverse set of attendees.”

READ MORE >>
PDFix SDK Version 4.0.1

PDFix SDK Version 4.0.5


Version 4.0.5 (August 20th 2018). Updates: Bug fixes in PDF Auto Tag conversion: Text tagging fix, Text content writer fix. Download PDFix 4.0.5: Windows, Mac, Linux, Android, iOS

READ MORE >>
PDFix SDK Version 3.2.2

PDFix SDK Version 3.2.5


Version 3.2.5 (August 1st 2018). Updates: Bug fixes in PDF to HTML conversion, removed css properties white-space: -pre-wrap, -o-pre-wrap, -moz-pre-wrap, fixed css .pdf-page h1 h2 h3 h4 to .pdf-page h1, ,pdf-page h2, … not to apply style outside .pdf-page (same with .pdf-page.responsive table h1 .. h4), fixed missing quotes in maxLength property in input element

READ MORE >>
PDF Form filling directly in a Web Browser?

PDF Form filling directly in a Web Browser?


PDF document is the final, fixed output format. And so, the forms in PDF are perfectly arranged and uniform after each form completion. But is it always pleasant filling out the AcroFom in standard PDF Readers?

READ MORE >>
PDFix SDK Version 4.0.1

PDFix SDK Version 4.0.1


Version 4.0.1 (June 20th 2018), Updates: Updated c# wrapper, Stability improvements, New sample on Github for c# , OcrTesseract::SetLanguage parameters update, Font subsetting update. Bug fixes: Table recognition fix.

READ MORE >>
Responsive PDF? Here´s how to!

Responsive PDF? Here´s how to!


Many developers and users are asking the question: “How to make a PDF document responsive?” Responsive layout and text reflow have been a matter of course for a couple of last years. It´s very pleasant to have adapted view of the content on each device. And what about the case of PDF document?

READ MORE >>
GDPR and personal data in PDF. How to access and process them?

GDPR and personal data in PDF. How to access and process them?


The GDPR – European General Data Protection Regulation is applicable as of May 25th, 2018. Does this privacy and protection include the coverage of PDF files that also contain the personal data?

READ MORE >>