pdf2Data

pdf2Data
About pdf2Data pdf2Data is an add-on for iText7 to recognize data inside PDF documents in an intuitive and predictable manner. It provides a mechanism to extract predefined data fields from the PDF documents based on the same template (for example, an invoice coming from the same supplier). A new release We are proud to inform you about the latest release: pdf2Data 1.1.2. Notable improvements are: Improvements for multipage table recognition. Table header extraction tool near headers text area. Bug fixes for saving templates in editor. All the binaries are uploaded to...
pdf2Data
About pdf2Data pdf2Data is an add-on for iText7 to recognize data inside PDF documents in an intuitive and predictable manner. It provides a mechanism to extract predefined data fields from the PDF documents based on the same template (for example, an invoice coming from the same supplier). A new release We are proud to inform you about the latest release: pdf2Data 1.1.0. Notable improvements are: Enhanced support for multipage tables, which now correctly skips page footers and headers and recognizes repeating table header and footer rows. New selector for images that allows extracting images...
pdf2Data
Pdf2Data is a tool that allows for structured data to be extracted from similarly structured text documents. The way that this is done is through the use of rules to define the location of text that should be extracted or the format of the text. Through the rest of this blog post I’ll describe the basics of how it works and then show how it works with a simple example. To begin, let's look at how the rules are defined to choose the text you would like extracted. You can define the areas with either, the pdf2Data web app, or with Adobe reader's comment feature. Once you have selected the area...