Upload the PDF of an invoice. Check whether the PDF contains selectable text and an embedded XML file that complies with the standardized Factur-X schema. Validate that the PDF conforms to PDF/A-3b. If the PDF contains only images, typically photocopies, recover the text of the invoice by OCR. Check the mandatory information on the invoice and its content (dates, lines, totals). Verify that the content of the XML matches the content of the invoice. Analyze the conformity of the invoice with regulatory texts. Generate the factur-x.xml file of an invoice. Convert the PDF and the XML into a PDF/A-3b file.
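As an illustration of the check on lines and totals, here is a minimal sketch in Python. It is not the service's implementation: the function name `check_totals`, the tuple layout of a line (quantity, unit price, line total) and the rounding to two decimals are all assumptions made for the example.

```python
from decimal import Decimal, ROUND_HALF_UP

CENT = Decimal("0.01")

def check_totals(lines, tax_total, grand_total):
    """Return a list of problems found; an empty list means the amounts agree.

    `lines` is a list of (quantity, unit_price, line_total) strings.
    This layout is hypothetical, chosen only for the example.
    """
    problems = []
    net = Decimal("0")
    for number, (qty, unit_price, line_total) in enumerate(lines, start=1):
        # Recompute the line amount and compare it with the stated one.
        expected = (Decimal(qty) * Decimal(unit_price)).quantize(CENT, ROUND_HALF_UP)
        if expected != Decimal(line_total):
            problems.append(f"line {number}: expected {expected}, found {line_total}")
        net += Decimal(line_total)
    # The grand total must equal the sum of the lines plus the tax.
    expected_total = net + Decimal(tax_total)
    if expected_total != Decimal(grand_total):
        problems.append(f"total: expected {expected_total}, found {grand_total}")
    return problems
```

Amounts are handled as `Decimal` values rather than floats so that comparisons to the cent are exact.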
Factur-X is a file format for exchanging invoices, suitable for all types of organizations. It consists of a human-readable PDF file and a structured data file (XML) embedded in it. Factur-X complies with the European Semantic Standard EN 16931, published by the European Commission on October 16, 2017.
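To give an idea of what the structured data looks like, the sketch below reads a few header fields from a Factur-X XML file with Python's standard library. The `SAMPLE` document is a trimmed, hand-written fragment that is not schema-valid (a real factur-x.xml carries more mandatory elements), and the function name `read_header` and its output keys are assumptions made for the example.

```python
import xml.etree.ElementTree as ET

# Namespaces of the UN/CEFACT Cross Industry Invoice syntax used by Factur-X.
NS = {
    "rsm": "urn:un:unece:uncefact:data:standard:CrossIndustryInvoice:100",
    "ram": "urn:un:unece:uncefact:data:standard:ReusableAggregateBusinessInformationEntity:100",
    "udt": "urn:un:unece:uncefact:data:standard:UnqualifiedDataType:100",
}

# Illustrative fragment only: invented values, many mandatory elements omitted.
SAMPLE = f"""<rsm:CrossIndustryInvoice
    xmlns:rsm="{NS['rsm']}" xmlns:ram="{NS['ram']}" xmlns:udt="{NS['udt']}">
  <rsm:ExchangedDocument>
    <ram:ID>INV-001</ram:ID>
    <ram:TypeCode>380</ram:TypeCode>
    <ram:IssueDateTime>
      <udt:DateTimeString format="102">20240115</udt:DateTimeString>
    </ram:IssueDateTime>
  </rsm:ExchangedDocument>
  <rsm:SupplyChainTradeTransaction>
    <ram:ApplicableHeaderTradeSettlement>
      <ram:InvoiceCurrencyCode>EUR</ram:InvoiceCurrencyCode>
      <ram:SpecifiedTradeSettlementHeaderMonetarySummation>
        <ram:GrandTotalAmount>30.60</ram:GrandTotalAmount>
      </ram:SpecifiedTradeSettlementHeaderMonetarySummation>
    </ram:ApplicableHeaderTradeSettlement>
  </rsm:SupplyChainTradeTransaction>
</rsm:CrossIndustryInvoice>"""

def read_header(xml_text):
    """Return a dict of header fields; a missing element yields None."""
    root = ET.fromstring(xml_text)

    def text(path):
        node = root.find(path, NS)
        return node.text.strip() if node is not None and node.text else None

    settlement = "rsm:SupplyChainTradeTransaction/ram:ApplicableHeaderTradeSettlement"
    return {
        "number": text("rsm:ExchangedDocument/ram:ID"),
        "issue_date": text("rsm:ExchangedDocument/ram:IssueDateTime/udt:DateTimeString"),
        "currency": text(f"{settlement}/ram:InvoiceCurrencyCode"),
        "grand_total": text(
            f"{settlement}/ram:SpecifiedTradeSettlementHeaderMonetarySummation/ram:GrandTotalAmount"
        ),
    }
```

The values returned this way can then be compared with those read from the visible text of the PDF.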
PDF/A is an ISO-standardized version of the PDF format designed for the archiving and long-term preservation of electronic documents.
OpenAI is a leading research organization focused on advancing safe and beneficial artificial general intelligence.
Tesseract is an open-source optical character recognition engine supported by Google.
The veraPDF consortium, led by the Open Preservation Foundation and the PDF Association, was created in response to the EU Commission's PREFORMA challenge to develop an open-source validator for the PDF/A format.
Ghostscript is a suite of software for processing PostScript and PDF files.
Poppler provides a set of commands for extracting the pages, the text and the images of PDF files.
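As a rough sketch of how such a command can be used to decide whether a PDF needs OCR, the Python code below calls Poppler's `pdftotext` and applies a simple heuristic. The function names and the 20-character threshold are assumptions made for the example, not the service's actual rule, and `extract_text` requires Poppler to be installed and on the PATH.

```python
import subprocess

def extract_text(pdf_path):
    """Return the text layer of a PDF using Poppler's pdftotext.

    The trailing "-" asks pdftotext to write to standard output.
    """
    result = subprocess.run(
        ["pdftotext", "-layout", pdf_path, "-"],
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout

def needs_ocr(text, min_chars=20):
    """Heuristic (an assumption): very little visible text suggests a scanned PDF."""
    # Drop all whitespace, including the form feeds pdftotext puts between pages.
    visible = "".join(text.split())
    return len(visible) < min_chars
```

A typical use would be `needs_ocr(extract_text("invoice.pdf"))`, where `invoice.pdf` is a placeholder file name. When the result is true, the pages would be rendered as images and passed to an OCR engine such as Tesseract.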
All features are available through the interface of your personal space or programmatically through a simple REST API. See the User's Guide.
All communications are encrypted.
The files you upload or download are inaccessible to others.








