
ABBYY is introducing a brand new optical character recognition (OCR) API to allow builders to extract information from unstructured paperwork.
“As a vanguard of OCR, ABBYY has lengthy had a vibrant neighborhood of cutting-edge builders creating transformational options with our superior doc AI,” stated Nick Hyatt, vp of Engineering R&D at ABBYY. “ABBYY Doc AI API is a serious step ahead for creating automated doc workflows.”
The ABBYY Doc AI API—at the moment in technical preview—will permit builders to remodel unstructured information into structured JSON in just some strains of code. It consists of SDKs for Python, C#, JavaScript, and Java.
Some examples of paperwork that information may be transformed from embody invoices, receipts, and tax types.
Throughout this technical preview, the OCR fashions are solely accessible as pre-trained fashions, with no choices for customized coaching or fine-tuning but. The API will likely be free to make use of through the preview, however there’s a processing quantity restrict of 1000 pages.
It at the moment helps OCR in English, German, French, Spanish, Dutch, Japanese, and each conventional and simplified Chinese language. For handwriting recognition, or ICR, it helps English, German, French, Spanish, and Japanese.
Builders can be part of the technical preview right here.