GdPicture Tesseract Plugin
GdPicture-ORPALIS
Version: 1.1.6
Based on Google's open source Tesseract OCR, the GdPicture Tesseract Plugin adds OCR features to GdPicture Toolkits, such as text recognition on a specific area of an image and the ability to create searchable PDF/A files (PDF-OCR) from scanned documents, images or existing PDF documents.
GdPicture Tesseract Plugin supports many languages (see below) and can process more than 90 document formats.
Main Features
Unicode Support.
Character recognition confidence.
Retrieve character location.
Output text.
Support for PDF/A OCR generation (PDF Image + hidden searchable text).
Multiple languages: English, French, Italian, German, Spanish, Brazilian Portuguese, Vietnamese, Polish and Dutch.
Can recognize only digits, only alpha or only "white listed" characters.
Fast area processing.
Document orientation detection.
Easy to use.
Fast, accurate & bug free.
Royalty-free licensing: no distribution license required for server or desktop.