GlyphReader OCR Engine.
GlyphReader is highly accurate and very cost-effective. Because you can rely on the quality of the output, you can process more jobs and spend less time correcting mistakes. This accuracy has been developed through years of comprehensive testing, analysis, and improvement.
GlyphReader is a closed-source OCR engine that vectorizes each glyph and then determines every letter it could represent. Its features:
- Supports the European character set
- Reports the position and size of each character
- Reports a confidence value for each character
- Correctly OCRs rotated pages and reports the rotation angle
- Auto-rotates documents to the correct orientation
- Automatically splits merged characters and merges broken characters
- Can disable recognition of specific characters
- Can optionally reject low-confidence characters
- Can optionally reject low-confidence lines
- Generates full-page color OCR when combined with the Searchable PDF Module (included within DotImage)
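The optional rejection of low-confidence characters and lines can be pictured with a short sketch. This is not the GlyphReader or DotImage API: the `Glyph` type, the function names, the 0.0 to 1.0 confidence scale, and the `~` placeholder are all assumptions made for illustration, and using the mean character confidence as a line score is only one plausible choice.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Glyph:
    """Hypothetical per-character OCR result (not an Atalasoft type)."""
    char: str
    confidence: float  # assumed scale: 0.0 (no confidence) to 1.0 (certain)


def reject_low_confidence_chars(line: List[Glyph], min_conf: float,
                                placeholder: str = "~") -> str:
    """Return the line's text, replacing weak characters with a placeholder."""
    return "".join(g.char if g.confidence >= min_conf else placeholder
                   for g in line)


def line_confidence(line: List[Glyph]) -> float:
    """Score a line by its mean character confidence (an assumed metric)."""
    if not line:
        return 0.0
    return sum(g.confidence for g in line) / len(line)


def reject_low_confidence_lines(lines: List[List[Glyph]],
                                min_line_conf: float) -> List[List[Glyph]]:
    """Keep only the lines whose score reaches the threshold."""
    return [line for line in lines if line_confidence(line) >= min_line_conf]


if __name__ == "__main__":
    good = [Glyph("O", 0.9), Glyph("C", 0.3), Glyph("R", 0.9)]
    noisy = [Glyph("x", 0.1), Glyph("y", 0.3)]
    print(reject_low_confidence_chars(good, 0.5))
    print(len(reject_low_confidence_lines([good, noisy], 0.5)))
```

Raising either threshold trades recall for precision: fewer wrong characters reach the output, but more text is flagged for manual review.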
OmniPage OCR Engine.
OmniPage offers industry-leading optical character recognition (OCR) for fast, easy, and accurate document conversion. Instantly turn paper and digital documents into files you can edit, search, and share securely.
- OmniPage Server is the best choice to efficiently manage high-volume document conversion
- Converts millions of documents at a time, with complete scalability
- Built on the industry's most accurate OCR technology
- Comes with an easy-to-use API to shorten development cycles
- Accurately digitizes, converts, and compresses files in your document archives at scale
ABBYY OCR Engine.
ABBYY is a global leader in the development of document recognition, content capture and language-based technologies and solutions that integrate across the entire information lifecycle.
ABBYY offers a fast, closed-source engine for OCR and ICR. Its features:
- Supports 201 languages with a high accuracy rate
- Reports the position and size of each character
- Reports a confidence value for each character
- Can optionally reject low-confidence characters
- Can optionally reject low-confidence lines
- Supports E13B and CMC-7 MICR fonts
- Supports multiple recognition cultures
- Uses parallel processing to improve performance (single document)
- Auto-rotates documents
Tesseract OCR Engine.
Tesseract OCR is included within the DotImage SDK, as is PDFTranslator, our Searchable PDF SDK. PDFTranslator automatically translates an image into a searchable PDF file.
Tesseract is an intelligent, learning, open-source OCR engine with many extended language options, including integrated support for Dutch, English, French, German, Italian, Portuguese, and Spanish. Its features:
- Determines the size and location of characters, words, and lines
- Reports the confidence of each recognized character
- Outputs to text or searchable PDF
- Uses the Tesseract 3 engine
- Supports multiple cultures
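Character, word, and line locations are related: a word's rectangle is the smallest rectangle that covers all of its characters, and a line's rectangle covers all of its words. The sketch below shows that relationship only. The `Box` type and `union_box` function are hypothetical names, not part of Tesseract or DotImage, and the coordinates are assumed to be pixels with the origin at the top left.

```python
from typing import Iterable, NamedTuple


class Box(NamedTuple):
    """Hypothetical axis-aligned rectangle in pixel coordinates."""
    left: int
    top: int
    width: int
    height: int


def union_box(boxes: Iterable[Box]) -> Box:
    """Return the smallest rectangle that encloses every input box."""
    boxes = list(boxes)
    if not boxes:
        raise ValueError("need at least one box")
    left = min(b.left for b in boxes)
    top = min(b.top for b in boxes)
    right = max(b.left + b.width for b in boxes)
    bottom = max(b.top + b.height for b in boxes)
    return Box(left, top, right - left, bottom - top)


if __name__ == "__main__":
    # Three character boxes that make up one word.
    chars = [Box(10, 20, 8, 12), Box(19, 18, 8, 14), Box(28, 20, 8, 12)]
    print(union_box(chars))
```

The same function applied to word boxes yields a line box, which is the kind of rectangle a viewer needs in order to highlight a search hit on the page.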
Creating Searchable PDFs
Atalasoft offers several OCR engines that can be used to OCR documents or as part of a process to create searchable PDFs. To create searchable PDFs using Atalasoft SDKs, you need our DotImage SDK and an OCR SDK. Tesseract OCR is included within the DotImage SDK. If you also need to view, search, and highlight searchable PDFs after they are created (or if you already have existing searchable PDFs), that technology is included within DotImage as well. PDFTranslator (our Searchable PDF SDK) is also included within DotImage. This module automatically translates an image into a searchable PDF file.
We are here to help.
DotImage is backed by a remarkable support team of expert .NET and imaging developers. If you have any questions, please take a look at our technical section or simply send us an e-mail and we will find the answer for you. If you would rather test-drive DotImage, feel free: we offer a 30-day evaluation with full support.