OCR & Searchable PDF

OmniPage OCR

The OmniPage add-on for DotImage delivers industry-leading OCR (Optical Character Recognition) for fast, accurate document conversion. Its intuitive API streamlines development, enabling you to instantly transform paper and digital documents into secure, editable, and searchable files. Whether you are converting a handful of documents or processing millions of pages, OmniPage is ideal for individuals, small businesses, and enterprises alike.

Learn More about OmniPage

GlyphReader OCR Engine

The GlyphReader add-on for DotImage supports the European Character Set and is a highly accurate OCR engine that vectorizes glyphs to evaluate all possible character options. GlyphReader provides detailed reporting on individual character position, size, and confidence, and can automatically split merged characters or combine broken ones. It also allows you to disable recognition of specific characters, reject low-confidence results, and auto-rotate documents to the correct orientation.

DotImage Spec Sheet

Tesseract OCR Engine

The Tesseract OCR add-on for DotImage is an advanced, open-source OCR engine supporting multiple languages, including Dutch, English, French, German, Italian, Portuguese, and Spanish. Tesseract can determine the size and location of characters, words, and lines, and reports confidence levels for each recognized character. The Tesseract3 engine is fully integrated for seamless use.

Sample Applications

Creating Searchable PDFs

Atalasoft OCR engines can be used to create searchable PDFs. To do so, you will need the DotImage SDK, an OCR engine, and the Searchable PDF SDK (PDFTranslator), which converts images into searchable PDF files. For viewing, searching, and highlighting within PDFs, the PDF Reader with Text Extraction SDK is also required

Contact Atalasoft

Atalasoft Solutions

Atalasoft offers advanced HTML5 viewing technologies and image processing solutions for .NET professionals. Our offerings include PDF and OCR capabilities, barcode reading and writing, mobile capture, as well as TWAIN and web-based scanning. To learn more, please visit the DotImage Product page.

Visit Solutions Page

OCR & Document Recognition Add-ons

Atalasoft provides OCR SDKs that can be integrated into your desktop or web applications for manual or automated batch processing of images. Our industry proven document transformation engines are add-ons to the DotImage SDK and can save countless hours and significantly improve accuracy.

OmniPage OCR

GlyphReader OCR Engine

Tesseract OCR Engine

Creating Searchable PDFs

Atalasoft Solutions