DotImage OCR GlyphReader Engine
An add-on to DotImage Document Imaging
GlyphReader is a highly accurate OCR engine built for DotImage. It is available as an add-on module and uses the DotImage OCR interface. In testing with the ISRI OCR Performance Toolkit, it achieved 99.5% accuracy and proved more accurate than other, more expensive industry-leading OCR engines.
Supports the European Character Set
Reports individual character position and size
Reports character confidence
Correctly recognizes rotated pages and reports the rotation angle
Can automatically break merged characters, or merge broken characters
Can optionally reject low-confidence characters
Can optionally reject low-confidence lines
Can disable recognition of specific characters
Full-page color OCR output can be generated when combined with the Searchable PDF Module
The GlyphReader OCR engine offers many options for extracting text from your images. With only a few lines of code, you can access the characters, alternate characters, and location of the text in a selected region of interest:
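private void PerformOcr()
{
    GlyphReaderEngine engine = new GlyphReaderEngine();
    // Recognize only the region of interest (myBounds) within myImage.
    OcrPage page = engine.Recognize(myImage, myBounds);
    // Regions is a collection, so take the first entry; the cast yields null if it is not a text region.
    OcrTextRegion region = page.Regions[0] as OcrTextRegion;
}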
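The character confidence reported by the engine lends itself to simple post-filtering in your own code. The following is a minimal sketch of that idea, not the engine's built-in rejection option: RecognizedChar and ConfidenceFilter are hypothetical stand-ins rather than DotImage types, and the 0.0 to 1.0 confidence scale is an assumption.

```csharp
using System;
using System.Collections.Generic;
using System.Text;

// Hypothetical stand-in for one recognized character; NOT a DotImage type.
public sealed class RecognizedChar
{
    public char Value { get; }

    // Assumed scale: 0.0 (no confidence) to 1.0 (certain).
    public double Confidence { get; }

    public RecognizedChar(char value, double confidence)
    {
        Value = value;
        Confidence = confidence;
    }
}

public static class ConfidenceFilter
{
    // Builds a string from the characters, substituting a placeholder for
    // every character whose confidence falls below the threshold.
    public static string RejectLowConfidence(
        IEnumerable<RecognizedChar> chars, double threshold, char placeholder = '?')
    {
        if (chars == null) throw new ArgumentNullException(nameof(chars));

        var sb = new StringBuilder();
        foreach (RecognizedChar c in chars)
        {
            sb.Append(c.Confidence >= threshold ? c.Value : placeholder);
        }
        return sb.ToString();
    }
}
```

For instance, passing the characters c (0.98), a (0.41), and t (0.95) with a threshold of 0.5 yields "c?t". Substituting a placeholder instead of dropping the character keeps the output aligned with the reported character positions, which makes later manual correction easier.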
For more information about using the GlyphReader engine, see these articles:
Correct Raw OCR Information with Alternate Characters
Prevent OCR on a Page while including it in the Final Document
"Thin client features such as navigation, zooming, and image zoning coupled with OCR have leveraged imaging into a more active part of our abstraction process.
When selecting technology to leverage in our workflow application, we looked at imaging apart from technology, selecting the product that was the best solution and most configurable.
We're a Java shop and committed to that technology, but chose Atalasoft DotImage even though it's a .NET product."
- Joe Aparo, Senior Developer, EBSCO