DotImage OCR GlyphReader Engine
An add-on to DotImage Document Imaging
GlyphReader is a highly accurate OCR engine built for DotImage. It is available as an add-on module and uses the DotImage OCR interface. It has been tested with the ISRI OCR Performance Toolkit and found to be more accurate than other, more expensive industry leading OCR engines, with 99.5% accuracy.
Features
-
Supports the European Character Set
-
Reports individual character position and size
-
Reports character confidence
-
Properly OCR's rotated pages, reporting the rotation angle
-
Can automatically break merged characters, or merge broken characters
-
Can optionally reject low confidence characters
-
Can optionally reject low confidence lines
-
Can disable recognition of specific characters
-
Full Page color OCR can be generated when combined with the Searchable PDF Module
The GlyphReader OCR engine gives many options for extracting the text from your images. Through only a few lines of code, you have access to the characters, alternate characters, and location of the text from a selected region of interest:
private void PerformOcr()
{
GlyphReaderEngine engine = new GlyphReaderEngine();
engine.Initialize();
OcrPage page = engine.Recognize(myImage, myBounds);
OcrTextRegion region = page.Regions[0] as OcrTextRegion;
engine.ShutDown();
engine.Dispose();
}
For more information about using the GlyphReader engine, see these articles:
Correct Raw OCR Information with Alternate Characters
Prevent OCR on a Page while including it in the Final Document
Customer Quote
"Thin client features such as navigation, zooming, and image zoning coupled with OCR have leveaged imaging into a more active part of our abstraction process.
When selecting technology to leveage in our workflow application, we looking at imaging apart from technology, selecting the product that was the best solution and most configurable.
We're a Java shop and committed to that technology, but chose Atalasoft DotImage even though it's a .NET product."
- Joe Aparo, Senior Developer, EBSCO