
Atalasoft Knowledge Base

INFO: OcrEngine - Overview

Administrator
DotImage

OCR Engine

With the OCR engine, a developer can recognize an image and write the recognized result to a file, or enumerate its lines, words, and characters together with their confidence values.

Data sources for the engine can be scanned images or files. The engine output consists of either a file or a class hierarchy. This model is illustrated below.
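To make the "class hierarchy" form of output more concrete, here is a minimal conceptual sketch in Python. It is not the DotImage API: the names `Page`, `Line`, `Word`, `Glyph`, and `iter_words`, the 0.0 to 1.0 confidence range, and the rule that a word takes the confidence of its weakest character are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import Iterator, List, Tuple


@dataclass
class Glyph:
    """One recognized character (hypothetical type, not a DotImage class)."""
    char: str
    confidence: float  # assumed range: 0.0 (no confidence) to 1.0 (certain)


@dataclass
class Word:
    glyphs: List[Glyph] = field(default_factory=list)

    @property
    def text(self) -> str:
        return "".join(g.char for g in self.glyphs)

    @property
    def confidence(self) -> float:
        # Assumption: a word is only as reliable as its weakest character.
        return min((g.confidence for g in self.glyphs), default=0.0)


@dataclass
class Line:
    words: List[Word] = field(default_factory=list)


@dataclass
class Page:
    lines: List[Line] = field(default_factory=list)


def iter_words(page: Page) -> Iterator[Tuple[int, str, float]]:
    """Walk the hierarchy and yield (line index, word text, word confidence)."""
    for line_no, line in enumerate(page.lines):
        for word in line.words:
            yield line_no, word.text, word.confidence


def make_word(text: str, confidences: List[float]) -> Word:
    """Helper for building sample data from a string and per-character scores."""
    return Word([Glyph(c, p) for c, p in zip(text, confidences)])


# Hand-built sample result standing in for real engine output.
sample_page = Page([
    Line([
        make_word("OCR", [0.99, 0.97, 0.98]),
        make_word("text", [0.95, 0.60, 0.90, 0.92]),
    ]),
])

for line_no, text, confidence in iter_words(sample_page):
    print(line_no, text, confidence)
```

Walking a structure like this lets calling code flag low-confidence words for manual review instead of trusting a flat text file.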

Because the OcrEngine object is abstract, you cannot create an instance of it directly. Nevertheless, its definition contains most of the functionality a concrete subclass needs, so a subclass can work with a minimum of extra code.
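The abstract-base pattern described here can be sketched generically in Python. This is an illustration of the design idea only, not DotImage code: `BaseOcrEngine`, `EchoEngine`, `translate`, `preprocess`, and `recognize_lines` are hypothetical names, and the toy "recognition" simply decodes bytes as UTF-8.

```python
import io
from abc import ABC, abstractmethod
from typing import List, TextIO


class BaseOcrEngine(ABC):
    """Hypothetical abstract engine: the shared workflow lives in the base class."""

    def translate(self, image: bytes, output: TextIO) -> None:
        # Shared pipeline, written once and inherited by every concrete engine.
        prepared = self.preprocess(image)
        lines = self.recognize_lines(prepared)
        output.write("\n".join(lines))

    def preprocess(self, image: bytes) -> bytes:
        # Default preprocessing is a no-op; subclasses may override it.
        return image

    @abstractmethod
    def recognize_lines(self, image: bytes) -> List[str]:
        """The only piece a concrete engine is required to supply."""


class EchoEngine(BaseOcrEngine):
    """Toy concrete engine: treats the 'image' bytes as UTF-8 text."""

    def recognize_lines(self, image: bytes) -> List[str]:
        return image.decode("utf-8").splitlines()


# The abstract base cannot be instantiated; Python raises TypeError.
try:
    BaseOcrEngine()
    base_is_instantiable = True
except TypeError:
    base_is_instantiable = False

# The concrete subclass inherits the whole pipeline.
buffer = io.StringIO()
EchoEngine().translate(b"first line\nsecond line", buffer)
print(buffer.getvalue())
```

The subclass supplies a single method, which mirrors the point that a concrete engine needs only a minimum of extra code.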

The OcrEngine object has five primary components as illustrated below:

  • Preprocessing options
  • Document translators
  • Page element factory
  • Font mapping
  • Font building

 

See Also

GlyphReader Engine
Tesseract Engine (Retired in 11.1)
Tesseract3Engine
RecoStar Engine (Retired)

Original Article:
Q10364 - INFO: OcrEngine - Overview

Details
Last Modified: Last Year
Last Modified By: Administrator
Type: INFO