Atalasoft's Document Transformation Engines
Today's digital document libraries need to be searchable and workers need to be able to index and pull data from within these documents. This can be a very slow and expensive process compared to an automated computer application. Our toolkit allows OCR engines to be implemented by extending our base OcrEngine class. The Recognize() method is used to start the process. Additionally, we have partnerships with the following OCR Engines:
- GlyphReader OCR Engine
- Abbyy OCR/ICR Engine
- Tesseract OCR Engine
Optical Character Recognition (OCR) is a method by which software "reads" the text characters to preform text recognition from an otherwise flat, scanned image. The resulting text can be placed anywhere programmatically and is necessary in larger document workflows and for discoverability.
Intelligent Character Recognition (ICR) follows the same software concept but is tuned to recognize hand printed rather than computer printed text. To do ICR you need to clearly define the areas that need to be recognized, text should be in block caps only with framing.
OCR is an add-on to our DotImage SDK