Atalasoft DotImage

OCR Searchable PDF SDK for .NET

Add-on to DotImage for creating text-only or text-underneath image PDFs

A great addition to any capture solution to create searchable PDFs with your OCR workflow. Create text-only or text-underneath PDF documents. Learn how this product can fit in to your solution in the article "How to Make PDF Your Imaging Format"

The DotImage OCR Searchable PDF module has runtime royalty-free distribution for desktop applications and requires DotImage Document Imaging edition and the OCR with Tesseract or Glyphreader engine add-ons.

The DotImage OCR Searchable PDF Module can be used with DotImage OCR module and any associated engines to generate Adobe PDF documents containing recognized characters from the OCR recognition process.

Feature List

  • Create text only or text underneath image PDF's
  • Supports JBIG2 image compression with a licensed JBIG2 codec
  • Generates high quality thumbnails from the original images
  • Will generate PDF v1.4 or v1.5 when using JBIG2 or JPEG2000 compression
  • Can be used with any OCR engine using the DotImage OCR Module.
  • Create segmented documents (ie, separate text and image blocks)
  • Can automatically rotate pages that are reported to be rotated by the OCR engine.
  • Embedded TrueType font support
  • Supports ICC Color profiles
  • Document metadata support, including Title, Subject, Author, Creation Date and Keywords
  • Full support for color text
  • Compressed page content streams
 

How-to


The PdfTranslator can be used with any of the DotImage OCR engines to create searchable PDFs:

GlyphReaderEngine engine = new GlyphReaderEngine();
engine.Initialize();
engine.Translate(myImageSource,
"application/pdf", @"C:\output.txt");
engine.ShutDown();
engine.Dispose();

These articles can help you further customize your application to give you exactly the results you are looking for in a searchable Pdf:
Perform an OCR "Translate" on a single AtalaImage
Prevent OCR on a Page while including it in the Final Document
Generate a searchable PDF from a multipage TIFF with GlyphReader OCR
 


Customer Quote

"The Graduate School Office feels very strongly that this system has gone beyond being merely useful and has entered the realm of being necessary. It has allowed us a safe and long-lasting method for archiving student and other records which may need to be accessed in a timely manner."

- Sandra Powers, Dean of Undergraduate Studies, The College of Charleston
 

Download 30-day Trial
top corner top corner top corner
 

Training Camp

We know that once you start an evaluation of DotImage, you'll want to see if it can meet your requirements quickly. That's why we provide standard demo applications in C# and VB.NET, tutorial videos, whitepapers, and Visual Studio integrated documentation to help get you going quickly.

However, you might want some training straight from our support team. That's why we've created the DotImage Training Camp. For information on attending one of these paid classroom sessions, contact training@atalasoft.com.

 

What's new in DotImage 10

  • New HTML document viewer - zero-footprint with flick scrolling
  • Silverlight Imaging SDK - managed code goodness
  • Improved web scanning - addition of ActiveX TWAIN and is included with Document Imaging edition
  • Barcode reader SDK improved - more symbologies
  • the base DotImage Photo SDK edition is still free

Get a 30-day trial today

bottom corner   bottom corner   bottom corner
preload preload preload
Powered by Olark