Atalasoft DotImage

OCR Searchable PDF SDK for .NET

Add-on to DotImage for creating text-only or text-underneath image PDFs

A great addition to any capture solution: create searchable PDFs as part of your OCR workflow, either as text-only documents or with the text placed underneath the page image. Learn how this product can fit into your solution in the article "How to Make PDF Your Imaging Format".

The DotImage OCR Searchable PDF module has runtime royalty-free distribution for desktop applications. It requires the DotImage Document Imaging edition and either the OCR with Tesseract or the GlyphReader engine add-on.

The DotImage OCR Searchable PDF module can be used with the DotImage OCR module and any associated engines to generate Adobe PDF documents containing the characters recognized during the OCR process.

Feature List

  • Create text-only or text-underneath-image PDFs
  • Supports JBIG2 image compression with a licensed JBIG2 codec
  • Generates high-quality thumbnails from the original images
  • Generates PDF v1.4 or v1.5 when using JBIG2 or JPEG2000 compression
  • Can be used with any OCR engine that works with the DotImage OCR module
  • Create segmented documents (i.e., separate text and image blocks)
  • Can automatically rotate pages that the OCR engine reports as rotated
  • Embedded TrueType font support
  • Supports ICC Color profiles
  • Document metadata support, including Title, Subject, Author, Creation Date and Keywords
  • Full support for color text
  • Compressed page content streams
 

How-to


The PdfTranslator can be used with any of the DotImage OCR engines to create searchable PDFs:

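// Create and initialize the OCR engine.
GlyphReaderEngine engine = new GlyphReaderEngine();
engine.Initialize();
// Recognize the pages supplied by myImageSource and write the result as a PDF.
engine.Translate(myImageSource, "application/pdf", @"C:\output.pdf");
// Shut the engine down and release its resources when finished.
engine.ShutDown();
engine.Dispose();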

These articles can help you customize your application further to get exactly the results you want in a searchable PDF:
Perform an OCR "Translate" on a single AtalaImage
Prevent OCR on a Page while including it in the Final Document
Generate a searchable PDF from a multipage TIFF with GlyphReader OCR
 


Customer Quote

"The Graduate School Office feels very strongly that this system has gone beyond being merely useful and has entered the realm of being necessary. It has allowed us a safe and long-lasting method for archiving student and other records which may need to be accessed in a timely manner."

- Sandra Powers, Dean of Undergraduate Studies, The College of Charleston
 

Download 30-day Trial
 

Training Camp

We know that once you start an evaluation of DotImage, you'll want to find out quickly whether it meets your requirements. That's why we provide standard demo applications in C# and VB.NET, tutorial videos, whitepapers, and Visual Studio integrated documentation to help you get started.

However, you might want some training straight from our support team. That's why we've created the DotImage Training Camp. For information on attending one of these paid classroom sessions, contact training@atalasoft.com.

 

What's new in Atalasoft 10.3

  • The Web Document Viewer component is now supported on iOS Mobile Safari and Android's Chrome.
  • The Web Document Viewer includes a JavaScript API for creating annotations, altering context menus, changing the properties of an annotation, and changing the current page.
  • WingScan can now read the index fields associated with a repository and content type from SharePoint, let you edit them, and include their values in the import.
  • WingScan can be configured to import documents into SharePoint.
  • JoltImage now includes a barcode reader with the same functionality as the .NET edition.
  • The assemblies have been simplified, and some have been combined, to make deployment simpler.

Get a 30-day trial today
