OCR Searchable PDF SDK for .NET
Add-on to DotImage for creating text-only or text-underneath image PDFs
A great addition to any capture solution to create searchable PDFs with your OCR workflow. Create text-only or text-underneath PDF documents. Learn how this product can fit in to your solution in the article "How to Make PDF Your Imaging Format"
The DotImage OCR Searchable PDF module has runtime royalty-free distribution for desktop applications and requires DotImage Document Imaging edition and the OCR with Tesseract or Glyphreader engine add-ons.
The DotImage OCR Searchable PDF Module can be used with DotImage OCR module and any associated engines to generate Adobe PDF documents containing recognized characters from the OCR recognition process.
- Create text only or text underneath image PDF's
- Supports JBIG2 image compression with a licensed JBIG2 codec
- Generates high quality thumbnails from the original images
- Will generate PDF v1.4 or v1.5 when using JBIG2 or JPEG2000 compression
- Can be used with any OCR engine using the DotImage OCR Module.
- Create segmented documents (ie, separate text and image blocks)
- Can automatically rotate pages that are reported to be rotated by the OCR engine.
- Embedded TrueType font support
- Supports ICC Color profiles
- Document metadata support, including Title, Subject, Author, Creation Date and Keywords
- Full support for color text
- Compressed page content streams
The PdfTranslator can be used with any of the DotImage OCR engines to create searchable PDFs:
GlyphReaderEngine engine = new GlyphReaderEngine();
engine.Initialize();
engine.Translate(myImageSource, "application/pdf", @"C:\output.txt");
engine.ShutDown();
engine.Dispose();
These articles can help you further customize your application to give you exactly the results you are looking for in a searchable Pdf:
Perform an OCR "Translate" on a single AtalaImage
Prevent OCR on a Page while including it in the Final Document
Generate a searchable PDF from a multipage TIFF with GlyphReader OCR
Customer Quote
"The Graduate School Office feels very strongly that this system has gone beyond being merely useful and has entered the realm of being necessary. It has allowed us a safe and long-lasting method for archiving student and other records which may need to be accessed in a timely manner."
- Sandra Powers, Dean of Undergraduate Studies, The College of Charleston