Atalasoft
Welcome to Atalasoft Community Sign in | Join | Help
in

Atalasoft's 31 Apps in 31 Days

ClickWord: Online OCR by Click

Welcome to the 22nd of 31 applications we will post (in addition to the contest participants' submissions). Make sure you add this blog to your RSS Atom feed and check the gallery for summaries of all the apps as they are released.

 

 

 

 

 

 

 

 

 

 

This application is a great example of how you might be able to quickly leverage the words in a document in other applications. By simply clicking on words, ClickWord determines the text underneath your mouse and gets it ready for you to copy it with a CTRL-C key stroke. Very simple.

Check out ClickWord

Download the Source

Published Thursday, May 22, 2008 11:16 AM by jacobl

Comments

 

DotNetKicks.com said:

You've been kicked (a good thing) - Trackback from DotNetKicks.com

May 23, 2008 12:44 PM
 

saksu said:

This is a greate OCR Web application.

But,when i deploy this application to c:\inetpub\wwwroot,this doesn't work.

The problem is yellow ocr area doesn't display in screen.

Why?

Thanks.

May 14, 2009 11:27 PM
 

jacobl said:

Greetings saksu,

Please follow the instructions for setting up Tesseract in our DotImage documentation (http://www.atalasoft.com/products/dotimage/docs/Tesseract_Engine.html). Running the DotImage installer will fulfill most of those requirements and can be found here, http://www.atalasoft.com/download/downloadDotImage.aspx.

Best,

Jacob

May 18, 2009 9:29 AM
 

saksu said:

Thank you for your reply.

I already installed dotImage, and if i run clickword application in visual studio by clicking F5,it does work.

But if I deploy this application to c:\inetpub\wwwroot and I call this page,it doesn't work.

I replaced atalasoft dll into my setup dll.

The problem is that Recognize(atalaImage) of TesseractEngine stops.

What's the problem?

May 27, 2009 8:42 PM
 

jacobl said:

Hi saksu,

Are you trying to deploy ClickWord to the wwwroot directory on the same machine you installed the SDK on? If not, you will need to either install the SDK or set the TESSDATA_PREFIX environment variable to the directory where tessdata is located. You can copy the tesseract resources from your development machine in C:\Program Files\Atalasoft\DotImage 7.0\bin\OCRResources.

Best,

Jacob

May 28, 2009 8:13 AM
 

jacobl said:

saksu,

You could also try and use ProcMon to see where ClickWord is looking for the resources. Run ProcMon and filter on the IIS worker process (w3wp.exe) and click on a word making ClickWord attempt to use the engine.

- Jacob

May 28, 2009 8:16 AM
 

saksu said:

Hi,jacobl.

Thank you very much for your reply.

I deployed ClickWord to the wwwroot directory on the same machine I installed the SDK on,and I copied the tesseract resources from C:\Program Files\Atalasoft\DotImage 6.0\bin\OCRResources to wwwroot directory.

But Recognize of Tesseract still stops.

Why?

Thanks.

May 31, 2009 10:41 PM
 

jacobl said:

Have you tried using ProcMon to see where ClickWord is looking for the resources?

If you did, we can then just put the resources in that location and you should be all set.

Did you create an environment variable called TESSDATA_PREFIX and set it to the wwwroot directory where you have copied the tesseract resources?

Best,

Jacob

June 1, 2009 7:35 AM
 

saksu said:

Hi,jacobl

I tried to find the resources location where ClickWord is looking for by using ProcMon,but when i filtered w3wp.exe on ProcMon running,there are so many records and i couldn't find the resources location.

And I registered TESSDATA_PREFIX variable as below code:

string absolutepath = "C:/Inetpub/wwwroot/Bin/OcrResources/Tesseract/v2.01";

           System.Environment.SetEnvironmentVariable("TESSDATA_PREFIX", absolutepath, EnvironmentVariableTarget.User);

But still problem.

Thanks

June 2, 2009 4:08 AM
Anonymous comments are disabled