• Help in Tesseract Integration - Community Edition

  • Problems with installing OpenKM? No problemo, the solution is closer than you think.
 #40407  by slackbot
 
Hi,

I am a novice user of OpenKM and Tesseract, working on a 64-bit Windows system. Can anyone please guide me on how to integrate the Tesseract OCR engine with the Community Edition of OpenKM? I've read http://wiki.openkm.com/index.php/Third- ... ation:_OCR and http://wiki.openkm.com/index.php/Applic ... abling_OCR, but I couldn't understand how to integrate it, how to change the config file, or where that file is located.

PS: I am working with Maven on Eclipse Juno; I've added all the directories, and Tomcat seems to be working fine.
 #40411  by KittyCathy
 
Joining this topic. I have the same problem.
 #40427  by jllort
 
I assume you have installed Tesseract OCR on your server.
What is your system.ocr value?
Have you tested the OCR directly on the server? I suggest running it on an image on the server first, and then uploading that image to OpenKM.
In registered.text.extractor, did you remove com.openkm.extractor.CuneiformTextExtractor and add the class for Tesseract? (After this change, I suggest restarting the service.)
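
To test the OCR on the server before involving OpenKM, something like the following rough Python sketch might help. It only wraps the standard Tesseract command line (`tesseract <image> <output_base> -l <language>`), which writes the recognised text to `<output_base>.txt`. The function names, the default `eng` language, and the assumption that `tesseract` is on the PATH are illustrative choices and not anything OpenKM itself requires.

```python
import shutil
import subprocess
from pathlib import Path


def build_tesseract_command(image_path, output_base, language="eng", executable="tesseract"):
    """Return the argument list for: tesseract <image> <output_base> -l <language>.

    The executable name and the default language are assumptions; adjust them
    to match the installation on your server.
    """
    return [executable, str(image_path), str(output_base), "-l", language]


def run_ocr_check(image_path, output_base, language="eng", executable="tesseract"):
    """Run Tesseract once and return the recognised text.

    Returns None when the executable cannot be found, which usually means
    Tesseract is not installed or not on the PATH.
    """
    if shutil.which(executable) is None:
        return None
    command = build_tesseract_command(image_path, output_base, language, executable)
    # check=True raises CalledProcessError if Tesseract exits with an error.
    subprocess.run(command, check=True, capture_output=True)
    # Tesseract appends ".txt" to the output base name by default.
    return Path(str(output_base) + ".txt").read_text(encoding="utf-8")
```

For example, `run_ocr_check("sample.png", "sample_out")` (both file names are hypothetical) should return the text found in the image. If it returns None or raises an error, the problem is in the Tesseract installation rather than in the OpenKM configuration.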
