• OCR on community edition?

  • We tried to make OpenKM as intuitive as possible, but advice is always welcome.
Forum rules: Before asking something, please check the documentation wiki or use the forum's search feature. And remember we don't have a crystal ball and are not mind readers, so if you post about an issue, tell us which OpenKM version you are using, along with the browser and operating system version. For more info, read How to Report Bugs Effectively.
 #1347  by justroll
 
Hi, small question: is it possible to integrate OCR on the community edition?

My idea would be to write a web service that OCRs the text and stores it as keywords.

I was just curious how the EE edition is doing this. I was looking in the source code and noticed bits and pieces of Tesseract usage.

thanks
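
The idea above (OCR the text, then store it as keywords) could be sketched roughly as follows. This is only an illustration in Python, not how OpenKM or the EE edition does it: it assumes the `tesseract` command-line tool is installed and on the PATH, and the function names, the stop-word list, and the frequency-based keyword choice are all hypothetical. Attaching the keywords to a document in OpenKM is left out, since the thread does not describe that API.

```python
import re
import subprocess
from collections import Counter

# Small illustrative stop-word list (an assumption; extend as needed).
STOP_WORDS = {"the", "and", "for", "with", "that", "this", "from", "are", "was"}


def ocr_image(image_path, tesseract_bin="tesseract"):
    """Run the Tesseract CLI on an image and return the recognized text."""
    # "tesseract <image> stdout" writes the recognized text to standard output.
    result = subprocess.run(
        [tesseract_bin, image_path, "stdout"],
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout


def extract_keywords(text, limit=10, min_length=3):
    """Return the most frequent words, skipping short words and stop words."""
    words = re.findall(r"[a-z]+", text.lower())
    counts = Counter(
        w for w in words if len(w) >= min_length and w not in STOP_WORDS
    )
    return [word for word, _ in counts.most_common(limit)]
```

A web service would then call something like `extract_keywords(ocr_image("scan.png"))` for each uploaded image (the file name here is just a placeholder) and pass the resulting list on as the document's keywords.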
 #1351  by pavila
 
OpenKM uses Tesseract for OCR by default, but it can easily be extended to other engines (mostly proprietary). You have to configure the Tesseract executable path in OpenKM.cfg using the system.ocr property. You also need to install the ImageMagick package.
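
As a rough illustration, the entry in OpenKM.cfg might look like the line below. The path is only an assumption for a typical Linux install; use whatever location the tesseract executable has on your system.

```
# OpenKM.cfg (the path below is an assumed example, adjust to your system)
system.ocr=/usr/bin/tesseract
```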
