Page 1 of 1

OCR on community ?

PostPosted:Mon Nov 10, 2008 10:33 pm
by justroll
hi small question, is it possible to integrate on the community edition ?

my idea would be to write a webservice and ocr the text and put it as keywords..

i was just currious how the EE edition is doing this .. i was looking in the source code and notice bits&pieces of tesseract usage ..

thanks

Re:OCR on community ?

PostPosted:Tue Nov 11, 2008 12:10 pm
by pavila
OpenKM use Tesseract for OCR by default, but can be expanded to other engines easily (mostly propietary). You have to configure tesseract executable path in OpenKM.cfg using the system.ocr property. Also you need to install Imagemagick package.