• OCR Engine Questions

  • OpenKM has many interesting features, but requires some configuration process to show its full potential.
OpenKM has many interesting features, but requires some configuration process to show its full potential.
Forum rules: Please, before asking something see the documentation wiki or use the search feature of the forum. And remember we don't have a crystal ball or mental readers, so if you post about an issue tell us which OpenKM are you using and also the browser and operating system version. For more info read How to Report Bugs Effectively.
 #22799  by joako
 
I am considering using the ABBYY for Linux OCR engine because I currently use the ABBYY for Windows and have good results with it.

However, ABBYY for Linux is licensed per page. What I am wondering is will OpenKM recognize that PDF files currently in the repository already have been recognized by OCR and will not do it again? Also I am currently importing files directly into OpenKM from my scanner bypassing the OCR, will OpenKM recognize these files as needing OCR? Currently I do not have any OCR engine configured in OpenKM and would not deploy ABBYY for at least a few weeks.
 #22809  by pavila
 
OpenKM will make OCR of new documents and those which were modified. From administation you can see the pending extraction queue (inside statistics).

About Us

OpenKM is part of the management software. A management software is a program that facilitates the accomplishment of administrative tasks. OpenKM is a document management system that allows you to manage business content and workflow in a more efficient way. Document managers guarantee data protection by establishing information security for business content.