• PDF Full Text Search

  • OpenKM has many interesting features, but requires some configuration process to show its full potential.
 #21463  by cjain78
 
How can we enable full-text search for PDF files that contain images? We have also tried PDF.Force.OCR, but it didn't work.
Is the Tesseract engine capable of handling this kind of PDF file?

Kindly suggest.
 #21509  by jllort
 
This could be an image resolution problem. If the images have a low resolution, the OCR engine may not be able to extract keywords (in that case a non-open-source OCR engine should be used). One way to test this is to extract the image from the PDF and then run Tesseract on it from the terminal.
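
A minimal sketch of that test, wrapped in Python so it can be repeated on several files. It assumes the `pdfimages` tool (from poppler-utils) and the `tesseract` command are both installed and on the PATH; the choice of `pdfimages` for the extraction step and the function names are only illustrative assumptions, since any tool that pulls the images out of the PDF would do.

```python
import shutil
import subprocess
import sys
import tempfile
from pathlib import Path


def pdfimages_cmd(pdf_path, out_prefix):
    """Command that writes each embedded image as <out_prefix>-NNN.png."""
    return ["pdfimages", "-png", str(pdf_path), str(out_prefix)]


def tesseract_cmd(image_path, lang="eng"):
    """Command that OCRs one image and prints the text to standard output."""
    return ["tesseract", str(image_path), "stdout", "-l", lang]


def ocr_pdf_images(pdf_path, lang="eng"):
    """Return {image file name: recognized text} for every image in the PDF."""
    # Assumption: both command-line tools are installed separately.
    missing = [t for t in ("pdfimages", "tesseract") if shutil.which(t) is None]
    if missing:
        raise RuntimeError("not found on PATH: " + ", ".join(missing))

    results = {}
    with tempfile.TemporaryDirectory() as tmp:
        # Step 1: extract the embedded images from the PDF.
        subprocess.run(pdfimages_cmd(pdf_path, Path(tmp) / "img"), check=True)
        # Step 2: run Tesseract on each extracted image.
        for image in sorted(Path(tmp).glob("img-*.png")):
            done = subprocess.run(
                tesseract_cmd(image, lang),
                check=True,
                capture_output=True,
                text=True,
            )
            results[image.name] = done.stdout
    return results


if __name__ == "__main__":
    for name, text in ocr_pdf_images(sys.argv[1]).items():
        print(f"{name}: {len(text.strip())} characters recognized")
```

If Tesseract returns little or no text for the extracted images, low image resolution is a likely cause; if it returns readable text, the problem is more likely in the OpenKM configuration than in the OCR engine.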
