• Content Search not effective

  • We tried to make OpenKM as intuitive as possible, but an advice is always welcome.
We tried to make OpenKM as intuitive as possible, but an advice is always welcome.
Forum rules: Please, before asking something see the documentation wiki or use the search feature of the forum. And remember we don't have a crystal ball or mental readers, so if you post about an issue tell us which OpenKM are you using and also the browser and operating system version. For more info read How to Report Bugs Effectively.
 #25415  by vielktus
 
Hi guys,
I've just found out that some content files cannot be searched.

For example: i got "A Window" string in File A and File B (pdf and word respectively)
When i search full text "A Window", the result is just File A.

Is there anyway to improve this ? I got lot of content that cannot be searched :(

OpenKM version: OpenKM 6.4 community
 #25439  by jllort
 
pdf file contains text ( you can copy and paste ) or is an image. I suspect the problem is that you got pdf with images, and then you need to configure OCR engine to index pdf images. Other interesting thing is upload pdf/a files http://wiki.openkm.com/index.php/Cognitive_PDF/A here some video http://youtu.be/XRnsgVfFtHc ( actally only available in spanish but you can understand perfectly what is done ). PDF/a files has a layout with text content
 #25508  by jllort
 
The other post problem is caused by a lock in index queue.
You're using version 6.2.4 can you try upgrade to nighly build available at integration.openkm.com and here is migration guide for what will be version 6.2.5 http://wiki.openkm.com/index.php/Migrat ... 4_to_6.2.5 ( can jump without problem to this version )

About Us

OpenKM is part of the management software. A management software is a program that facilitates the accomplishment of administrative tasks. OpenKM is a document management system that allows you to manage business content and workflow in a more efficient way. Document managers guarantee data protection by establishing information security for business content.