• I can't search by content

  • We tried to make OpenKM as intuitive as possible, but advice is always welcome.
Forum rules: Before asking a question, please check the documentation wiki or use the forum's search feature. And remember that we don't have a crystal ball and are not mind readers, so if you post about an issue, tell us which OpenKM version you are using, along with your browser and operating system versions. For more information, read How to Report Bugs Effectively.
 #1678  by Iwen
 
I installed OpenKM 3.0.
I imported a lot of Chinese PDFs, which are scanned documents that have been processed with OCR.
I can't search them by content.
But it works fine in OpenKM 2.0.
Why?
 #1684  by pavila
 
Are the PDFs composed of images or text? Can you post a sample PDF here?
 #1692  by Iwen
 
The PDFs are composed only of text.
When I open a PDF file in Adobe Reader, I can select text in it and search for words in it.
 #1695  by jllort
 
Put one of them on the OpenKM demo, and tell us its location and the search you're doing so we can test it.
 #1700  by Iwen
 
I logged in to the OpenKM demo as user9 and created a folder named Iwen in My documents, where I put one file. The file name is "2.09 明达投资顾问有限公司简介.pdf". I searched by content for "明达", which is the text right after "2.09". But there was no result.
When I open this file in Adobe Reader, I can search for "明达" in the PDF.
 #1708  by pavila
 
OpenKM uses a generic indexing algorithm, and it would need to be tuned for Chinese to give better search results. By the way, the document you mention is a PDF composed of images. It won't be indexed; only PDFs that contain text can be indexed.
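
To illustrate the tuning point only: the sketch below is not OpenKM's indexing code, and the function names are hypothetical. It assumes a generic indexer that splits text on whitespace and matches whole terms, and compares it with overlapping two-character (bigram) tokens, a common approach for Chinese text. It does not help with image-only PDFs, which contain no text to index in the first place.

```python
import re

# One CJK ideograph from the basic Unified Ideographs block.
# Assumption: this single block is enough for the illustration.
CJK = re.compile(r"[\u4e00-\u9fff]")
# A piece is either a run of CJK characters or a run of other non-space characters.
PIECE = re.compile(r"[\u4e00-\u9fff]+|[^\s\u4e00-\u9fff]+")


def whitespace_tokens(text):
    """Generic tokenizer (assumed): one lowercase term per whitespace-separated chunk."""
    return text.lower().split()


def cjk_bigram_tokens(text):
    """Keep non-CJK pieces whole; split CJK runs into overlapping bigrams."""
    tokens = []
    for piece in PIECE.findall(text.lower()):
        if CJK.match(piece) and len(piece) > 1:
            tokens.extend(piece[i:i + 2] for i in range(len(piece) - 1))
        else:
            tokens.append(piece)
    return tokens


def term_match(query, tokens):
    """Exact term lookup, as a simple inverted index would do."""
    return query.lower() in tokens


title = "2.09 明达投资顾问有限公司简介"
print(term_match("明达", whitespace_tokens(title)))   # whole-run term, no match
print(term_match("明达", cjk_bigram_tokens(title)))   # bigram term, match
```

With whitespace splitting, the whole Chinese run becomes a single term, so a query for "明达" has nothing to match exactly. With bigrams, "明达" is itself an indexed term.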
 #9101  by joako
 
So, I am wondering: can a "searchable PDF" of a scanned image not be searched by content? If an OCR process is run on the PDF file, I can search it in Acrobat and other applications, and even select the text and copy and paste it elsewhere. I can provide a sample, but right now the ones I am testing contain sensitive data.
 #9209  by pavila
 
Sorry, but I need a sample PDF to check why the text extraction fails. If your documents contain sensitive data, maybe you want to contact us and become a customer.
