• content searching not working properly

  • OpenKM has many interesting features, but requires some configuration process to show its full potential.
 #6062  by ibrahim
 
Hi

So far I have configured OpenKM 4.0 with MySQL 5.0, and it is working fine as required.

But whenever I search for documents 'by content', the results show only .txt files. Files with the extensions .doc, .docx, .xls, .xlsx, .pdf, etc. do not appear in the results.

Have you got any ideas to solve this problem?

best regards, Ibrahim
 #6064  by jllort
 
Very, very strange... normally the problem is that the queries are not written correctly.

Do the following:
1- Test the document in our online demo at demo.openkm.com and tell me whether it works there. If not, tell me where you uploaded the document and what your query was.
 #6074  by ibrahim
 
Hi,

I have tested the same in your demo, and it works fine. But in my application the search only finds the .txt files.

I have uploaded the documents to the Taxonomy (okm:root). The query that I am executing is as follows:
Code:
/jcr:root/okm:root//*[@jcr:primaryType eq 'okm:void' or (@jcr:primaryType eq 'okm:document' and jcr:contains(okm:content,'ibrahim'))] order by @jcr:score descending 
Have you got any ideas?

best regards, Ibrahim
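
For anyone reproducing this search from a script, the query above can be assembled from a search term as in the following minimal sketch. The helper names `xpath_string_literal` and `build_content_query` are hypothetical and are not part of OpenKM or Jackrabbit; the sketch assumes that an apostrophe inside an XPath string literal is escaped by doubling it, and it does not handle the special syntax of the jcr:contains search expression itself.

```python
def xpath_string_literal(text: str) -> str:
    """Quote text as an XPath string literal.

    Assumption: an embedded apostrophe is escaped by doubling it.
    """
    return "'" + text.replace("'", "''") + "'"


def build_content_query(term: str, root: str = "/jcr:root/okm:root") -> str:
    """Rebuild the content-search query from the post for a given term.

    Hypothetical helper: it only produces the query string and does not
    talk to a repository.
    """
    literal = xpath_string_literal(term)
    return (
        f"{root}//*[@jcr:primaryType eq 'okm:void' or "
        "(@jcr:primaryType eq 'okm:document' and "
        f"jcr:contains(okm:content,{literal}))] "
        "order by @jcr:score descending"
    )


if __name__ == "__main__":
    # Prints the same query that was posted above.
    print(build_content_query("ibrahim"))
```

Building the string in one place makes it easier to run exactly the same query against the demo and against a local installation when comparing results.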
 #6088  by jllort
 
The only problem I can imagine is that, for some reason, the content of the .doc files has not been indexed. Which version of .doc are you using? The demo runs OpenKM 5.0, which has better support for Office formats than 4.X (we've upgraded some libraries that were not yet available when we released version 4.X).
 #6239  by ibrahim
 
Hi,

I have solved the content searching problem with Microsoft Office files. But I am still not able to search the contents of a .pdf file. Whenever I upload a PDF file, I get a warning on the server side:
Code:
org.apache.jackrabbit.extractor.PdfTextExtractor - Failed to extract PDF text content 
Do you have any ideas to solve this problem?

best regards, Ibrahim
 #6253  by jllort
 
Does that PDF have some restriction, like a password, copying disabled or similar? Could you try with another PDF created by you without any restrictions? The problem is probably there.
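
One rough way to check for such a restriction before uploading is to look for an /Encrypt entry in the file, since password-protected or permission-restricted PDFs declare one. The sketch below is only a heuristic under that assumption: the function names `looks_encrypted` and `check_file` are hypothetical, nothing here is part of OpenKM or Jackrabbit, and a byte scan can give false positives if the string happens to appear elsewhere in the file.

```python
from pathlib import Path


def looks_encrypted(pdf_bytes: bytes) -> bool:
    """Heuristic: a PDF that declares an /Encrypt entry is likely restricted.

    This scans the raw bytes only; it does not parse the PDF structure.
    """
    return pdf_bytes.startswith(b"%PDF-") and b"/Encrypt" in pdf_bytes


def check_file(path: str) -> bool:
    """Read a file from disk and apply the heuristic (hypothetical helper)."""
    return looks_encrypted(Path(path).read_bytes())
```

If the check returns False for a file that still fails to extract, the restriction theory becomes less likely and the extraction libraries of the installation are the next thing to compare with the demo.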
 #6312  by ibrahim
 
I have created my own .pdf file and tried it, but the same problem occurs. Moreover, I have also tried it in your demo, where it works fine.

Now I am not getting the problem with .pdf files.

Do you have any suggestions related to this problem?
