Problem in Search by Content

Posted: Mon Apr 11, 2016 1:29 pm
by Jahangir
Hi,
I am trying to search the content of documents (Word documents / PDFs). Search only works on single words: if I search for "OpenKM is a document manager", it won't find anything. It only returns results if I enter single words such as "document" or "manager".

Is there any setting or specific way to search for a phrase or sentence?

Re: Problem in Search by Content

Posted: Wed Apr 13, 2016 4:36 am
by Jahangir
I am attaching a Word file. Upload the file to OpenKM and search for the following text:

"The paper size is set to Letter"

OpenKM returns nothing for this search. However, if you search for the single word "Letter", it finds the document. It won't even match on two words, for example:

"Title Page"

Re: Problem in Search by Content

Posted: Wed Apr 13, 2016 12:48 pm
by Jahangir
I uploaded the document to the online demo today.
When I search the content in advanced mode for "The paper size is set to Letter", the demo returns a result highlighting the four words "paper size set letter", but the same search finds nothing on my local installation. If I search for just those four words, skipping "The", "is" and "to", the local installation returns the same result as the demo site.

Am I missing some configuration here?
HELP please.
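
The behaviour described in this post (the full sentence fails, the same words without "The", "is" and "to" succeed) is what you would see if stop words were dropped when the document is indexed but kept when the query is analysed. Nothing in this thread confirms that this is what OpenKM does. The following is only a minimal Python sketch of that hypothesis; the stop-word list, the function names and the AND-matching rule are all assumptions made for illustration, not OpenKM or Lucene code.

```python
# Hypothetical stop-word list; a real analyzer ships its own list.
STOP_WORDS = {"the", "is", "to", "a", "in", "this", "that", "on", "you"}


def tokenize(text):
    """Split on whitespace, strip simple punctuation, lowercase."""
    tokens = []
    for raw in text.split():
        word = raw.strip('.,;:"').lower()
        if word:
            tokens.append(word)
    return tokens


def index_terms(text):
    """Assumed index-time analysis: stop words are not stored."""
    return {t for t in tokenize(text) if t not in STOP_WORDS}


def matches(query, indexed, drop_stop_words_in_query):
    """Assumed AND-matching: every query term must be in the index."""
    terms = tokenize(query)
    if drop_stop_words_in_query:
        terms = [t for t in terms if t not in STOP_WORDS]
    return all(t in indexed for t in terms)


indexed = index_terms("The paper size is set to Letter.")

# Query analysed WITHOUT stop-word removal: "the", "is", "to" are missing
# from the index, so the AND-match fails.
full_sentence_raw = matches("The paper size is set to Letter", indexed, False)

# Same query analysed WITH stop-word removal: only the four content words
# remain, and all of them are indexed.
full_sentence_filtered = matches("The paper size is set to Letter", indexed, True)

# Typing only the four content words works either way.
content_words_only = matches("paper size set letter", indexed, False)

print(full_sentence_raw, full_sentence_filtered, content_words_only)
```

If the real cause were of this kind, the demo and the local installation would differ in how the query is analysed rather than in what is stored, which would be consistent with the demo highlighting only "paper size set letter".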

Re: Problem in Search by Content

Posted: Fri Apr 15, 2016 7:29 am
by pavila
Hello,

I've tried it and it works as expected. Try last night's build from http://integration.openkm.com/6.3/

Upload the attached file and after it's indexed, search for "el santo grial".

Regards.

Re: Problem in Search by Content

Posted: Fri Apr 15, 2016 9:38 am
by Jahangir
Thank you, pavila, for replying. I downloaded the nightly build, and it finds "el santo grial" in the document you provided, but it still does not find text in the Word document I attached. Can you please upload the Word document to the community version on your end and try searching for "disclosed that may be essential"?

If I include words such as "to", "that", "you" or "on" in the search, it won't return any results; however, the professional version does.

Regards,

Re: Problem in Search by Content

Posted: Sat Apr 16, 2016 5:59 pm
by jllort
Make sure the document has already been indexed. Keep in mind that when a document is uploaded it goes into the "pending text extractor queue", which you can see in Administration / Stats. The crontab task named "text extractor workers" processes the documents in the queue.
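
To illustrate the point about the queue: an uploaded document is not searchable until a worker has extracted its text. The Python sketch below is a toy model of that flow, not OpenKM code; the names `pending`, `upload`, `run_extractor_pass` and `search` are hypothetical stand-ins for the pending text extractor queue and the "text extractor workers" task.

```python
from collections import deque

pending = deque()  # stand-in for the pending text extractor queue
index = {}         # document name -> extracted text (what search can see)


def upload(name, text):
    """Uploading only enqueues the document; it is not searchable yet."""
    pending.append((name, text))


def run_extractor_pass():
    """Stand-in for one run of the scheduled text extractor task."""
    processed = 0
    while pending:
        name, text = pending.popleft()
        index[name] = text.lower()
        processed += 1
    return processed


def search(word):
    """Return the names of indexed documents containing the word."""
    needle = word.lower()
    return [name for name, text in index.items() if needle in text]


upload("sample.doc", "The paper size is set to Letter")
before = search("Letter")   # the queue has not been processed yet
processed = run_extractor_pass()
after = search("Letter")    # the text is now in the index

print(before, processed, after)
```

In this model a search run right after the upload returns nothing, and the same search succeeds once the scheduled task has emptied the queue, so it is worth checking the queue length in Administration / Stats before concluding that search is broken.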

Re: Problem in Search by Content

Posted: Mon Apr 18, 2016 7:51 pm
by pavila
I've done some work on this. Please try tomorrow's nightly build.

Re: Problem in Search by Content

Posted: Mon May 09, 2016 12:08 pm
by Jahangir
Hi pavila,
I tried last night's build and it has the same problem. I am attaching the document again. I am unable to search for the following content:
"optional in this document"
It returns results if I search for "optional document". It works fine on the demo build. Is there any dictionary or Tesseract problem?

Re: Problem in Search by Content

Posted: Mon Jul 04, 2016 7:28 am
by pavila
The search part is more or less the same in both versions. It should give the same results.