Page 1 of 1

Search PDF content

PostPosted:Mon Dec 17, 2012 10:04 am
by SecuIT
Hi!

We use OpenKM 6.2.1 on Windows Server 2008R2. Preview and download as PDF is working fine. But we can't search content of PDFs. How I have to configure OpenKM to get this working?


Regards
Sascha

Re: Search PDF content

PostPosted:Tue Dec 18, 2012 5:54 am
by shaardu
You can search by going to "advanced search" and search in "content"

Re: Search PDF content

PostPosted:Tue Dec 18, 2012 10:00 am
by SecuIT
I know it, but it don't works. OpenKM only finds text in my rtf-files. The PDFs are searchable, i have testet it on OpenKM-Demo.

So what I have to do to make OpenKM search the content of my PDF-Files?

Re: Search PDF content

PostPosted:Wed Dec 19, 2012 6:27 pm
by jllort
Your pdf contains images ? because in online demo we have OCR engine enabled ( this could be the cause of the problem )

Re: Search PDF content

PostPosted:Fri Dec 28, 2012 10:22 am
by SecuIT
No, the pdf that I use for testing contains only text.

I have tested with a files that contains images an a text-layer for indexing. Same problem as with text-only pdf. :-(

We have enabled OCR in our setup, too.

Re: Search PDF content

PostPosted:Sat Dec 29, 2012 11:34 am
by jllort
If you go to administration -> stats -> there's and option to see if there're documents pending on indexing queue ( is queue empty ? )

Re: Search PDF content

PostPosted:Wed Jan 02, 2013 11:32 am
by SecuIT
I only found an option to see the text extractions queue. There are no current extractions and no pending extractions.

I can't find an indexing queue. :(

Re: Search PDF content

PostPosted:Thu Jan 03, 2013 10:40 pm
by jllort
If you go to administration -> there's and option to see stats ( some graphics with number of documents etc... ) there's an option at top right. Make a screenshot if you do not see it.