Search does not return any text from email

Posted: Fri Aug 22, 2014 11:38 am
by malte
Hi there!

It seems I have a problem with email search. When I upload a mail from Outlook via the OKM plugin, the mail is stored properly, and the text extractor correctly extracts text from images in the mail and from its attachments.
When I then search for the mail, I do not get any result. Searching for other keywords may return attachments of that mail, but never the mail's body text or its subject.

I tried uploading an email in .msg format to the text extractor tool in the administrator menu, but it does not return any text at all. Instead I can see the following error in the log:
WARN  org.apache.jackrabbit.extractor.MsOutlookTextExtractor- Failed to extract Message content
org.apache.poi.hsmf.exceptions.ChunkNotFoundException: Chunk not found
Doing the same with a Word file works fine and shows me its text.

Any help is appreciated.

My System:
Debian 7 x86-64
Tried both OpenJDK and Sun JDK
OpenKM community 6.3.0

Regards
Malte

EDIT: I just found out that the mail content is stored in the table OKM_NODE_MAIL, but full-text search does not return any of it.

Re: Search does not return any text from email

Posted: Sun Aug 24, 2014 8:33 am
by jllort
Could you try it in our online demo at demo.openkm.com?
Can you provide us with a msg file for testing purposes and tell us which search you are trying to run (a screenshot would be even better)?

Re: Search does not return any text from email

Posted: Mon Aug 25, 2014 10:24 am
by malte
Hi jllort and thanks for your reply!

I cannot add a .msg file here, but I have added a screenshot of the mail. The mail has been uploaded into the taxonomy and its text is present in the OKM_NODE_MAIL table.
When I search for "Daten*" I only get some PDF files (whose text is indexed fine). The mail, which should obviously be there, is not found.

I will try the file inside the demo...

Malte
Attachment: mail_sample.jpg (113.15 KiB), sample msg file as picture

EDIT
I've just discovered the problem... When I check all boxes (Document, Folder, Mail) in "Extended Search" and hit search, the results are fine.
When I then switch back to "Basic Search" and search again, the results are also fine. Could this be a bug?
Attachments: E-search.JPG (32.09 KiB), b-search.JPG (17.68 KiB)

Re: Search does not return any text from email

Posted: Wed Aug 27, 2014 2:45 pm
by jllort
Indeed, the basic search only takes documents into consideration. When we implemented it we did not think about mail, although it could be supported there as well. We will take note of it and implement it in the near future.
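
For readers hitting the same issue, the behaviour discussed in this thread (Basic Search covering documents only, Extended Search covering whatever boxes are ticked) can be illustrated with a small sketch. This is not OpenKM code: the `Domain` flags, the `Node` class, the `search` function and the sample data are all hypothetical and only model the idea of a search restricted to certain node types.

```python
from dataclasses import dataclass
from enum import IntFlag


class Domain(IntFlag):
    """Hypothetical node-type flags, one per Extended Search checkbox."""
    DOCUMENT = 1
    FOLDER = 2
    MAIL = 4


@dataclass
class Node:
    name: str
    kind: Domain
    text: str


def search(nodes, prefix, domain=Domain.DOCUMENT):
    """Return names of nodes in `domain` containing a word that starts with `prefix`.

    The default domain is documents only, which is the assumed
    Basic Search behaviour in this sketch.
    """
    prefix = prefix.lower()
    return [
        n.name
        for n in nodes
        if n.kind & domain
        and any(word.startswith(prefix) for word in n.text.lower().split())
    ]


# Made-up sample data: one document and one mail that both match "Daten*".
nodes = [
    Node("report.pdf", Domain.DOCUMENT, "Daten aus dem Bericht"),
    Node("sample.msg", Domain.MAIL, "Datenbank Migration"),
]

basic = search(nodes, "Daten")
extended = search(nodes, "Daten", Domain.DOCUMENT | Domain.FOLDER | Domain.MAIL)
```

In this model the mail is indexed and matches the query, but it only appears once the mail domain is explicitly included, which is consistent with ticking all three boxes in Extended Search making the mail show up.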