• Search does not return any text from email

  • We tried to make OpenKM as intuitive as possible, but an advice is always welcome.
 #29634  by malte
 
Hi there!

It seems I have a problem with email search. When I upload a mail from Outlook via the OKM plugin, the mail is stored properly, and the text extractor correctly extracts text from images in the mail and from its attachments.
When I then search for the mail, I do not get any result. Searching for other keywords may return attachments of that mail, but never the mail's body text or its subject.

I tried uploading an email in .msg format to the text extractor tool in the administrator menu, but it does not return any text at all. Instead I can see the following error in the log:
WARN  org.apache.jackrabbit.extractor.MsOutlookTextExtractor- Failed to extract Message content
org.apache.poi.hsmf.exceptions.ChunkNotFoundException: Chunk not found

Doing the same with a Word file works fine and shows me its text.

Any help is appreciated.

My System:
Debian 7 x86-64
tried openjdk and sun jdk
OpenKM community 6.3.0

Regards
Malte

EDIT: I just found out that the mail content is stored in the table OKM_NODE_MAIL, but full-text search does not return any of it.
 #29645  by jllort
 
Could you try it in our online demo at demo.openkm.com?
Can you provide us with a .msg file for testing purposes and tell us which search you are trying to do (preferably with a screenshot)?
 #29660  by malte
 
Hi jllort and thanks for your reply!

I cannot attach a .msg file here, but I added a screenshot of the mail. The mail has been uploaded into the taxonomy and its text is found in the OKM_NODE_MAIL table.
When I search for "Daten*" I only get some PDF files (whose text is indexed well). The mail, which should obviously be there, is not found.

I will try the file inside the demo...

Malte
Attachment: mail_sample.jpg (sample msg file as picture)

EDIT
I've just discovered the problem... When I check all boxes (Document,Folder,Mail) in "Extended Search" and hit search, the results are fine.
When I then switch back to "Basic Search" and do a search again the results are also fine. Could this be a bug?
Attachments: E-search.JPG, b-search.JPG
 #29686  by jllort
 
The basic search really only takes documents into consideration. When we implemented it we did not think about mail, although it could be handled there as well. We have taken note of it and will implement it in the near future.
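
For readers who want to picture the behaviour discussed in this thread, here is a minimal Python sketch of a search that is restricted by node type. It is not OpenKM code: the `Node` class, the `search` function and the kind names are hypothetical and only mirror the Document / Folder / Mail checkboxes from "Extended Search". The documents-only default is an assumption that imitates what the basic search appears to do.

```python
from dataclasses import dataclass

# Hypothetical node kinds, mirroring the three "Extended Search" checkboxes.
DOCUMENT, FOLDER, MAIL = "document", "folder", "mail"


@dataclass(frozen=True)
class Node:
    kind: str   # one of DOCUMENT, FOLDER, MAIL
    name: str   # file or mail name shown in the result list
    text: str   # extracted text that the search looks at


def search(nodes, term, kinds=(DOCUMENT,)):
    """Return the names of nodes of the given kinds that match a prefix term.

    A trailing "*" is treated as a prefix wildcard, as in "Daten*".
    The default of documents only is an assumption for illustration.
    """
    prefix = term.rstrip("*").lower()
    return [
        node.name
        for node in nodes
        if node.kind in kinds
        and any(word.startswith(prefix) for word in node.text.lower().split())
    ]


nodes = [
    Node(DOCUMENT, "report.pdf", "Daten aus dem Bericht"),
    Node(MAIL, "sample.msg", "Datenbank Update"),
]

# Documents only: the mail is filtered out even though its text matches.
basic = search(nodes, "Daten*")

# All kinds selected: the mail shows up as well.
extended = search(nodes, "Daten*", kinds=(DOCUMENT, FOLDER, MAIL))
```

In this sketch the mail is indexed and matches the term, yet it never appears unless the mail kind is part of the filter, which is consistent with the results seen after ticking all three boxes.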
