Open Source Document Management System | OpenKM

PostPosted:**Mon Nov 14, 2016 9:17 am**

Hi,

We are using OpenKM v 6.2.5
In Content Search we find that if a word is spans 2 lines using hyphen at the end, then that word is not searchable.

Ex:
Line 1: My SSN is 1234-5678-
Line 2: 1234.

So when we try to search for "1234-5678-1234" then the text does not match, because during extraction the extractor adds new line character after the hyphen.

So is there a way to resolve this issue and get the content search working in above scenarios.

PostPosted:**Wed Nov 16, 2016 10:35 am**

About what mime-type are talking about.
Is into the document a lot of text, or few lines ?

PostPosted:**Wed Jan 18, 2017 6:43 am**

You can implement your own custom text extractor. Take a look at the implementation of any of them.

Regards.

Open Source Document Management System | OpenKM

During Text Extractor - How to skip new line characters

During Text Extractor - How to skip new line characters

Re: During Text Extractor - How to skip new line characters

Re: During Text Extractor - How to skip new line characters