Page 1 of 1

During Text Extractor - How to skip new line characters

PostPosted:Mon Nov 14, 2016 9:17 am
by JavaDev
Hi,

We are using OpenKM v 6.2.5
In Content Search we find that if a word is spans 2 lines using hyphen at the end, then that word is not searchable.

Ex:
Line 1: My SSN is 1234-5678-
Line 2: 1234.

So when we try to search for "1234-5678-1234" then the text does not match, because during extraction the extractor adds new line character after the hyphen.

So is there a way to resolve this issue and get the content search working in above scenarios.

Re: During Text Extractor - How to skip new line characters

PostPosted:Wed Nov 16, 2016 10:35 am
by jllort
About what mime-type are talking about.
Is into the document a lot of text, or few lines ?

Re: During Text Extractor - How to skip new line characters

PostPosted:Wed Jan 18, 2017 6:43 am
by pavila
You can implement your own custom text extractor. Take a look at the implementation of any of them.

Regards.