Page 1 of 1

openoffice dictionary encoding problem

PostPosted:Fri Feb 17, 2012 10:28 am
by mickeyratt
Dear readers

I have a problem with openkm. I use a debian system:
Linux openkm 2.6.32-5-xen-amd64 #1 SMP Mon Jan 16 20:48:30 UTC 2012 x86_64 GNU/Linux

with locale:
Code: Select all
LANG=hu_HU.UTF-8
LANGUAGE=
LC_CTYPE="hu_HU.UTF-8"
LC_NUMERIC="hu_HU.UTF-8"
LC_TIME="hu_HU.UTF-8"
LC_COLLATE="hu_HU.UTF-8"
LC_MONETARY="hu_HU.UTF-8"
LC_MESSAGES="hu_HU.UTF-8"
LC_PAPER="hu_HU.UTF-8"
LC_NAME="hu_HU.UTF-8"
LC_ADDRESS="hu_HU.UTF-8"
LC_TELEPHONE="hu_HU.UTF-8"
LC_MEASUREMENT="hu_HU.UTF-8"
LC_IDENTIFICATION="hu_HU.UTF-8"
LC_ALL=
i'd like to use the latest openoffice dictionary to "enhance" cuneiform OCR process, but the following error message appears whenever i add a file, i tried a lot of version of oxt files (the earliest too) but the error message still appears:
Code: Select all
11:26:01,540 WARN  [CuneiformTextExtractor] Failed to extract OCR text
java.lang.IllegalStateException: java.util.concurrent.ExecutionException: java.io.UnsupportedEncodingException: 1262
        at org.dts.spell.dictionary.OpenOfficeSpellDictionary.waitToLoad(OpenOfficeSpellDictionary.java:289)
        at org.dts.spell.dictionary.OpenOfficeSpellDictionary.getSuggestions(OpenOfficeSpellDictionary.java:264)
        at com.openkm.util.DocumentUtils.spellChecker(DocumentUtils.java:185)
        at com.openkm.extractor.CuneiformTextExtractor.doOcr(CuneiformTextExtractor.java:130)
        at com.openkm.extractor.PdfTextExtractor.doOcr(PdfTextExtractor.java:137)
        at com.openkm.extractor.PdfTextExtractor.extractText(PdfTextExtractor.java:98)
        at org.apache.jackrabbit.extractor.CompositeTextExtractor.extractText(CompositeTextExtractor.java:90)
        at org.apache.jackrabbit.core.query.lucene.JackrabbitTextExtractor.extractText(JackrabbitTextExtractor.java:195)
        at org.apache.jackrabbit.core.query.lucene.TextExtractorJob$1.call(TextExtractorJob.java:93)
        at EDU.oswego.cs.dl.util.concurrent.FutureResult$1.run(Unknown Source)
        at org.apache.jackrabbit.core.query.lucene.TextExtractorJob.run(TextExtractorJob.java:172)
        at EDU.oswego.cs.dl.util.concurrent.PooledExecutor$Worker.run(Unknown Source)
        at java.lang.Thread.run(Thread.java:662)
Caused by: java.util.concurrent.ExecutionException: java.io.UnsupportedEncodingException: 1262
        at java.util.concurrent.FutureTask$Sync.innerGet(FutureTask.java:222)
        at java.util.concurrent.FutureTask.get(FutureTask.java:83)
        at org.dts.spell.dictionary.OpenOfficeSpellDictionary.waitToLoad(OpenOfficeSpellDictionary.java:283)
        ... 12 more
Caused by: java.io.UnsupportedEncodingException: 1262
        at sun.nio.cs.StreamDecoder.forInputStreamReader(StreamDecoder.java:52)
        at java.io.InputStreamReader.<init>(InputStreamReader.java:83)
        at org.dts.spell.dictionary.myspell.MySpell.load_tables(MySpell.java:390)
        at org.dts.spell.dictionary.myspell.MySpell.initFromStreams(MySpell.java:177)
        at org.dts.spell.dictionary.myspell.MySpell.<init>(MySpell.java:69)
        at org.dts.spell.dictionary.OpenOfficeSpellDictionary.initFromZipFile(OpenOfficeSpellDictionary.java:198)
        at org.dts.spell.dictionary.OpenOfficeSpellDictionary.access$100(OpenOfficeSpellDictionary.java:31)
        at org.dts.spell.dictionary.OpenOfficeSpellDictionary$2.call(OpenOfficeSpellDictionary.java:88)
        at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
        at java.util.concurrent.FutureTask.run(FutureTask.java:138)
        at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
        ... 1 more
If you have an advice, it would be helpful to me. Thank you in advance!

Re: openoffice dictionary encoding problem

PostPosted:Mon Feb 20, 2012 9:23 am
by pavila
Please, post a link to the dictionary or attach it to the thread so I can test and verify the error in my local installation.

Re: openoffice dictionary encoding problem

PostPosted:Mon Feb 20, 2012 10:17 am
by mickeyratt
Thank you pavila, i thought it is openkm related problem. I'll try, if there will be solution, i'll post here. Thanks again.

Re: openoffice dictionary encoding problem

PostPosted:Mon Mar 05, 2012 2:34 pm
by pavila
Ok, but I need the dictionary to test in my local installation and fix it in case of being a bug.

Re: openoffice dictionary encoding problem

PostPosted:Wed Nov 25, 2015 9:07 pm
by ecom
I have same error if enable dictionary file.
System is OKM 6.3.1 and Ubuntu server 14.10

How to solve?

Thank you for reply.