Hi all!
I run OpenKM 5.1.9 on CentOS 5.7 x64; previews for JPEG, PDF and Office documents work fine.
Now I'm trying to configure OCR with Tesseract 3. Here is the value set in the admin settings:
system.ocr String /usr/local/bin/tesseract ${fileIn} ${fileOut}
Do we need to use /usr/local/bin/tesseract ${fileIn} ${fileOut} -l fra instead for the French language?
Also, when I scan a very cleanly printed document, I get these errors:
2012-03-12 17:24:57,984 WARN [com.openkm.extractor.CuneiformTextExtractor] IO exception executing command: /usr/local/bin/tesseract /tmp/image02251514838640367430.tiff /tmp/okm4578699148093339593.txt -l fra
java.util.zip.ZipException: error in opening zip file
at java.util.zip.ZipFile.open(Native Method)
at java.util.zip.ZipFile.<init>(Unknown Source)
at java.util.zip.ZipFile.<init>(Unknown Source)
at com.openkm.util.DocumentUtils.spellChecker(DocumentUtils.java:177)
at com.openkm.extractor.CuneiformTextExtractor.doOcr(CuneiformTextExtractor.java:130)
at com.openkm.extractor.PdfTextExtractor.doOcr(PdfTextExtractor.java:137)
at com.openkm.extractor.PdfTextExtractor.extractText(PdfTextExtractor.java:98)
at org.apache.jackrabbit.extractor.CompositeTextExtractor.extractText(CompositeTextExtractor.java:90)
at org.apache.jackrabbit.core.query.lucene.JackrabbitTextExtractor.extractText(JackrabbitTextExtractor.java:195)
at org.apache.jackrabbit.core.query.lucene.TextExtractorJob$1.call(TextExtractorJob.java:93)
at EDU.oswego.cs.dl.util.concurrent.FutureResult$1.run(Unknown Source)
at org.apache.jackrabbit.core.query.lucene.TextExtractorJob.run(TextExtractorJob.java:172)
at EDU.oswego.cs.dl.util.concurrent.PooledExecutor$Worker.run(Unknown Source)
at java.lang.Thread.run(Unknown Source)
2012-03-12 17:24:58,007 WARN [com.openkm.extractor.CuneiformTextExtractor] IO exception executing command: /usr/local/bin/tesseract /tmp/image03630279688831400324.tiff /tmp/okm6425780083587314556.txt -l fra
java.util.zip.ZipException: error in opening zip file
at java.util.zip.ZipFile.open(Native Method)
at java.util.zip.ZipFile.<init>(Unknown Source)
at java.util.zip.ZipFile.<init>(Unknown Source)
at com.openkm.util.DocumentUtils.spellChecker(DocumentUtils.java:177)
at com.openkm.extractor.CuneiformTextExtractor.doOcr(CuneiformTextExtractor.java:130)
at com.openkm.extractor.PdfTextExtractor.doOcr(PdfTextExtractor.java:137)
at com.openkm.extractor.PdfTextExtractor.extractText(PdfTextExtractor.java:98)
at org.apache.jackrabbit.extractor.CompositeTextExtractor.extractText(CompositeTextExtractor.java:90)
at org.apache.jackrabbit.core.query.lucene.JackrabbitTextExtractor.extractText(JackrabbitTextExtractor.java:195)
at org.apache.jackrabbit.core.query.lucene.TextExtractorJob$1.call(TextExtractorJob.java:93)
at EDU.oswego.cs.dl.util.concurrent.FutureResult$1.run(Unknown Source)
at org.apache.jackrabbit.core.query.lucene.TextExtractorJob.run(TextExtractorJob.java:172)
at EDU.oswego.cs.dl.util.concurrent.PooledExecutor$Worker.run(Unknown Source)
at java.lang.Thread.run(Unknown Source)
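To make the question concrete: the command shown in the log looks like the system.ocr value with ${fileIn} and ${fileOut} replaced by temporary file paths and -l fra left in place. Below is a minimal sketch of that substitution, assuming it is a plain string replacement. The class and method names are hypothetical and this is not OpenKM's actual code, only an illustration of how I understand the setting to be expanded.

```java
public class OcrCommandSketch {
    // Hypothetical helper, not part of OpenKM: fills the two placeholders
    // of a system.ocr-style template with concrete file paths.
    // String.replace treats its arguments literally, so "${...}" is safe here.
    static String expand(String template, String fileIn, String fileOut) {
        return template
                .replace("${fileIn}", fileIn)
                .replace("${fileOut}", fileOut);
    }

    public static void main(String[] args) {
        // Template with the French language option appended, as in the question.
        String template = "/usr/local/bin/tesseract ${fileIn} ${fileOut} -l fra";
        // Example paths, made up for illustration only.
        String cmd = expand(template, "/tmp/image.tiff", "/tmp/okm.txt");
        System.out.println(cmd);
    }
}
```

If the expansion works like this, everything after ${fileOut} in the setting is passed to tesseract unchanged, which would match the -l fra seen at the end of the logged command.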
Last edited by techexpress on Wed Mar 14, 2012 1:42 pm, edited 1 time in total.
