OCR issues - convert pre-process not working
PostPosted:Wed Apr 13, 2011 2:47 pm
OpenKM 5.0.3 (build: 5159) on Ubuntu server
tesseract 3
imagemagick 6.6.2.6-1ubuntu1.1
Running tesseract from the terminal I'm able to OCR a Tiff file without any problems, but when I upload the same document on OKM the document does not get processed. This what I get on my log file:
tesseract 3
imagemagick 6.6.2.6-1ubuntu1.1
Running tesseract from the terminal I'm able to OCR a Tiff file without any problems, but when I upload the same document on OKM the document does not get processed. This what I get on my log file:
Code: Select all
The command convert -depth 8 -monochrome gets called first then tesseract gets issued for OCR, now testing the convert command from the terminal (also with the same tiff file) I get this error:2011-04-13 15:06:05,607 DEBUG [com.openkm.extractor.TiffTextExtractor] CMD: convert -depth 8 -monochrome /tmp/okm2222865511775852701.tif /tmp/okm8301523301139778998.tif
2011-04-13 15:06:05,797 DEBUG [com.openkm.extractor.TiffTextExtractor] CMD: convert -depth 8 -monochrome /tmp/okm2134408032814981524.tif /tmp/okm7437339512715843922.tif
2011-04-13 15:06:08,968 DEBUG [com.openkm.extractor.TiffTextExtractor] CMD: /usr/bin/tesseract /tmp/okm7437339512715843922.tif /tmp/okm998230814336069662
2011-04-13 15:06:08,969 DEBUG [com.openkm.extractor.TiffTextExtractor] CMD: /usr/bin/tesseract /tmp/okm8301523301139778998.tif /tmp/okm1392151765958597643
2011-04-13 15:06:09,216 DEBUG [com.openkm.extractor.TiffTextExtractor] TEXT:
2011-04-13 15:06:09,338 DEBUG [com.openkm.extractor.TiffTextExtractor] TEXT:
2011-04-13 15:06:09,362 INFO [org.apache.jackrabbit.core.query.lucene.MultiIndex] updating index with 1 nodes from indexing queue.
Code: Select all
any ideas as to what's causing this?? I've checked for any missing libraries but for what I can see everything looks in order.> /usr/bin/convert -depth 8 -monochrome /root/file.tif /tmp/filenew.tif
convert: Bits/sample must be 1 for Group 3/4 encoding/decoding. `/tmp/filenew.tif' @ warning/tiff.c/TIFFErrors/494.