Unable to invoke tesseract OCR 3.x
Posted: Wed Jan 09, 2013 7:07 am
Hi,
I am working with OpenKM 6.2 on Windows.
I have downloaded Tesseract 3.01 for extracting text from images. It works fine from the command line.
I configured the system.ocr property as "D:\Venu\Tesseract Images\Tesseract-OCR\tesseract.exe ${fileIn} ${fileOut}"
But it's not working from OpenKM.
When I uploaded an image file, I found the log messages below:
2013-01-09 12:30:51,351 [Text Extractor Worker] INFO com.openkm.d.t - processSerial.
Working on {docUuid=2183568a-a1ab-4265-9ee3-933cf4ce8c99, docPath=/okm:root/Invoice/test_text.png, docVerUuid=d4635339-cbfe-4697-869d-1208dcfa9018, date=Wed Jan 09 12:28:45 IST 2013}
2013-01-09 12:30:51,356 [Text Extractor Worker] WARN com.openkm.dao.NodeDocumentDAO - There was a problem extracting text from '/okm:root/Invoice/test_text.png': Too few text extracted
If we want to use the Tesseract text extractor on images, what steps do we need to follow?
Can you also tell me the exact path of the files where the extracted text is written?
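To make the configured system.ocr value easier to reason about, here is a minimal Python sketch of how such a command template might expand. It is not OpenKM code. It assumes the ${fileIn} and ${fileOut} placeholders are substituted textually and that the resulting string is then split on whitespace before the process is launched; this post does not confirm either behaviour for OpenKM 6.2. The input and output paths are made up for illustration.

```python
from string import Template

# The system.ocr value quoted in the post (raw string keeps backslashes literal).
OCR_TEMPLATE = r"D:\Venu\Tesseract Images\Tesseract-OCR\tesseract.exe ${fileIn} ${fileOut}"


def build_command(template, file_in, file_out):
    """Fill the ${fileIn} and ${fileOut} placeholders in the template."""
    return Template(template).substitute(fileIn=file_in, fileOut=file_out)


def naive_tokens(command):
    """Split on whitespace, as a simple launcher might do (an assumption)."""
    return command.split()


# Hypothetical temporary paths, not taken from the post.
cmd = build_command(OCR_TEMPLATE, r"C:\tmp\in.png", r"C:\tmp\out")
tokens = naive_tokens(cmd)

print(cmd)
for i, tok in enumerate(tokens):
    print(i, tok)
```

Under that assumption, the space in "Tesseract Images" would break the executable path into two tokens, so the first token would no longer point at tesseract.exe. If that is what happens, a Tesseract install path without spaces would be one thing to try.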