I have installed the Community version 6.3.11. Office documents are indexed correctly and full-text search works; the only problem is with PDF files. When I export a Word document that was indexed successfully to PDF and upload it, it tells me:
2022-06-20 12:50:08,723 [Thread-22] WARN c.o.extractor.CuneiformTextExtractor - Undefined OCR application
2022-06-20 12:50:08,724 [Thread-22] WARN com.openkm.dao.NodeDocumentDAO - There was a problem extracting text from '/okm:trash/okmAdmin/Prozessliste_2017.pdf': Too few text extracted
What does "Undefined OCR application" mean? Is there no OCR engine included in the bundle? I read that Tesseract has to be installed manually; is that correct?
Thank you in advance