Dear all,
I have been importing bulks of documents into OpenKM (6.3.2 CE).
From time to time, some PDF files could not be OCRed (and therefore no indexing of its contents) (I know that is the problem of tesseract or imagick but not OpenKM).
Therefore, such files are stuck at the head of the queue in text extraction and preventing other files to be properly processed.
Is there anyway to prevent such problematic documents from text extraction but keeping them in the system ?
Further, anyone has any tips to convert such documents to be "text-extractable" ?
Thanks in advance.
Regards
Alex
I have been importing bulks of documents into OpenKM (6.3.2 CE).
From time to time, some PDF files could not be OCRed (and therefore no indexing of its contents) (I know that is the problem of tesseract or imagick but not OpenKM).
Therefore, such files are stuck at the head of the queue in text extraction and preventing other files to be properly processed.
Is there anyway to prevent such problematic documents from text extraction but keeping them in the system ?
Further, anyone has any tips to convert such documents to be "text-extractable" ?
Thanks in advance.
Regards
Alex