Starting with OpenKM 5.1.3, you can see what text was extracted from a document: go to Administration and open Repository View. You can also check which documents had problems during text extraction by running this Hibernate query:
from Activity where action='MISC_TEXT_EXTRACTION_FAILURE'
Run the query from Administration / Database Query.