
Text extraction queue broken

Posted: Fri Aug 05, 2022 12:51 am
by scrumi
I have about 70,000 documents stuck in the text extraction queue. The logs show a continuous stream of variations on this:
Code:
2022-08-04 17:48:55,318 [pool-14908-thread-4] [] WARN  c.o.extractor.TextExtractorThread - /home/openkm/tomcat-8.5.34/repository/datastore/57/6f/91/4a/576f914a-b4a1-499a-9553-c5acf1a4064c (No such file or directory)
java.io.FileNotFoundException: /home/openkm/tomcat-8.5.34/repository/datastore/57/6f/91/4a/576f914a-b4a1-499a-9553-c5acf1a4064c (No such file or directory)
	at java.io.FileInputStream.open0(Native Method) ~[na:1.8.0_312]
	at java.io.FileInputStream.open(FileInputStream.java:195) ~[na:1.8.0_312]
	at java.io.FileInputStream.<init>(FileInputStream.java:138) ~[na:1.8.0_312]
	at com.openkm.module.db.stuff.FsDataStore.read(FsDataStore.java:65) ~[classes/:6.3.11]
	at com.openkm.dao.NodeDocumentDAO.textExtractorHelper(NodeDocumentDAO.java:1402) ~[classes/:6.3.11]
	at com.openkm.extractor.TextExtractorThread.run(TextExtractorThread.java:49) ~[classes/:6.3.11]
	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) [na:1.8.0_312]
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) [na:1.8.0_312]
	at java.lang.Thread.run(Thread.java:748) [na:1.8.0_312]

Re: Text extraction queue broken

Posted: Tue Aug 16, 2022 7:19 am
by jllort
1- Check whether the document exists in the datastore (it should).
2- Usually the problem is that OpenKM was started as the root user at some point, so the document was created with root as its owner. Later, when OpenKM runs as a service (which uses the openkm user), it does not have permission to access that document. To solve it, I suggest changing the owner of the whole tomcat folder:

chown -Rf openkm:openkm tomcat-xxxx
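
Before running the chown, it may help to confirm both points above. The following is only a rough sketch: the function names check_datastore_file and list_foreign_owned are made up for illustration and are not part of OpenKM. The first reports whether a given datastore file is present (and shows its owner if so); the second lists everything under a folder that is not owned by a given user.

```shell
#!/bin/sh
# Hypothetical helpers for illustration only; they are not OpenKM tools.

# Show the owner and permissions of a datastore file,
# or report that it is missing (and return a non-zero status).
check_datastore_file() {
    if [ -e "$1" ]; then
        ls -l "$1"
    else
        echo "missing: $1"
        return 1
    fi
}

# Print every file and directory under folder $1 that is NOT owned by user $2.
list_foreign_owned() {
    find "$1" ! -user "$2" -print
}
```

For example, with the path taken from the log in the first post, you could run check_datastore_file /home/openkm/tomcat-8.5.34/repository/datastore/57/6f/91/4a/576f914a-b4a1-499a-9553-c5acf1a4064c and then list_foreign_owned /home/openkm/tomcat-8.5.34 openkm. If the second command prints nothing, ownership is probably not the cause and the missing file itself needs a closer look.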