Hi
Using version OKM 5.1.7
when i'm uploading PDF or DOC type files by netdrive i'm getting the following message:
(for the doc type ofcourse the doctext variant)
I really want to keep at least the extraction for PDF and DOC files (other type are disabled on the configuration tab)
This error don't appear when i'm uploading documents by the "add document" button on the OpenKM desktop
Using version OKM 5.1.7
when i'm uploading PDF or DOC type files by netdrive i'm getting the following message:
(for the doc type ofcourse the doctext variant)
Code: Select all
Ant idea of the root cause?17:16:14,841 WARN [PdfTextExtractor] Failed to extract PDF text content
java.io.IOException: Error: End-of-File, expected line
at org.apache.pdfbox.pdfparser.BaseParser.readLine(BaseParser.java:1176)
at org.apache.pdfbox.pdfparser.PDFParser.parseHeader(PDFParser.java:290)
at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:162)
at com.openkm.extractor.PdfTextExtractor.extractText(PdfTextExtractor.java:63)
at org.apache.jackrabbit.extractor.CompositeTextExtractor.extractText(CompositeTextExtractor.java:90)
at org.apache.jackrabbit.core.query.lucene.JackrabbitTextExtractor.extractText(JackrabbitTextExtractor.java:195)
at com.openkm.extractor.RegisteredExtractors.getText(RegisteredExtractors.java:75)
at com.openkm.extractor.RegisteredExtractors.index(RegisteredExtractors.java:117)
at com.openkm.webdav.DefaultHandler.importData(DefaultHandler.java:362)
at com.openkm.webdav.DefaultHandler.importContent(DefaultHandler.java:269)
at com.openkm.webdav.DefaultHandler.importContent(DefaultHandler.java:334)
at org.apache.jackrabbit.server.io.IOManagerImpl.importContent(IOManagerImpl.java:105)
at org.apache.jackrabbit.webdav.simple.DavResourceImpl.addMember(DavResourceImpl.java:602)
at org.apache.jackrabbit.webdav.server.AbstractWebdavServlet.doPut(AbstractWebdavServlet.java:516)
at org.apache.jackrabbit.webdav.server.AbstractWebdavServlet.execute(AbstractWebdavServlet.java:244)
at org.apache.jackrabbit.webdav.server.AbstractWebdavServlet.service(AbstractWebdavServlet.java:196)
at com.openkm.servlet.WebdavServlet.service(WebdavServlet.java:80)
at javax.servlet.http.HttpServlet.service(HttpServlet.java:803)
at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:290)
at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:206)
at org.jboss.web.tomcat.filters.ReplyHeaderFilter.doFilter(ReplyHeaderFilter.java:96)
at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:235)
at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:206)
at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:230)
at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:175)
at org.jboss.web.tomcat.security.SecurityAssociationValve.invoke(SecurityAssociationValve.java:182)
at org.apache.catalina.authenticator.AuthenticatorBase.invoke(AuthenticatorBase.java:432)
at org.jboss.web.tomcat.security.JaccContextValve.invoke(JaccContextValve.java:84)
at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:127)
at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:102)
at org.jboss.web.tomcat.service.jca.CachedConnectionValve.invoke(CachedConnectionValve.java:157)
at org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:109)
at org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:262)
at org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:844)
at org.apache.coyote.http11.Http11Protocol$Http11ConnectionHandler.process(Http11Protocol.java:583)
at org.apache.tomcat.util.net.JIoEndpoint$Worker.run(JIoEndpoint.java:446)
at java.lang.Thread.run(Thread.java:636)
17:16:14,866 WARN [RegisteredExtractors] There was a problem extracting text from '/okm:personal/jan/change default website.pdf'
17:16:18,586 ERROR [DavResourceImpl] Error while importing resource: java.io.IOException: okm:author
I really want to keep at least the extraction for PDF and DOC files (other type are disabled on the configuration tab)
This error don't appear when i'm uploading documents by the "add document" button on the OpenKM desktop