Webdav + netdrive

Posted: Sun Oct 23, 2011 3:35 pm
by JanB
Hi

Using version OKM 5.1.7

When I upload PDF or DOC files through NetDrive, I get the following message
(for DOC files it is, of course, the doc text variant):
Code:
17:16:14,841 WARN  [PdfTextExtractor] Failed to extract PDF text content
java.io.IOException: Error: End-of-File, expected line
        at org.apache.pdfbox.pdfparser.BaseParser.readLine(BaseParser.java:1176)
        at org.apache.pdfbox.pdfparser.PDFParser.parseHeader(PDFParser.java:290)
        at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:162)
        at com.openkm.extractor.PdfTextExtractor.extractText(PdfTextExtractor.java:63)
        at org.apache.jackrabbit.extractor.CompositeTextExtractor.extractText(CompositeTextExtractor.java:90)
        at org.apache.jackrabbit.core.query.lucene.JackrabbitTextExtractor.extractText(JackrabbitTextExtractor.java:195)
        at com.openkm.extractor.RegisteredExtractors.getText(RegisteredExtractors.java:75)
        at com.openkm.extractor.RegisteredExtractors.index(RegisteredExtractors.java:117)
        at com.openkm.webdav.DefaultHandler.importData(DefaultHandler.java:362)
        at com.openkm.webdav.DefaultHandler.importContent(DefaultHandler.java:269)
        at com.openkm.webdav.DefaultHandler.importContent(DefaultHandler.java:334)
        at org.apache.jackrabbit.server.io.IOManagerImpl.importContent(IOManagerImpl.java:105)
        at org.apache.jackrabbit.webdav.simple.DavResourceImpl.addMember(DavResourceImpl.java:602)
        at org.apache.jackrabbit.webdav.server.AbstractWebdavServlet.doPut(AbstractWebdavServlet.java:516)
        at org.apache.jackrabbit.webdav.server.AbstractWebdavServlet.execute(AbstractWebdavServlet.java:244)
        at org.apache.jackrabbit.webdav.server.AbstractWebdavServlet.service(AbstractWebdavServlet.java:196)
        at com.openkm.servlet.WebdavServlet.service(WebdavServlet.java:80)
        at javax.servlet.http.HttpServlet.service(HttpServlet.java:803)
        at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:290)
        at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:206)
        at org.jboss.web.tomcat.filters.ReplyHeaderFilter.doFilter(ReplyHeaderFilter.java:96)
        at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:235)
        at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:206)
        at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:230)
        at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:175)
        at org.jboss.web.tomcat.security.SecurityAssociationValve.invoke(SecurityAssociationValve.java:182)
        at org.apache.catalina.authenticator.AuthenticatorBase.invoke(AuthenticatorBase.java:432)
        at org.jboss.web.tomcat.security.JaccContextValve.invoke(JaccContextValve.java:84)
        at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:127)
        at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:102)
        at org.jboss.web.tomcat.service.jca.CachedConnectionValve.invoke(CachedConnectionValve.java:157)
        at org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:109)
        at org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:262)
        at org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:844)
        at org.apache.coyote.http11.Http11Protocol$Http11ConnectionHandler.process(Http11Protocol.java:583)
        at org.apache.tomcat.util.net.JIoEndpoint$Worker.run(JIoEndpoint.java:446)
        at java.lang.Thread.run(Thread.java:636)
17:16:14,866 WARN  [RegisteredExtractors] There was a problem extracting text from '/okm:personal/jan/change default website.pdf'
17:16:18,586 ERROR [DavResourceImpl] Error while importing resource: java.io.IOException: okm:author
Any idea of the root cause?
I really want to keep text extraction for at least PDF and DOC files (the other types are disabled on the configuration tab).
This error doesn't appear when I upload documents with the "add document" button on the OpenKM desktop.
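
A note on the trace above: the exception is thrown from PDFParser.parseHeader with "End-of-File, expected line", which is what one would expect if the extractor was handed an empty or truncated stream rather than a damaged PDF. One possible explanation, not confirmed anywhere in this thread, is that the WebDAV client first creates a zero-byte resource and sends the real content in a later request. A minimal Java sketch for checking whether a stream starts with a PDF header before passing it to an extractor could look like this; the class and method names are hypothetical and are not part of OpenKM or PDFBox:

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;

// Hypothetical helper for illustration only; not OpenKM or PDFBox code.
final class PdfHeaderCheck {
    // Every PDF file starts with the ASCII bytes "%PDF-".
    private static final byte[] PDF_MAGIC = "%PDF-".getBytes(StandardCharsets.US_ASCII);

    /**
     * Returns true if the stream begins with "%PDF-".
     * Consumes up to five bytes, so pass a fresh or resettable stream.
     */
    static boolean startsWithPdfHeader(InputStream in) throws IOException {
        for (byte expected : PDF_MAGIC) {
            int b = in.read(); // -1 when the stream is empty or ends early
            if (b != (expected & 0xFF)) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) throws IOException {
        byte[] empty = new byte[0];
        byte[] pdf = "%PDF-1.4\n".getBytes(StandardCharsets.US_ASCII);
        // An empty upload fails the check, a stream with a PDF header passes.
        System.out.println(startsWithPdfHeader(new ByteArrayInputStream(empty)));
        System.out.println(startsWithPdfHeader(new ByteArrayInputStream(pdf)));
    }
}
```

If a check like this returns false for the files arriving over WebDAV but true for the same files on disk, that would point at the upload path rather than at the documents themselves.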

Re: Webdav + netdrive

Posted: Mon Oct 24, 2011 8:18 am
by jllort
Are you uploading the same document through WebDAV as through the UI, or a different one? (I mean the very same document, not just the same file type.)

Re: Webdav + netdrive

Posted: Mon Oct 24, 2011 10:23 am
by JanB
Hi

I uploaded the same document twice (only the name was changed),
so the contents are identical.

As an initial load I uploaded approx. 100 PDF and 100 MS Word files through NetDrive.
All of the files produce the same kind of error.

Re: Webdav + netdrive

Posted: Thu Oct 27, 2011 9:22 am
by pavila
The WebDAV subsystem has been reworked in the latest OpenKM beta. You can grab a nightly build from http://integration.openkm.com/5.1.x/ .