Zone OCR in multipage PDF

Posted: Sat Mar 06, 2021 3:10 pm
by pavel.petruska
In the Professional version 6.4.49, zonal OCR for PDF works correctly if the document has one page. I need zonal OCR to work for multi-page PDFs as well.

My idea was to define a separate prototype for each page and link each one to different metadata items; I assume that several prototypes can match the same document. However, during the detection test, processing always reads only the first page.

What PDF settings or conversions can I use so that zone OCR processes all pages of a document?

Regards
PP

Re: Zone OCR in multipage PDF

Posted: Thu Mar 11, 2021 7:49 am
by jllort
Splitting pages, which seems to be your issue, is something we usually do with a small customization; it depends on the type of document. Sometimes we also integrate Chronoscan for this purpose, but without a more detailed description it is not easy to suggest what is best for you.

You can take a look here https://www.youtube.com/watch?v=jYkRItZsBSo

I suggest using our contact form https://www.openkm.com/en/contact.html and asking there about your problem.

Re: Zone OCR in multipage PDF

Posted: Tue Apr 13, 2021 10:03 am
by pavel.petruska
Thanks for your reply. You're right, this is not a scanning issue. We need to perform OCR on multi-page PDFs created by any text editor, PDF generator, or scanner (using a separate prototype per page).
Please confirm that this cannot be solved by standard configuration steps and that some "small customization" through support is needed.

Re: Zone OCR in multipage PDF

Posted: Sun Apr 18, 2021 6:29 am
by jllort
This behaviour requires a small customization (we have done this for other customers) or the use of Chronoscan. It all depends on the type of documents. If you are a customer, you should ask this question through the official support website, and there we will ask you to share the documents so that we can analyze them and suggest several options for processing them. Without taking a look at the documents, I cannot suggest which option is better.
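
For readers following the thread, the per-page idea discussed above (one set of zones per page, each zone feeding a metadata item) can be pictured roughly as below. This is only a sketch under stated assumptions: `Zone`, `zones_by_page`, `extract_metadata`, and the `ocr` callable are hypothetical names invented for illustration, not OpenKM or Chronoscan APIs, and the actual customization mentioned by support may work quite differently.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass(frozen=True)
class Zone:
    """A rectangular OCR zone on one page (hypothetical, not an OpenKM type)."""
    field: str   # metadata item that receives the recognized text
    page: int    # 1-based page number the zone belongs to
    x: int
    y: int
    width: int
    height: int


# Assumed signature for the OCR step: (page_number, zone) -> recognized text.
# A real implementation would render the page and run OCR on the zone rectangle.
OcrFn = Callable[[int, Zone], str]


def zones_by_page(zones: List[Zone]) -> Dict[int, List[Zone]]:
    """Group zones by the page they apply to."""
    grouped: Dict[int, List[Zone]] = {}
    for zone in zones:
        grouped.setdefault(zone.page, []).append(zone)
    return grouped


def extract_metadata(page_count: int, zones: List[Zone], ocr: OcrFn) -> Dict[str, str]:
    """Visit every page, not just the first, and OCR only that page's zones."""
    grouped = zones_by_page(zones)
    metadata: Dict[str, str] = {}
    for page in range(1, page_count + 1):
        for zone in grouped.get(page, []):
            metadata[zone.field] = ocr(page, zone).strip()
    return metadata
```

The point of the sketch is only that zones carry a page number and the loop covers all pages; zones defined for pages the document does not have are simply skipped.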