How to make a PDF Scan searchable?

OpenKM has many interesting features, but requires some configuration process to show its full potential.
Forum rules
Please, before asking something see the documentation wiki or use the search feature of the forum. And remember we don't have a crystal ball or mental readers, so if you post about an issue tell us which OpenKM are you using and also the browser and operating system version. For more info read How to Report Bugs Effectively.
Post Reply
pipen
Fresh Boarder
Fresh Boarder
Posts: 4
Joined: Wed Dec 05, 2018 10:05 am

How to make a PDF Scan searchable?

Post by pipen » Thu Jan 10, 2019 1:08 pm

Hello,
i scan a document with a Konica Minolta as PDF 300dpi. When i import the document to openkm, the PDF is not searchable. What can i do?
Christian

jllort
Moderator
Moderator
Posts: 10497
Joined: Fri Dec 21, 2007 11:23 am
Location: Sineu - ( Illes Balears ) - Spain
Contact:

Re: How to make a PDF Scan searchable?

Post by jllort » Fri Jan 11, 2019 5:51 pm

Might configure tesseract ocr engine and upload the document again or ( mark the document to be reindexed or index the whole repository again -> before it let's focus in text extraction ).

Follow the steps described here to install and configure ocr engine, https://docs.openkm.com/kcenter/view/ok ... ngine.html , then upload a file and wait until the document be processed ( the documents go into a queue what you can check from Administration > Stats > pending text extraction queue ).

Post Reply

Who is online

Users browsing this forum: No registered users and 5 guests