• Automatic file name with OCR when scanning

  • OpenKM has many interesting features, but requires some configuration process to show its full potential.
OpenKM has many interesting features, but requires some configuration process to show its full potential.
Forum rules: Please, before asking something see the documentation wiki or use the search feature of the forum. And remember we don't have a crystal ball or mental readers, so if you post about an issue tell us which OpenKM are you using and also the browser and operating system version. For more info read How to Report Bugs Effectively.
 #16774  by fj.leon
 
Hello, we are new at openkm and want to use it in our company. We want the scanner tool inside openkm to NOT request the file name before scanning, rather the document should be scanned and uploaded and with OCR it should analize a specific region in the document and use the detected text as the file name.

My boss asked for this and i think it's a long shot, i don't expect it to work but i must ask before giving him the answer.
The reason for this is to avoid user error when entering the file name, since we must rely on it for the company procedures.
 #16778  by jllort
 
Do not exists any Free OCR engine which you can use to scan zones, basically it's the problem. The only aproximation could be that in all document could be some tags, for example filename_start doc_name filename_end and that could be parsed before ocr engine, but forget in your mind any free solution with zone scanning.

You could try with this open source http://code.google.com/p/ocropus/ based on tesseract , but I think do not working with zones. If you found some one tell us well be pleased shared this kind of information with community.

About Us

OpenKM is part of the management software. A management software is a program that facilitates the accomplishment of administrative tasks. OpenKM is a document management system that allows you to manage business content and workflow in a more efficient way. Document managers guarantee data protection by establishing information security for business content.