• Integrate with OCR Server using web service

  • Do you want to create a native client or integrate with third party applications: webservices are the solution.
Do you want to create a native client or integrate with third party applications: webservices are the solution.
Forum rules: Please, before asking something see the documentation wiki or use the search feature of the forum. And remember we don't have a crystal ball or mental readers, so if you post about an issue tell us which OpenKM are you using and also the browser and operating system version. For more info read How to Report Bugs Effectively.
 #41523  by storm8
 
dear all,
i want to integrate OpenKM with Enterprise OCR server, the OCR server will installed in stand alone server and the only way to integrate will be throw web service. i saw in application configuration that can be integrate throw invoke the command it self. but i didn't find any information about integrating using web service.so i want to know is there is any one who integrate with OCR using web service ??
 #41549  by jllort
 
You should download portable dev environement ( we suggest install in VM with 2 cores and 4GB ram ) https://sourceforge.net/projects/openkmportabledev/

Then you must create a class like Tesseract3TextExtractor.java ( https://sourceforge.net/p/openkm/code/H ... actor.java ) and modify the method private String doOcr(File pdfImg) into the class PdfTextExtractor ( you will find it here https://sourceforge.net/p/openkm/code/H ... actor.java ).

Hope this information will be useful to you.
 #41552  by jllort
 
OK, If you success on it we can integrate in the OpenKM source code. Follow the steps I related before and you should get it. Take in mind that there's a crontab which processes the batch queue of pending document to be indexed. I will be useful for testing.

About Us

OpenKM is part of the management software. A management software is a program that facilitates the accomplishment of administrative tasks. OpenKM is a document management system that allows you to manage business content and workflow in a more efficient way. Document managers guarantee data protection by establishing information security for business content.