How to set an alternative text extractor?
Posted: Mon Sep 19, 2022 9:49 am
I was wondering why my searches return zero results.
When I check the text extraction in the utilities, I see that I am using "com.openkm.extractor.AbbyTextExtractor" and that no keywords are extracted. I guess it is somehow related to ABBYY FineReader.
There is nothing related to ABBYY installed on that machine, and I do not know why the Abby extractor is configured. I read that there are alternative extractors, but I don't see how to configure them correctly.
In general, I do not need OCR, as all PDFs have already been OCRed by ABBYY FineReader on another machine.
Of course, text extraction from other file types (Word, Excel) would be interesting.