Oct 08, 2013 at 02:26 PM

How to index image files (TIFF, JPEG, etc.) which have been OCRed

Hello,

We have two different kinds of image files:

  1. an image file (TIFF or JPEG) managed in SAP DVS as a KPro document, with no full-text file associated with it
  2. an image file with a full-text layer embedded in a PDF/A file

To index these files with TREX, I think we have to manage them as KPro documents so that the text layer can be indexed. In the first case, our procedure would be to OCR the original in the DVS document and attach the resulting text file to the original as an attribute of the document.

In the second case, we would manage the PDF/A as KPro content within KM.

Now my question: does anybody have experience indexing such documents with TREX?

I look forward to any helpful ideas.

Thanks in advance and

Best Regards

Uli