Hi Reinaldo,
Not for me. I don't want to deal with those big, obsolete multi-page image file monsters. Even my standard viewer can't show me what is in those files.
On the other hand, I don't need an API. I have had good experiences with command-line tools such as Ghostscript, calibre (ebook-convert/ebook-meta), ImageMagick, NirCmd, SumatraPDF, DXFView, image2pdf, PDFtk, xpdfbin (pdftotext, pdfimages, ...). Oops, quite a lot; I wasn't aware of that.
So my recommendation is: for test purposes, split one of the affected TIFFs into single pages, OCR (with the API) each single TIFF, PNG or PPM/PBM, build one searchable PDF from them, and compare the results. Then decide on the next steps.
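The splitting step could look roughly like this. It is only a sketch, assuming ImageMagick 7 is installed and its `magick` executable is on the PATH (on ImageMagick 6 the executable is `convert`). The function names, the `page-%03d` naming pattern and the file names are made up for illustration. The OCR step is left out because it depends on the API you are using.

```python
import shutil
import subprocess
from pathlib import Path


def build_split_command(tiff_path, out_dir, fmt="png", exe="magick"):
    """Return the ImageMagick command that writes one image per TIFF page."""
    # Hypothetical naming scheme: page-000.png, page-001.png, ...
    out_pattern = Path(out_dir) / f"page-%03d.{fmt}"
    # +adjoin asks ImageMagick to write every page to its own file.
    return [exe, str(tiff_path), "+adjoin", str(out_pattern)]


def split_tiff(tiff_path, out_dir, fmt="png", exe="magick"):
    """Split a multi-page TIFF and return the sorted list of page files."""
    if shutil.which(exe) is None:
        raise RuntimeError(f"'{exe}' not found on PATH")
    Path(out_dir).mkdir(parents=True, exist_ok=True)
    subprocess.run(build_split_command(tiff_path, out_dir, fmt, exe), check=True)
    return sorted(Path(out_dir).glob(f"page-*.{fmt}"))
```

Calling `split_tiff("scan.tif", "pages")` (both names are placeholders) should leave one PNG per page in `pages`, which you can then feed to the OCR one by one.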
Good luck, Frank