BLOG


BLOG
About Matthias Nott
SAP
Picture Galleries
Code
Technical





Locations of visitors to this page
01.07.10 18:00 Age: 65 days

New OCR Wrapper published on Sourceforge


I've shared a small project I've recently written on sourceforge: pdfocrwrapper.sourceforge.net This tool serves as a wrapper around the excellent ABBYY OCR engine which also exists as a Linux variant. The wrapper recursively iterates a directory structure and submits all PDF that it finds (and that it has not yet worked on) to ABBYY. The wrapper is of course flexible enough to work with other engines.

With this wrapper, you can keep scanning all your documents into PDFs. The wrapper will run, asynchronously, over those PDFs and apply the OCR engine to them. As a result, you'll be able to index and search the content of these files.

 


25.06.2010, 11:09 Copyright © 2005 MN Soft Industry Software, 8a, Chemin de la Tarpa, CH-1872 Troistorrents-Chenarlier
Tel.: +41 797 844554; Fax: +41 860 797 844554, Responsible: Matthias Nott., mn(at)mnsoft.org
Top of Page