オープンソースのOCRソフト比較
(via OSnews)
結論だけ読んだ。
The good news is that there are solutions available on Linux right now which interpret documents at up to 99% accuracy. The bad news is that 99% is not 100%, and that anything other than a high quality 400-600 DPI scan of 12-14 point font drops off very quickly in accuracy. The combination of Tesseract and Ocropus is clearly the project we can most rely on to provide the missing elements of a full-featured Free OCR suite.
Linux OCR: A review of free optical character recognition software
というわけで Tesseract がやっぱりいいっぽいという話。