Aug
28
This document describes how to set up Tesseract OCR on Ubuntu 7.04.
OCR means "Optical Character Recognition". The resulting system will be
able to convert images with embedded text to text files. Tesseract is
licensed under the Apache
License v2.0.
This howto is meant as a practical guide; it does not cover the theoretical
backgrounds. They are treated in a lot of other documents in the web.
This document comes without warranty of any kind! I want to say that this is
not the only way of setting up such a system. There are many ways of achieving this goal but this is the way I take. I do
not issue any guarantee that this will work for you!
Leave a Reply