Optical Character Recognition With Tesseract OCR On Ubuntu 7.04

Posted by suvi under Desktop, Ubuntu

This document describes how to set up Tesseract OCR on Ubuntu 7.04.
OCR means "Optical Character Recognition". The resulting system will be
able to convert images with embedded text to text files. Tesseract is
licensed under the Apache License v2.0.
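
To give a rough idea of what "converting an image to a text file" looks like in practice, here is a minimal sketch that calls the tesseract command line tool from Python. It assumes Tesseract is already installed and on the PATH, and that it is invoked as "tesseract <image> <output base>", writing its result to "<output base>.txt". The helper names and the file name scan.tif are made up for illustration; they do not come from the howto.

```python
import subprocess
from pathlib import Path


def build_tesseract_command(image_path, output_base):
    """Return the argument list for: tesseract <image> <output base>."""
    return ["tesseract", str(image_path), str(output_base)]


def ocr_image(image_path):
    """Run Tesseract on one image and return the recognized text.

    Assumes the tesseract binary is installed and on the PATH.
    """
    image = Path(image_path)
    # Drop the image extension; Tesseract appends ".txt" to the output base.
    output_base = image.with_suffix("")
    subprocess.run(build_tesseract_command(image, output_base), check=True)
    return Path(str(output_base) + ".txt").read_text(encoding="utf-8")


if __name__ == "__main__":
    # "scan.tif" is a hypothetical input file used only for illustration.
    print(ocr_image("scan.tif"))
```

Building the command in its own function keeps the call easy to inspect before anything is run; recognition quality will depend on the image and on the installed Tesseract version.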

This howto is meant as a practical guide; it does not cover the theoretical
background, which is treated in many other documents on the web.

This document comes without warranty of any kind! This is not the only way
of setting up such a system. There are many ways of achieving this goal, but
this is the way I take. I do not issue any guarantee that this will work for you!

Read more at HowtoForge 

 
