Research/OCR

From Publication Station
Revision as of 13:45, 4 December 2015 by Andre (talk | contribs) (Created page with "OCR (optical character recognition) =tesseract= https://code.google.com/p/tesseract-ocr/ Teassearct is OCR software. It was HP Labs between 1985 and 1995 currently is devel...")
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

OCR (optical character recognition)

tesseract

https://code.google.com/p/tesseract-ocr/

Teassearct is OCR software. It was HP Labs between 1985 and 1995 currently is developed by Google.

install

Debian:

aptitude install tesseract-ocr

Mac:

using homebrew need to run the commands:

brew install leptonica --with-libtiff
brew install tesseract --all-languages

https://gist.github.com/henrik/1967035