This script is for Slackware 14.0 only and may be outdated.

SlackBuilds Repository

tesseract (3.01)

Tesseract is a commercial-quality OCR engine originally developed at HP
between 1985 and 1995. In 1995, it ranked among the top three engines
evaluated by UNLV. It was open-sourced by HP and UNLV in 2005.

Tesseract cannot do anything useful without at least one language pack,
and the language pack tarball must be present in the same directory as
the SlackBuild script when the package is created. See
http://code.google.com/p/tesseract-ocr/downloads/list for a list of all
available language packs. You can install more than one (or even all) of
the language packs, as they do not conflict with each other. The build
script defaults to English, but this is easily changed by passing an
alternate value for the LANGNAM variable on the command line.

Here is the relevant code from the build script:
  # Language pack(s) to use
  # We'll install English by default, but you can pass another one.
  # Edit the LANGNAM variable to switch to another language
  # Please use full package name on that variable (including the extension)
  # see https://code.google.com/p/tesseract-ocr/downloads/list for the list
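
The comments above name the LANGNAM variable but do not show how it is set.
The following is a minimal sketch of one common way a SlackBuild handles such
a variable, assuming a shell default expansion; it is not copied from the
actual tesseract.SlackBuild, and the helper langpack_present is hypothetical.

```shell
#!/bin/sh
# Sketch only: the name LANGNAM comes from the build script comments;
# the default-expansion pattern below is an assumption.

# Default to the English pack listed under "Source Downloads" unless the
# caller already set LANGNAM in the environment.
LANGNAM=${LANGNAM:-tesseract-ocr-3.01.eng.tar.gz}

# Hypothetical helper: succeed only if the chosen tarball sits in
# directory $1 (the directory that holds the SlackBuild script).
langpack_present() {
  [ -f "$1/$LANGNAM" ]
}

if langpack_present "$(pwd)"; then
  echo "Using language pack: $LANGNAM"
else
  echo "Missing $LANGNAM in $(pwd)" >&2
fi

# Possible invocation with another pack (the file name is illustrative;
# use the full name of the tarball you actually downloaded):
#   LANGNAM=tesseract-ocr-3.01.deu.tar.gz sh tesseract.SlackBuild
```

With this pattern the full tarball name, extension included, is passed in
LANGNAM, which matches the instruction in the build script comments.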

This requires: leptonica

Maintained by: Pierre Cazenave
Keywords: optical character recognition,ocr,language,google,scan,tiff,office,document,scanner
ChangeLog: tesseract

Homepage:
http://code.google.com/p/tesseract-ocr/

Source Downloads:
tesseract-3.01.tar.gz (1ba496e51a42358fb9d3ffe781b2d20a)
tesseract-ocr-3.01.eng.tar.gz (89c139a73e0e7b1225809fc7b226b6c9)
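
The values in parentheses are MD5 checksums. As a rough sketch, the downloaded
tarballs could be checked against them as follows, assuming md5sum from
coreutils is available and the files are in the current directory; the
function name verify_md5 is made up for this example.

```shell
#!/bin/sh
# Compare a file's MD5 digest with an expected value.
# $1 = file name, $2 = expected MD5 hex digest
verify_md5() {
  actual=$(md5sum "$1" | cut -d' ' -f1)
  [ "$actual" = "$2" ]
}

# File names and checksums copied from the "Source Downloads" list.
for entry in \
  "tesseract-3.01.tar.gz 1ba496e51a42358fb9d3ffe781b2d20a" \
  "tesseract-ocr-3.01.eng.tar.gz 89c139a73e0e7b1225809fc7b226b6c9"
do
  set -- $entry            # split into $1 (file) and $2 (checksum)
  if [ ! -f "$1" ]; then
    echo "$1: not found"
  elif verify_md5 "$1" "$2"; then
    echo "$1: OK"
  else
    echo "$1: MD5 MISMATCH"
  fi
done
```

A mismatch usually means an incomplete or altered download, so the file
should be fetched again before building.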

Download SlackBuild:
tesseract.tar.gz
tesseract.tar.gz.asc

(the SlackBuild does not include the source)

Individual Files:
• README
• slack-desc
• svutil.cpp-include_stdio_h.diff
• tesseract.SlackBuild
• tesseract.info

Validated for Slackware 14.0

See our HOWTO for instructions on how to use the contents of this repository.
