安装包及依赖包下载
    https://blog.csdn.net/zhaohuihua/article/details/109340779

    解压tesseract
    unzip tesseract-ocr.zip
    cd tesseract
    安装依赖包
    yum localinstall -y rpms/.rpm
    如果安装不成功,请看最后面的异常处理
    安装leptonica
    tar -zxvf leptonica-1.74.4.tar.gz
    cd leptonica-1.74.4
    ./configure
    make # 约2分钟
    make install
    cd ..
    安装pkg-config
    tar -zxvf pkg-config-0.29.2.tar.gz
    cd pkg-config-0.29.2
    ./configure —with-internal-glib
    make # 约1分钟
    make install
    cd ..
    安装tesseract
    unzip tesseract-4.1.1.zip
    cd tesseract-4.1.1
    ./autogen.sh
    ./configure —prefix=/opt/tesseract/ PKG_CONFIG_PATH=/usr/local/lib/pkgconfig
    make # 约5分钟
    make install
    ldconfig
    cd ..
    *检查安装是否成功

    /opt/tesseract/bin/tesseract —version
    输出 tesseract 4.1.1; leptonica-1.74.4; 之类的信息

    安装语言包
    cp -f langs/eng.traineddata /opt/tesseract/share/tessdata/
    cp -f langs/chi_sim.traineddata /opt/tesseract/share/tessdata/

    测试运行结果
    英文原图及识别效果
    /opt/tesseract/bin/tesseract test/test-001.png test-001
    vi test-001.txt

    image.png image.png

    中文原图及识别效果

    /opt/tesseract/bin/tesseract test/test-002.png test-002 -l chi_sim
    vi test-002.txt

    image.png
    image.png

    异常处理
    在另一台机器安装时,到安装依赖包那一步就出现异常了,折腾老半天。

    1. Error: Package: libstdc++-4.8.5-36.el7.i686 (@EL)
    2. Requires: libc.so.6(GLIBC_2.3)
    3. Removing: glibc-2.17-260.el7.i686 (@EL)
    4. libc.so.6(GLIBC_2.3)
    5. Updated By: glibc-2.17-307.el7.1.x86_64 (/glibc-2.17-307.el7.1.x86_64)
    6. Not found

    加上参数再试一次
    yum localinstall -y —setopt=protected_multilib=false —skip-broken rpms/*.rpm
    仍然失败,但是可以看到如下的提示信息:

    1. =================================================================
    2. Package Arch Version Size
    3. =================================================================
    4. Installing:
    5. autoconf noarch 2.69-11.el7 2.2 M
    6. autoconf-archive noarch 2017.03.21-1.el7 2.2 M
    7. libjpeg-turbo-devel x86_64 1.2.90-6.el7 314 k
    8. libpng-devel x86_64 2:1.5.13-7.el7_2 211 k
    9. libtiff-devel x86_64 4.0.3-27.el7_3 1.3 M
    10. Updating:
    11. libgcc x86_64 4.8.5-39.el7 175 k
    12. Skipped (dependency problems):
    13. automake noarch 1.13.4-3.el7 1.7 M
    14. cpp x86_64 4.8.5-39.el7 15 M
    15. gcc x86_64 4.8.5-39.el7 37 M
    16. glibc x86_64 2.17-307.el7.1 13 M
    17. perl-Test-Harness noarch 3.28-3.el7 593 k
    18. perl-Thread-Queue noarch 3.02-2.el7 27 k
    19. Transaction Summary
    20. =================================================================

    可以看到缺少的应该是Installing下面列出来的这些
    于是一个一个的安装一遍:
    yum localinstall -y —setopt=protected_multilib=false —skip-broken rpms/autoconf.rpm
    yum localinstall -y —setopt=protected_multilib=false —skip-broken rpms/libjpeg-turbo-devel
    .rpm
    yum localinstall -y —setopt=protected_multilib=false —skip-broken rpms/libpng-devel.rpm
    yum localinstall -y —setopt=protected_multilib=false —skip-broken rpms/libtiff-devel
    .rpm
    然后回到上面的步骤操作,进行到tesseract-4.1.1下面的./autogen.sh又报缺少依赖包libtoolize:
    yum localinstall -y —setopt=protected_multilib=false —skip-broken rpms/libtool.rpm
    此过程中会出现缺少依赖包,缺哪个装哪个:
    yum localinstall -y —setopt=protected_multilib=false —skip-broken rpms/perl-Test-Harness
    .rpm
    yum localinstall -y —setopt=protected_multilib=false —skip-broken rpms/automake*.rpm
    再执行autogen.sh就成功了!