• I installed Tesseract on my server, and I think I did everything correctly. From what I understand, it’s supposed to work like this — upload a document to the media library, and a post should show up with the OCR text. Is this correct? I’m not getting any kind of error, I’m simply not getting the post with the text. How do I test if it is working correctly? Am I doing something wrong? Thanks in advance!

Viewing 7 replies - 1 through 7 (of 7 total)
  • Plugin Author tattersoftware

    (@tattersoftware)

    Hi! Sorry for the delay, the notice went to my Spam.
    Your understanding is correct: you add an image to the Media Library and the plug-in will generate a new Post. If you are willing, could you please share the following (and let me know if you need help locating any of this):
    * operating system and version of the server
    * version and install location of Tesseract
    * the output of php -m (your installed PHP modules)
    * a screenshot or values of the plug-in configuration from WordPress

    That will help me troubleshoot your issue.
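
For anyone gathering the same details, here is a minimal Python sketch that collects the first three items in one pass. It is not part of the plugin. It assumes Python 3.7+ is available on the server and that `tesseract` and `php` are on the shell's PATH; the function names `run` and `collect_diagnostics` are made up for this example.

```python
import platform
import shutil
import subprocess


def run(cmd):
    """Run a command and return its combined output, or None if it cannot be started."""
    try:
        result = subprocess.run(cmd, capture_output=True, text=True, timeout=30)
    except (OSError, subprocess.TimeoutExpired):
        return None
    # Some Tesseract builds print version info to stderr, so keep both streams.
    return (result.stdout + result.stderr).strip()


def collect_diagnostics(tesseract="tesseract", php="php"):
    """Gather OS, Tesseract location/version and PHP modules into one dictionary."""
    return {
        "os": platform.platform(),
        "tesseract_path": shutil.which(tesseract),
        "tesseract_version": run([tesseract, "--version"]),
        "php_modules": run([php, "-m"]),
    }


if __name__ == "__main__":
    for key, value in collect_diagnostics().items():
        print(f"== {key} ==\n{value}\n")
```

A value of `None` means the command could not be started from the shell at all. The plugin configuration values still have to be copied from the WordPress admin screen by hand.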

    Thread Starter jbcreations

    (@jbcreations)

    Sorry for the delay in responding. It’s been quite a week!

    Here’s everything I think you asked for:

    My website is hosted on Dreamhost, and they are using Ubuntu 18.04.4 LTS.

    I’m using Tesseract 4.1.1
    installed at /home/paulandartie/.linuxbrew/bin/tesseract

    Here’s the output of php -m
    [PHP Modules]
    bcmath
    bz2
    calendar
    Core
    ctype
    curl
    date
    dom
    exif
    fileinfo
    filter
    ftp
    gd
    gettext
    hash
    iconv
    imagick
    imap
    intl
    json
    libxml
    mbstring
    memcached
    mysqli
    mysqlnd
    openssl
    pcntl
    pcre
    PDO
    pdo_mysql
    pdo_sqlite
    Phar
    posix
    pspell
    Reflection
    session
    SimpleXML
    soap
    sockets
    sodium
    SPL
    sqlite3
    standard
    tokenizer
    xml
    xmlreader
    xmlrpc
    xmlwriter
    xsl
    Zend OPcache
    zip
    zlib

    [Zend Modules]
    Zend OPcache

    and I can’t seem to get a screenshot to post of the plug-in configuration in WordPress, but it’s listed as:

    Absolute path to Tesseract: /home/paulandartie/.linuxbrew/bin/tesseract
    Resize percentage: 200
    Language string: eng

    Thanks!

    Plugin Author tattersoftware

    (@tattersoftware)

    That all looks correct. Unfortunately there aren’t any debugging tools built into the plugin. One thing you could do: on your server, browse to the WordPress upload directory (usually “wp-content/uploads/”) and find the folder for the day you test. We want to know if the temporary processing files are still in there: the resized image (“ocr_image.png”) or the scanned text (“ocr_text.txt”). If either of those is present, please check its content or share a copy here.
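
A minimal sketch of that check in Python, for servers where browsing the folders by hand is awkward. It only searches for the two file names quoted above. The default path `wp-content/uploads` and the function name `find_ocr_temp_files` are assumptions for this example, not something the plugin provides.

```python
from pathlib import Path

# The two temporary file names mentioned in the reply above.
TEMP_NAMES = {"ocr_image.png", "ocr_text.txt"}


def find_ocr_temp_files(uploads_dir):
    """Return leftover OCR temp files anywhere under uploads_dir, sorted by path."""
    root = Path(uploads_dir)
    return sorted(p for p in root.rglob("*") if p.is_file() and p.name in TEMP_NAMES)


if __name__ == "__main__":
    # Assumption: run from the WordPress root with the default uploads location.
    for path in find_ocr_temp_files("wp-content/uploads"):
        print(path, path.stat().st_size, "bytes")
```

An empty result on its own does not say which stage failed, only that no temporary file was left on disk when the script ran.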

    Thread Starter jbcreations

    (@jbcreations)

    I’m afraid I don’t see any files that look like that in the uploads folder. I guess my question is, what would that mean? They aren’t getting processed at all? Thanks.

    Plugin Author tattersoftware

    (@tattersoftware)

    Not necessarily. Those files are produced during the process and then removed at various stages. I am working on adding some debug logging that will hopefully help troubleshoot issues like this – stay tuned!

    Thread Starter jbcreations

    (@jbcreations)

    Any luck on this yet? Thanks!

    [ Please do not bump. ]

    Plugin Author tattersoftware

    (@tattersoftware)

    Still nothing ready to publish. This is my only WP plugin and was developed for a single client (who graciously agreed to let me open-source it), and it is working fine for his instance. Unfortunately this means changes like this compete with my need to complete work for paid clients, so I only work on this in between tasks. It may still be a while, but if you are able to do any additional debugging yourself, I am glad to receive the results, which might help discover a faster solution.
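
One form that additional debugging could take is running Tesseract by hand against a test image, outside WordPress. The sketch below uses the standard command-line form `tesseract IMAGE OUTPUTBASE -l LANG`. It is an assumption that this resembles what the plugin runs internally; the thread does not show the plugin's actual command. The image name `sample.png` and the helper names are hypothetical.

```python
import subprocess
from pathlib import Path


def build_command(tesseract, image, output_base, lang="eng"):
    """Build a standard Tesseract CLI call: tesseract IMAGE OUTPUTBASE -l LANG."""
    return [str(tesseract), str(image), str(output_base), "-l", lang]


def run_ocr(tesseract, image, output_base, lang="eng"):
    """Run Tesseract once; return (exit code, stderr, recognised text or None)."""
    cmd = build_command(tesseract, image, output_base, lang)
    try:
        result = subprocess.run(cmd, capture_output=True, text=True, timeout=120)
    except (OSError, subprocess.TimeoutExpired) as exc:
        # The binary is missing, not executable, or hung.
        return None, str(exc), None
    text_file = Path(f"{output_base}.txt")  # Tesseract appends .txt to the output base
    text = text_file.read_text(errors="replace") if text_file.exists() else None
    return result.returncode, result.stderr.strip(), text


if __name__ == "__main__":
    # Path and language are the values posted earlier in this thread;
    # sample.png is a hypothetical test image in the current directory.
    code, err, text = run_ocr(
        "/home/paulandartie/.linuxbrew/bin/tesseract", "sample.png", "ocr_test", "eng"
    )
    print("exit code:", code)
    print("stderr:", err)
    print("text:", text)
```

If this works from a shell login, Tesseract itself is fine. That still does not prove the web server's PHP process can run it, since that process may run under a different environment or have command execution restricted.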

  • The topic ‘Functionality broken?’ is closed to new replies.