ocrmypdf-auto

Docker 应用程序 from cmccambridge's Repository

概述

[p]This container monitors an input file directory for PDF documents to process, and automatically invokes [code][strong]OCRmyPDF[/strong][/code] on each file.[/p] [p]It uses [code]inotify[/code] to monitor the input directory efficiently, and is fairly configurable.[/p] [h4]Configuration Details[/h4] [p]See the descriptions of the Unraid volumes and environment variables for highlights of the configurability of [code]ocrmypdf-auto[/code], but for details including how to specify custom commandline parameters to [code]ocrmydf[/code] itself, or custom [code]tesseract[/code] configuration files, see the full README at https://github.com/cmccambridge/ocrmypdf-auto/blob/master/README.md[/p]

运行时参数

网络: bridge
特权: false

模板配置

Input DirectoryPathrw

Input directory from which to process files for OCR. emContainer path: code/input/code/em

目标: /input

Output DirectoryPathrw

Output directory to which post-OCR files will be written. emContainer path: code/output/code/em

目标: /output

Config DirectoryPathrw

Config/appdata directory. emContainer path: code/config/code/em

目标: /config
默认值: /mnt/user/appdata/ocrmypdf-auto
价值: /mnt/user/appdata/ocrmypdf-auto

Output ModeVariable

Controls the output directory layout: br / codeMIRROR_TREE/code - (Default) Mirror the directory structure of the input directory, i.e. for an input file code/input/foo/bar.pdf/code create an output file code/output/foo/bar.pdf/code. br / codeSINGLE_FOLDER/code - Collect all output files in a single flat folder, i.e. for an input file code/input/foo/bar.pdf/code create an output file code/output/bar.pdf/code.

目标: OCR_OUTPUT_MODE
默认值: MIRROR_TREE
价值: MIRROR_TREE

Action On SuccessVariable

Controls the action (if any) to perform after successful OCR processing: br / codeNOTHING/code - (Default) Do nothing. Input files remain in place where they were found. br / codeARCHIVE_INPUT_FILES/code - Archive input files by strongmoving/strong them em(overwriting existing files!)/em to the code/archive/code Volume br / codeDELETE_INPUT_FILES/code - Delete the input file after successful processing.

目标: OCR_ACTION_ON_SUCCESS
默认值: NOTHING
价值: NOTHING

Additional LanguagesVariable

Additional languages (besides English) to install, given as a space-separated list of language abbreviations. All available languages can be found on the a href=https://packages.ubuntu.com/search?keywords=tesseract-ocr-&searchon=names&suite=bionic&section=allUbuntu site/a. Example for German, Chinese - Simplified, and Italian: codedeu chi-sim ita/code

目标: OCR_LANGUAGES

Notify URLVariable

On a successful completion, a POST will be made to the given URL, with a JSON payload of code{'pdf': '/output/doc.pdf', 'txt': '/output/doc.pdf.txt'} /code. The txt property will only be present if you add the code--sidecar /code option to the codeocr.config/code file. This could be used to kick off additional processing, like indexing of the content or notifications.

目标: OCR_NOTIFY_URL

Process Existing on StartupVariable

Set to code1/code to enable processing of any files in the input directory when the container is launched. br/ Set to code0/code (Default) or unset to ignore existing files until they are modified.

目标: OCR_PROCESS_EXISTING_ON_START
默认值: 0
价值: 0

VerbosityVariable

Control the verbosity of debug logging. Accepts python codelogging/code levels, e.g. codewarn/code (Default), codeinfo/code, codedebug/code, etc.

目标: OCR_VERBOSITY

UID OverrideVariable

Set the UID that the OCR tools will run as. unRAID standard is 99.

目标: USERMAP_UID
默认值: 99
价值: 99

GID OverrideVariable

Set the primary GID that the OCR tools will run with. unRAID standard is 100.

目标: USERMAP_GID
默认值: 100
价值: 100

类别

Productivity

下载统计数据

336,026

下载总数

660

本月

1,436

平均每月

长期下载总量

加载图表...

链接

模板支持 Docker Hub Pro连接

详细信息

存储库

cmccambridge/ocrmypdf-auto:latest

登记处

https://hub.docker.com/r/cmccambridge/ocrmypdf-auto

最后更新2021-02-04

初见2020-03-17

在Unraid 上运行 ocrmypdf-auto 。

ocrmypdf-auto 已被列入Unraid OS 的社区应用程序。探索Unraid ，构建灵活的家庭服务器、NAS 或家庭实验室。

探索Unraid OS