ocrmypdf-auto

ocrmypdf-auto

Docker 应用程序 from cmccambridge's Repository

概述

[p]This container monitors an input file directory for PDF documents to process, and automatically invokes [code][strong]OCRmyPDF[/strong][/code] on each file.[/p] [p]It uses [code]inotify[/code] to monitor the input directory efficiently, and is fairly configurable.[/p] [h4]Configuration Details[/h4] [p]See the descriptions of the Unraid volumes and environment variables for highlights of the configurability of [code]ocrmypdf-auto[/code], but for details including how to specify custom commandline parameters to [code]ocrmydf[/code] itself, or custom [code]tesseract[/code] configuration files, see the full README at https://github.com/cmccambridge/ocrmypdf-auto/blob/master/README.md[/p]

运行时参数

网络
bridge
特权
false

模板配置

Input DirectoryPathrw

Input directory from which to process files for OCR. emContainer path: code/input/code/em

目标
/input
Output DirectoryPathrw

Output directory to which post-OCR files will be written. emContainer path: code/output/code/em

目标
/output
Config DirectoryPathrw

Config/appdata directory. emContainer path: code/config/code/em

目标
/config
默认值
/mnt/user/appdata/ocrmypdf-auto
价值
/mnt/user/appdata/ocrmypdf-auto
Output ModeVariable

Controls the output directory layout: br / codeMIRROR_TREE/code - (Default) Mirror the directory structure of the input directory, i.e. for an input file code/input/foo/bar.pdf/code create an output file code/output/foo/bar.pdf/code. br / codeSINGLE_FOLDER/code - Collect all output files in a single flat folder, i.e. for an input file code/input/foo/bar.pdf/code create an output file code/output/bar.pdf/code.

目标
OCR_OUTPUT_MODE
默认值
MIRROR_TREE
价值
MIRROR_TREE
Action On SuccessVariable

Controls the action (if any) to perform after successful OCR processing: br / codeNOTHING/code - (Default) Do nothing. Input files remain in place where they were found. br / codeARCHIVE_INPUT_FILES/code - Archive input files by strongmoving/strong them em(overwriting existing files!)/em to the code/archive/code Volume br / codeDELETE_INPUT_FILES/code - Delete the input file after successful processing.

目标
OCR_ACTION_ON_SUCCESS
默认值
NOTHING
价值
NOTHING
Additional LanguagesVariable

Additional languages (besides English) to install, given as a space-separated list of language abbreviations. All available languages can be found on the a href=https://packages.ubuntu.com/search?keywords=tesseract-ocr-&searchon=names&suite=bionic&section=allUbuntu site/a. Example for German, Chinese - Simplified, and Italian: codedeu chi-sim ita/code

目标
OCR_LANGUAGES
Notify URLVariable

On a successful completion, a POST will be made to the given URL, with a JSON payload of code{'pdf': '/output/doc.pdf', 'txt': '/output/doc.pdf.txt'} /code. The txt property will only be present if you add the code--sidecar /code option to the codeocr.config/code file. This could be used to kick off additional processing, like indexing of the content or notifications.

目标
OCR_NOTIFY_URL
Process Existing on StartupVariable

Set to code1/code to enable processing of any files in the input directory when the container is launched. br/ Set to code0/code (Default) or unset to ignore existing files until they are modified.

目标
OCR_PROCESS_EXISTING_ON_START
默认值
0
价值
0
VerbosityVariable

Control the verbosity of debug logging. Accepts python codelogging/code levels, e.g. codewarn/code (Default), codeinfo/code, codedebug/code, etc.

目标
OCR_VERBOSITY
UID OverrideVariable

Set the UID that the OCR tools will run as. unRAID standard is 99.

目标
USERMAP_UID
默认值
99
价值
99
GID OverrideVariable

Set the primary GID that the OCR tools will run with. unRAID standard is 100.

目标
USERMAP_GID
默认值
100
价值
100

下载统计数据

336,026
下载总数
660
本月
1,436
平均每月

长期下载总量

加载图表...

详细信息

存储库
cmccambridge/ocrmypdf-auto:latest
最后更新2021-02-04
初见2020-03-17

在Unraid 上运行 ocrmypdf-auto 。

ocrmypdf-auto 已被列入Unraid OS 的社区应用程序。探索Unraid ,构建灵活的家庭服务器、NAS 或家庭实验室。