Paperless-GPT

Docker application from Morgoth's Repository

Overview

paperless-gpt pairs with paperless-ngx to generate AI-powered document titles and tags, saving you hours of manual sorting. While other tools may offer AI chat features, paperless-gpt stands out by using LLMs to improve OCR, aiming for high accuracy even with tricky scans. If you want better text extraction and effortless document organization, this is your solution.

Description of container variables: https://github.com/icereed/paperless-gpt?tab=readme-ov-file#configuration

Requirements

paperless-ngx
A local Ollama instance or an AI cloud service (OpenAI, Google Document AI, Azure Document Intelligence)

Runtime parameters

Web UI: http://[IP]:[PORT:8080]
Network: bridge
Shell: sh
Privileged: false

Template configuration

Web UI Port (Port, tcp)
Target: 8080
Default: 8080
Value: 8080

Prompt Templates (Path, rw)

Custom prompt templates: https://github.com/icereed/paperless-gpt?tab=readme-ov-file#custom-prompt-templates

Target: /app/prompts
Default: /mnt/user/appdata/paperless-gpt/prompts

PAPERLESS_BASE_URL (Variable)

URL of your paperless-ngx instance (e.g. http://paperless-ngx:8000).

Default: http://localhost:8000

PAPERLESS_API_TOKEN (Variable)

API token for paperless-ngx. Generate one in paperless-ngx admin.
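
Before starting the container, it can help to confirm that the base URL and token are accepted by paperless-ngx. The following is a minimal sketch, assuming the standard paperless-ngx REST endpoint /api/documents/ and its token authentication header; the function names are hypothetical helpers and not part of paperless-gpt.

```python
import urllib.request


def build_paperless_request(base_url: str, api_token: str) -> urllib.request.Request:
    """Build an authenticated GET request for the paperless-ngx documents API."""
    # Normalise the base URL so a trailing slash does not produce "//api/...".
    url = base_url.rstrip("/") + "/api/documents/"
    return urllib.request.Request(
        url,
        headers={
            # paperless-ngx expects the token in the form "Token <value>".
            "Authorization": f"Token {api_token}",
            "Accept": "application/json",
        },
    )


def check_connection(base_url: str, api_token: str, timeout: float = 10.0) -> int:
    """Send the request and return the HTTP status code (raises on HTTP errors)."""
    request = build_paperless_request(base_url, api_token)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.status
```

Calling check_connection with the values intended for PAPERLESS_BASE_URL and PAPERLESS_API_TOKEN should return 200 when the token is valid; an invalid token would be expected to raise urllib.error.HTTPError.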

PAPERLESS_PUBLIC_URL (Variable)

Public URL for Paperless (if different from PAPERLESS_BASE_URL).

MANUAL_TAG (Variable)

Tag for manual processing.

Default: paperless-gpt

AUTO_TAG (Variable)

Tag for auto processing.

Default: paperless-gpt-auto

LLM_PROVIDER (Variable)

AI backend (openai or ollama).

LLM_MODEL (Variable)

AI model name, e.g. gpt-4o, gpt-3.5-turbo, deepseek-r1:8b.

OPENAI_API_KEY (Variable)

OpenAI API key (required if using OpenAI).

OPENAI_BASE_URL (Variable)

OpenAI base URL (optional; set it when using a custom OpenAI-compatible service such as LiteLLM).

LLM_LANGUAGE (Variable)

Likely language for documents (e.g. English).

Default: English

OLLAMA_HOST (Variable)

Ollama server URL (e.g. http://host.docker.internal:11434).

Default: http://localhost:11434

OCR_PROVIDER (Variable)

OCR provider to use (llm, azure, or google_docai).

Default: llm

VISION_LLM_PROVIDER (Variable)

Vision LLM provider used for OCR (openai or ollama).

Default: openai

VISION_LLM_MODEL (Variable)

Vision LLM model used for OCR, e.g. minicpm-v (Ollama) or gpt-4o (OpenAI).

Default: gpt-4o

AZURE_DOCAI_ENDPOINT (Variable)

Azure Document Intelligence endpoint. Required if OCR_PROVIDER is azure.

AZURE_DOCAI_KEY (Variable)

Azure Document Intelligence API key. Required if OCR_PROVIDER is azure.

AZURE_DOCAI_MODEL_ID (Variable)

Azure Document Intelligence model ID. Optional if using azure provider.

Default: prebuilt-read

AZURE_DOCAI_TIMEOUT_SECONDS (Variable)

Azure Document Intelligence timeout in seconds.

Default: 120

GOOGLE_PROJECT_ID (Variable)

Google Cloud project ID. Required if OCR_PROVIDER is google_docai.

GOOGLE_LOCATION (Variable)

Google Cloud region (e.g. us, eu). Required if OCR_PROVIDER is google_docai.

GOOGLE_PROCESSOR_ID (Variable)

Document AI processor ID. Required if OCR_PROVIDER is google_docai.

GOOGLE_APPLICATION_CREDENTIALS (Variable)

Path to the mounted Google service account key. Required if OCR_PROVIDER is google_docai.
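
Several variables above are only required for a particular provider. The sketch below collects those stated rules into a small pre-flight check; it reflects only what this listing says ("required if ...") and is an assumption about how one might validate a configuration, not how paperless-gpt itself validates it. The function and constant names are hypothetical.

```python
from typing import Dict, List, Mapping

# Variables this listing marks as required for each OCR_PROVIDER value.
REQUIRED_BY_OCR_PROVIDER: Dict[str, List[str]] = {
    "llm": [],
    "azure": ["AZURE_DOCAI_ENDPOINT", "AZURE_DOCAI_KEY"],
    "google_docai": [
        "GOOGLE_PROJECT_ID",
        "GOOGLE_LOCATION",
        "GOOGLE_PROCESSOR_ID",
        "GOOGLE_APPLICATION_CREDENTIALS",
    ],
}


def missing_variables(env: Mapping[str, str]) -> List[str]:
    """Return the names of provider-specific variables that are unset or empty."""
    missing: List[str] = []

    # The listing states OPENAI_API_KEY is required when using OpenAI.
    if env.get("LLM_PROVIDER") == "openai" and not env.get("OPENAI_API_KEY"):
        missing.append("OPENAI_API_KEY")

    # OCR_PROVIDER defaults to "llm" according to the listing.
    ocr_provider = env.get("OCR_PROVIDER", "llm")
    if ocr_provider not in REQUIRED_BY_OCR_PROVIDER:
        raise ValueError(f"Unknown OCR_PROVIDER: {ocr_provider!r}")

    for name in REQUIRED_BY_OCR_PROVIDER[ocr_provider]:
        if not env.get(name):
            missing.append(name)
    return missing
```

Passing os.environ (or a plain dict of the template values) to missing_variables gives a list that should be empty before the container is started.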

AUTO_OCR_TAG (Variable)

Tag for automatically processing docs with OCR.

Default: paperless-gpt-ocr-auto

LOG_LEVEL (Variable)

Application log level (info, debug, warn, error).

Default: info

AUTO_GENERATE_TITLE (Variable)

Generate titles automatically if paperless-gpt-auto is used. Default: true.

Default: true

AUTO_GENERATE_TAGS (Variable)

Generate tags automatically if paperless-gpt-auto is used. Default: true.

Default: true

AUTO_GENERATE_CORRESPONDENTS (Variable)

Generate correspondents automatically if paperless-gpt-auto is used. Default: true.

Default: true

AUTO_GENERATE_CREATED_DATE (Variable)

Generate the created dates automatically if paperless-gpt-auto is used.

Default: true

OCR_LIMIT_PAGES (Variable)

Limit the number of pages for OCR. Set to 0 for no limit.

Default: 5

TOKEN_LIMIT (Variable)

Maximum tokens allowed for prompts/content. Set to 0 to disable limit. Useful for smaller LLMs.

CORRESPONDENT_BLACK_LIST (Variable)

A comma-separated list of names to exclude from the correspondents suggestions. Example: John Doe, Jane Smith.
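
Outside the Unraid template, the settings in this listing map roughly onto a docker run invocation. The sketch below assembles such a command from the port, path, network, and image values given in this listing; the container name is an assumption, and the environment values shown in the usage lines are placeholders.

```python
import shlex
from typing import Dict, List


def docker_run_command(env: Dict[str, str]) -> List[str]:
    """Assemble a docker run argument list from the template values in this listing."""
    command = [
        "docker", "run", "-d",
        "--name", "paperless-gpt",  # assumed container name, not taken from the listing
        "--network", "bridge",      # Network: bridge
        "-p", "8080:8080",          # Web UI Port: host 8080 -> container 8080 (tcp)
        # Prompt Templates path, mounted read-write
        "-v", "/mnt/user/appdata/paperless-gpt/prompts:/app/prompts:rw",
    ]
    # Each template variable becomes one -e NAME=value pair.
    for name, value in env.items():
        command += ["-e", f"{name}={value}"]
    # Image name as given under the listing's repository details.
    command.append("icereed/paperless-gpt:latest")
    return command


# Example usage with placeholder values; replace the token with a real one.
example_env = {
    "PAPERLESS_BASE_URL": "http://paperless-ngx:8000",
    "PAPERLESS_API_TOKEN": "your-token-here",
    "LLM_PROVIDER": "ollama",
    "LLM_MODEL": "deepseek-r1:8b",
    "OLLAMA_HOST": "http://host.docker.internal:11434",
}
print(shlex.join(docker_run_command(example_env)))
```

Building the command as a list and quoting it with shlex.join keeps values containing spaces (for example a CORRESPONDENT_BLACK_LIST entry such as "John Doe, Jane Smith") intact when the line is pasted into a shell.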

Download statistics

Total downloads: 738,682
This month: 146,515
Monthly average: 77,696

Details

Repository: icereed/paperless-gpt:latest
Last updated: 2026-03-29
First seen: 2025-04-23

Run Paperless-GPT on Unraid.

Paperless-GPT is listed in the Community Applications for Unraid OS. Explore Unraid to build a flexible home server, NAS, or home lab.