ollama
OfficialDocker app from joly0's Repository
Overview
Requirements
Nvidia-Driver plugin (nVidia Support)
Radeon-TOP plugin (AMD Support)
Runtime arguments
- Web UI
http://[IP]:[PORT:11434]/- Network
bridge- Shell
bash- Privileged
- false
Template configuration
- Target
- /root/.ollama
- Default
- /mnt/user/appdata/ollama
- Value
- /mnt/user/appdata/ollama
Port number where ollama listens on.
- Target
- 11434
- Default
- 11434
- Value
- 11434
IP and Port the server binds to. Set to 127.0.0.1:11434 for internal only access.
- Default
- 0.0.0.0:11434
- Value
- 0.0.0.0:11434
Comma-separated list of allowed CORS origins.
- Default
- *
- Value
- *
How long a model stays in VRAM, e.g. 60m or 24h (Set to -1 for infinite, 0 for none).
- Default
- 5m
- Value
- 5m
Timeout for stall detection during model loads.
- Default
- 5m
- Value
- 5m
Max number of parallel requests a single model can handle.
- Default
- 1
- Value
- 1
Default context window (tokens) if not specified by the model.
- Default
- 4096
- Value
- 4096
Quantization type for the K/V cache, e.g. f16, q8_0, q4_0.
- Default
- f16
- Value
- f16
The path where model weights and blobs are stored.
- Default
- /root/.ollama/models
- Value
- /root/.ollama/models
Maximum number of models loaded per GPU at once (Set to 0 for infinite).
- Default
- 0
- Value
- 0
Max requests that can wait in line when the server is busy.
- Default
- 512
- Value
- 512
Log detail level: 0 for INFO, 1 for DEBUG, 2 for TRACE.
- Default
- 0|1|2
Reserved VRAM (in bytes) to leave empty on each GPU.
- Default
- 0
- Value
- 0
Enables experimental Flash Attention optimizations.
- Default
- false|true
If true, always spreads model layers across all visible GPUs.
- Default
- false|true
Optimizes prompt caching when multiple users share a model.
- Default
- false|true
If true, does not delete unused model blobs on startup.
- Default
- false|true
Disables the readline history in the interactive CLI.
- Default
- false|true
Enables the experimental new Ollama engine.
- Default
- false|true
Enables experimental Vulkan hardware acceleration.
- Default
- false|true
Proxy for downloading models over HTTP.
Proxy for downloading models over HTTPS.
Comma-separate list of hosts/IPs that bypass the proxy.
Download Statistics
Total Downloads Over Time
Details
ollama/ollamaRun Ollama on Unraid.
Ollama is listed in Community Apps for Unraid OS. Explore Unraid to build a flexible home server, NAS, or homelab.