ollama-intel-gpu
ollama-intel-gpu
Docker app from SpaceInvaderOne's Repository
Overview
Requirements
Runtime arguments
- Web UI
http://[IP]:[PORT:11434]/- Network
bridge- Shell
bash- Privileged
- false
- Extra Params
--device=/dev/dri
Template configuration
Path on the host for persistent model storage. Models are large (4-20 GB each).
- Target
- /root/.ollama
- Default
- /mnt/user/appdata/ollama-intel-gpu
- Value
- /mnt/user/appdata/ollama-intel-gpu
Port for the Ollama API.
- Target
- 11434
- Default
- 11434
- Value
- 11434
Allowed origins for CORS. Set to * to allow Open WebUI and other frontends to connect.
- Default
- *
- Value
- *
Select which Intel GPU to use. Use level_zero:0 for the first GPU. Change only if you have multiple Intel GPUs.
- Default
- level_zero:0
- Value
- level_zero:0
Number of parallel inference requests. Set to 1 for 12 GB VRAM cards (B580). Increase only if you have more VRAM.
- Default
- 1
- Value
- 1
Context window size in tokens. Larger values use more VRAM. Default 4096 is a good balance for 12 GB cards.
- Default
- 4096
- Value
- 4096
How long to keep a model loaded in VRAM after the last request. Use 5m for 5 minutes, -1 for forever, 0 to unload immediately.
- Default
- 5m
- Value
- 5m
Categories
Download Statistics
Details
spaceinvaderone/ollama-intel-gpuRun ollama-intel-gpu on Unraid.
ollama-intel-gpu is listed in Community Apps for Unraid OS. Explore Unraid to build a flexible home server, NAS, or homelab.