llama.cpp
llama.cpp
Docker app from grtgbln's Repository
Overview
Inference of Meta's LLaMA model (and others) in pure C/C++
Requirements
The image for this container is several gigabytes. If you receive a "no space left on device" warning during installation, please increase the vDisk size in your Docker settings.
This container expects a "model.gguf" file to be present in the model storage path.
If you are using an Nvidia GPU, add "--gpus all" to the Extra Parameters field under Advanced.
Runtime arguments
- Web UI
http://[IP]:[PORT:8000]/- Network
bridge- Privileged
- false
Template configuration
WebUIPorttcp
Container Port: 8000
- Target
- 8000
- Default
- 8000
- Value
- 8000
Model Storage PathPathrw
Storage for model
- Target
- /models
- Default
- /mnt/user/appdata/llama_cpp/model
- Value
- /mnt/user/appdata/llama_cpp/model
Categories
Details
Repository
ghcr.io/ggml-org/llama.cpp:fullLast Updated2026-06-01
First Seen2026-02-24
Run llama.cpp on Unraid.
llama.cpp is listed in Community Apps for Unraid OS. Explore Unraid to build a flexible home server, NAS, or homelab.