Speaches

Speaches

Docker app from grtgbln's Repository

Overview

speaches is an OpenAI API-compatible server supporting streaming transcription, translation, and speech generation. Speach-to-Text is powered by faster-whisper and for Text-to-Speech piper and Kokoro are used. This project aims to be Ollama, but for TTS/STT models.
**Nvidia GPU Use:** Using the Unraid Nvidia Plugin to install a version of Unraid with the Nvidia Drivers installed and add **--runtime=nvidia --gpus=all** to "extra parameters" (switch on advanced view)

Runtime arguments

Web UI
http://[IP]:[PORT:8000]/
Network
bridge
Privileged
false

Template configuration

Web UI PortPorttcp

Container Port: 8000

Target
8000
Default
8000
Value
8000
Cache DirectoryPath

Path to the cache directory. This is where models will be stored, which can be quite large.

Target
/home/ubuntu/.cache/huggingface/hub
Default
/mnt/user/appdata/speaches/cache
Value
/mnt/user/appdata/speaches/cache

Details

Repository
ghcr.io/speaches-ai/speaches:latest-cpu
Last Updated2026-06-01
First Seen2025-04-16

Run Speaches on Unraid.

Speaches is listed in Community Apps for Unraid OS. Explore Unraid to build a flexible home server, NAS, or homelab.