All apps · 0 apps

whisper-asr-webservice

Overview

Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multi-task model that can perform multilingual speech recognition as well as speech translation and language identification. For more details: github.com/openai/whisper Use ASR_MODEL to specify which model to run. Available models are tiny, base, small, medium, large, large-v1 and large-v2. Please note that large and large-v2 are the same model. For English-only applications, the .en models tend to perform better, especially for the tiny.en and base.en models. We observed that the difference becomes less significant for the small.en and medium.en models. The ASR model is downloaded each time you start the container, using the large model this can take some time. If you want to decrease the time it takes to start your container by skipping the download, you can provide a persistent storage path. Next time you start your container the ASR Model will be loaded from there instead of being downloaded again. For GPU support, add '--gpus all' to 'Extra Parameters'.

Readme

View on GitHub

Release Docker Pulls Build Licence

🚀 Try Speech Box Desktop App | Offline, multi-language desktop transcriptions: https://speechbox.gumroad.com/l/desktop-app

Whisper ASR Box

Whisper ASR Box is a general-purpose speech recognition toolkit. Whisper Models are trained on a large dataset of diverse audio and is also a multitask model that can perform multilingual speech recognition as well as speech translation and language identification.

🎉 Join our Discord Community! Connect with other users, get help, and stay updated on the latest features: https://discord.gg/4Q5YVrePzZ

Features

Current release (v1.9.1) supports following whisper models:

Quick Usage

CPU

docker run -d -p 9000:9000 \
  -e ASR_MODEL=base \
  -e ASR_ENGINE=openai_whisper \
  onerahmet/openai-whisper-asr-webservice:latest

GPU

docker run -d --gpus all -p 9000:9000 \
  -e ASR_MODEL=base \
  -e ASR_ENGINE=openai_whisper \
  onerahmet/openai-whisper-asr-webservice:latest-gpu

Cache

To reduce container startup time by avoiding repeated downloads, you can persist the cache directory:

docker run -d -p 9000:9000 \
  -v $PWD/cache:/root/.cache/ \
  onerahmet/openai-whisper-asr-webservice:latest

Key Features

Multiple ASR engines support (OpenAI Whisper, Faster Whisper, WhisperX)
Multiple output formats (text, JSON, VTT, SRT, TSV)
Word-level timestamps support
Voice activity detection (VAD) filtering
Speaker diarization (with WhisperX)
FFmpeg integration for broad audio/video format support
GPU acceleration support
Configurable model loading/unloading
REST API with Swagger documentation

Environment Variables

Key configuration options:

ASR_ENGINE: Engine selection (openai_whisper, faster_whisper, whisperx)
ASR_MODEL: Model selection (tiny, base, small, medium, large-v3, etc.)
ASR_MODEL_PATH: Custom path to store/load models
ASR_DEVICE: Device selection (cuda, cpu)
MODEL_IDLE_TIMEOUT: Timeout for model unloading

Documentation

For complete documentation, visit: https://ahmetoner.github.io/whisper-asr-webservice

Development

# Install poetry v2.X
pip3 install poetry

# Install dependencies for cpu
poetry install --extras cpu

# Install dependencies for cuda
poetry install --extras cuda

# Run service
poetry run whisper-asr-webservice --host 0.0.0.0 --port 9000

After starting the service, visit http://localhost:9000 or http://0.0.0.0:9000 in your browser to access the Swagger UI documentation and try out the API endpoints.

Credits

This software uses libraries from the FFmpeg project under the LGPLv2.1

Install whisper-asr-webservice on Unraid in a few clicks.

Find whisper-asr-webservice in Community Apps on your Unraid server, review the template, and click Install. Unraid handles the Docker app or plugin setup from the published template.

Open the Apps tab on your Unraid server Search Community Apps for whisper-asr-webservice Review the template variables and paths Click Install

Explore Unraid OS

Related apps

Explore more like this

Explore all

Utilities apps

Links

Projectgithub.com Docker Hubregistry.hub.docker.com Templateraw.githubusercontent.com

Details

Repository

onerahmet/openai-whisper-asr-webservice

Registry

https://registry.hub.docker.com/r/onerahmet/openai-whisper-asr-webservice

Last Updated2025-07-01

First Seen2023-04-26

Runtime arguments

Web UI: http://[IP]:[PORT:9000]
Network: bridge
Shell: sh
Privileged: false
Extra Params: --gpus all

Template configuration

WebUIPorttcp

Target: 9000
Value: 9000

ASR_MODELVariable

Default: base
Value: base

Model storage pathPathrw

Target: /root/.cache/whisper

Library

Curated

Categories