whisper-asr-webservice

Docker app from Vodros' Repository

Overview

Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is a multi-task model that can perform multilingual speech recognition, speech translation, and language identification. For more details, see github.com/openai/whisper.

Use ASR_MODEL to specify which model to run. Available models are tiny, base, small, medium, large, large-v1 and large-v2. Please note that large and large-v2 are the same model. For English-only applications, the .en models tend to perform better, especially tiny.en and base.en; the difference becomes less significant for small.en and medium.en.

The ASR model is downloaded each time you start the container, which can take some time with the large model. To shorten startup by skipping the download, provide a persistent storage path: the next time the container starts, the model is loaded from there instead of being downloaded again.

For GPU support, add '--gpus all' to 'Extra Parameters'.
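As a small illustration of the model names above, the following sketch checks a candidate ASR_MODEL value before it is placed in the template. The helper `pick_asr_model` is hypothetical and not part of the container; it also assumes the container accepts the `.en` names as ASR_MODEL values, which the overview implies but does not state outright.

```python
# Model names as listed in the overview; the ".en" variants are the
# English-only models it mentions. Accepting them as ASR_MODEL values
# is an assumption of this sketch.
MULTILINGUAL_MODELS = ("tiny", "base", "small", "medium", "large", "large-v1", "large-v2")
ENGLISH_ONLY_MODELS = ("tiny.en", "base.en", "small.en", "medium.en")


def pick_asr_model(size: str, english_only: bool = False) -> str:
    """Return a value for ASR_MODEL, or raise ValueError if the name is unknown.

    Hypothetical helper for illustration; not part of the container.
    """
    candidate = f"{size}.en" if english_only else size
    if candidate in MULTILINGUAL_MODELS or candidate in ENGLISH_ONLY_MODELS:
        return candidate
    raise ValueError(f"unknown ASR_MODEL value: {candidate!r}")
```

For example, `pick_asr_model("tiny", english_only=True)` yields `tiny.en`, while asking for an English-only `large` model raises an error because no `large.en` model is listed.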

Runtime arguments

Web UI: http://[IP]:[PORT:9000]
Network: bridge
Shell: sh
Privileged: false
Extra Parameters: --gpus all

Template configuration

WebUI (Port, tcp): target 9000, value 9000
ASR_MODEL (Variable): default base, value base
Model storage path (Path, rw): target /root/.cache/whisper
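Outside of the Unraid template, these settings correspond roughly to a plain `docker run` invocation. The sketch below assembles such a command line from the values listed here; it is an illustration under stated assumptions, not the template's actual implementation. The function name `build_docker_run` and the host directory `/mnt/user/appdata/whisper` are hypothetical, and the image name is taken from the repository shown on this page.

```python
import shlex


def build_docker_run(model="base", host_port=9000, cache_dir=None, gpu=False):
    """Assemble a `docker run` command line from the template values.

    Hypothetical helper: it only builds the string and does not run Docker.
    """
    args = ["docker", "run", "-d"]
    if gpu:
        args += ["--gpus", "all"]            # the template's Extra Parameters
    args += ["-p", f"{host_port}:9000"]      # WebUI port, container target 9000
    args += ["-e", f"ASR_MODEL={model}"]     # ASR_MODEL variable
    if cache_dir is not None:
        # Persist downloaded models so they are not fetched on every start.
        args += ["-v", f"{cache_dir}:/root/.cache/whisper:rw"]
    args.append("onerahmet/openai-whisper-asr-webservice")
    return shlex.join(args)


# The host directory below is an assumed example path.
print(build_docker_run(model="small", cache_dir="/mnt/user/appdata/whisper", gpu=True))
```

Leaving `cache_dir` unset reproduces the default behaviour described in the overview, where the model is downloaded again on each container start.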

Download statistics

Total downloads: 1,794,454
This month: 90,508
Average per month: 72,507

Details

Repository: onerahmet/openai-whisper-asr-webservice
Last updated: 2025-07-01
First seen: 2023-04-26

Run whisper-asr-webservice on Unraid.

whisper-asr-webservice is available in Community Apps for Unraid OS. Explore Unraid to build a flexible home server, NAS, or homelab.