whisper-asr-webservice

whisper-asr-webservice

Application Docker from Vodros' Repository

Vue d'ensemble

Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multi-task model that can perform multilingual speech recognition as well as speech translation and language identification. For more details: github.com/openai/whisper Use ASR_MODEL to specify which model to run. Available models are tiny, base, small, medium, large, large-v1 and large-v2. Please note that large and large-v2 are the same model. For English-only applications, the .en models tend to perform better, especially for the tiny.en and base.en models. We observed that the difference becomes less significant for the small.en and medium.en models. The ASR model is downloaded each time you start the container, using the large model this can take some time. If you want to decrease the time it takes to start your container by skipping the download, you can provide a persistent storage path. Next time you start your container the ASR Model will be loaded from there instead of being downloaded again. For GPU support, add '--gpus all' to 'Extra Parameters'.

Arguments d'exécution

Interface utilisateur Web
http://[IP]:[PORT:9000]
Réseau
bridge
Coquille
sh
Privilégié
false
Paramètres supplémentaires
--gpus all

Configuration du modèle

WebUIPorttcp
Cible
9000
Valeur
9000
ASR_MODELVariable
Défaut
base
Valeur
base
Model storage pathPathrw
Cible
/root/.cache/whisper

Télécharger les statistiques

1,794,454
Total des téléchargements
90,508
Ce mois-ci
72,507
Moyenne / Mois

Total des téléchargements au fil du temps

Chargement du graphique...

Détails

Référentiel
onerahmet/openai-whisper-asr-webservice
Dernière mise à jour2025-07-01
Première vue2023-04-26

Exécutez whisper-asr-webservice sur Unraid.

whisper-asr-webservice est listé dans Community Apps pour Unraid OS. Explorez Unraid pour créer un serveur domestique flexible, un NAS ou un laboratoire domestique.