whisper-asr-webservice
Docker 应用程序 from Vodros' Repository
概述
Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multi-task model that can perform multilingual speech recognition as well as speech translation and language identification. For more details: github.com/openai/whisper
Use ASR_MODEL to specify which model to run. Available models are tiny, base, small, medium, large, large-v1 and large-v2. Please note that large and large-v2 are the same model. For English-only applications, the .en models tend to perform better, especially for the tiny.en and base.en models. We observed that the difference becomes less significant for the small.en and medium.en models.
The ASR model is downloaded each time you start the container, using the large model this can take some time. If you want to decrease the time it takes to start your container by skipping the download, you can provide a persistent storage path. Next time you start your container the ASR Model will be loaded from there instead of being downloaded again.
For GPU support, add '--gpus all' to 'Extra Parameters'.
运行时参数
- 网络用户界面
http://[IP]:[PORT:9000]- 网络
bridge- 外壳
sh- 特权
- false
- 额外参数
--gpus all
模板配置
WebUIPorttcp
- 目标
- 9000
- 价值
- 9000
ASR_MODELVariable
- 默认值
- base
- 价值
- base
Model storage pathPathrw
- 目标
- /root/.cache/whisper
下载统计数据
1,794,454
下载总数
90,508
本月
72,507
平均每月
长期下载总量
加载图表...
详细信息
存储库
onerahmet/openai-whisper-asr-webservice最后更新2025-07-01
初见2023-04-26
在Unraid 上运行 whisper-asr-webservice 。
whisper-asr-webservice 已被列入Unraid OS 的社区应用程序。探索Unraid ,构建灵活的家庭服务器、NAS 或家庭实验室。