NVIDIA GTX 9xx, 10xx, 20xx, 30xx, 40xx (x86_64): kokoro-fastapi-gpu:latest-cu126 or kokoro-fastapi-gpu:latest
NVIDIA RTX 50-series / Blackwell (x86_64): kokoro-fastapi-gpu:latest-cu128
NVIDIA on arm64 ...
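The GPU-to-image mapping above can be written as a small lookup. This is a minimal sketch and not part of the project: the function name `pick_gpu_image` and the labels `"pre-blackwell"` and `"blackwell"` are hypothetical, chosen only for illustration. It covers just the two x86_64 entries whose tags are given above.

```python
# Hypothetical helper: maps a GPU generation label to an image tag listed above.
# The labels ("pre-blackwell", "blackwell") are illustrative, not project terminology.
IMAGE_TAGS = {
    "pre-blackwell": "kokoro-fastapi-gpu:latest-cu126",  # 9xx through 40xx, x86_64
    "blackwell": "kokoro-fastapi-gpu:latest-cu128",      # RTX 50-series, x86_64
}


def pick_gpu_image(generation: str) -> str:
    """Return the image tag for a generation label; raise ValueError if unknown."""
    try:
        return IMAGE_TAGS[generation.lower()]
    except KeyError:
        raise ValueError(f"no image tag known for {generation!r}") from None


print(pick_gpu_image("blackwell"))
```

Raising on an unknown label, rather than falling back to a default tag, avoids silently picking an image built for the wrong CUDA version.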
This project is a real-time transcription application that uses the OpenAI Whisper model to convert speech to text. It can transcribe both live audio input from a microphone ...
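The description does not say how audio reaches the model. One common pattern for live transcription, shown here purely as an assumption, is to buffer incoming samples into fixed-size chunks and transcribe each chunk in turn. In this sketch `transcribe_stream`, `chunk_size`, and `fake_transcribe` are hypothetical names; the stand-in transcriber replaces the Whisper call so the example runs without a model or a microphone.

```python
from typing import Callable, Iterable, Iterator, List


def transcribe_stream(
    samples: Iterable[float],
    chunk_size: int,
    transcribe: Callable[[List[float]], str],
) -> Iterator[str]:
    """Yield one piece of text per chunk of `chunk_size` samples."""
    buffer: List[float] = []
    for sample in samples:
        buffer.append(sample)
        if len(buffer) == chunk_size:
            yield transcribe(buffer)
            buffer = []
    if buffer:  # flush the final, shorter chunk
        yield transcribe(buffer)


# Stand-in transcriber (assumption): a real application would call the model here.
def fake_transcribe(chunk: List[float]) -> str:
    return f"<{len(chunk)} samples>"


for text in transcribe_stream([0.0] * 10, chunk_size=4, transcribe=fake_transcribe):
    print(text)
```

Passing the transcriber in as a callable keeps the buffering logic independent of any particular model, so the same loop could serve both live and file-based input.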