Used to overwrite the audio context size of the Encoder. For example, setting "audio_ctx = 512" will make it run about 3 times faster, processing about 10s of audio, instead of 30s. The transcription quality drops, but this can be used for real-time streaming purposes where performance is important. |
||
|---|---|---|
| .. | ||
| bench | ||
| main | ||
| stream | ||
| whisper.nvim | ||
| whisper.objc | ||
| whisper.wasm | ||
| CMakeLists.txt | ||
| dr_wav.h | ||
| generate-karaoke.sh | ||