whisper.cpp/src
Daniel Bevenius e41bc5c61a
vad : add initial Voice Activity Detection (VAD) support (#3065)
* vad : add initial Voice Activity Detection (VAD) support

This commit add support for Voice Activity Detection (VAD). When enabled
this feature will process the audio input and detect speech segments.
This information is then used to reduce the number of samples that need
to be processed by whisper_full.

Resolves: https://github.com/ggml-org/whisper.cpp/issues/3003

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2025-05-12 16:10:11 +02:00
..
coreml coreml: fix Whisper to CoreML conversion by disabling SDPA [no ci] (#2979) 2025-04-01 18:01:23 +02:00
openvino whisper : reorganize source code + improve CMake (#2256) 2024-06-26 19:34:09 +03:00
CMakeLists.txt whisper : add support for backends with multiple ggml_backend_buffer_type (#2863) 2025-03-26 16:54:02 +02:00
whisper-arch.h vad : add initial Voice Activity Detection (VAD) support (#3065) 2025-05-12 16:10:11 +02:00
whisper.cpp vad : add initial Voice Activity Detection (VAD) support (#3065) 2025-05-12 16:10:11 +02:00