whisper.cpp/src
Daniel Worthington-Bodart 166c20b473
whisper : add stateless VAD detect + explicit state reset for streaming (#3677)
whisper_vad_detect_speech resets LSTM state on every call, which is
correct for batch processing but prevents temporal continuity when
calling per-chunk in a streaming loop.

Add whisper_vad_detect_speech_no_reset (skips buffer clear) and
whisper_vad_reset_state (explicit clear between utterances).
Existing whisper_vad_detect_speech is now a thin wrapper — zero
behavior change for current callers.

Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-04-17 13:36:27 +02:00
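
The split described in the commit message can be illustrated with a toy model. The sketch below is not whisper.cpp code: `toy_vad`, `toy_vad_reset_state`, `toy_vad_detect_no_reset`, and `toy_vad_detect` are hypothetical names, and a single smoothed-energy float stands in for the LSTM buffers. It only shows the pattern the commit names: a stateful detect that keeps state across chunks, an explicit reset, and a batch entry point that is a thin wrapper over the two. The real function signatures are not shown in this listing and are not assumed here.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical toy detector: one float of carried state stands in for
 * the recurrent (LSTM) buffers of a real VAD model. */
typedef struct {
    float state; /* smoothed energy carried from call to call */
} toy_vad;

/* Explicit clear, meant to be called between utterances
 * (same role as the reset function named in the commit message). */
static void toy_vad_reset_state(toy_vad *v) {
    v->state = 0.0f;
}

/* Streaming variant: updates the carried state and never clears it,
 * so each chunk is judged in the context of the chunks before it. */
static bool toy_vad_detect_no_reset(toy_vad *v, const float *samples, size_t n) {
    for (size_t i = 0; i < n; i++) {
        float energy = samples[i] * samples[i];
        v->state = 0.5f * v->state + 0.5f * energy;
    }
    return v->state > 0.1f; /* arbitrary toy threshold */
}

/* Batch variant: a thin wrapper that clears state and then delegates,
 * so callers of the original entry point see unchanged behavior. */
static bool toy_vad_detect(toy_vad *v, const float *samples, size_t n) {
    toy_vad_reset_state(v);
    return toy_vad_detect_no_reset(v, samples, n);
}
```

In this toy, a quiet chunk that follows a loud one is still reported as speech by the no-reset variant because the carried state has not yet decayed, while the batch wrapper judges the same chunk in isolation. That difference is the temporal continuity the commit message refers to.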
coreml coreml : backport CoreML features to macos < 14 (#3255) 2025-06-24 09:24:27 +02:00
openvino whisper : reorganize source code + improve CMake (#2256) 2024-06-26 19:34:09 +03:00
CMakeLists.txt examples : fix executable example targets (#3600) 2026-01-13 08:08:18 +01:00
whisper-arch.h vad : add initial Voice Activity Detection (VAD) support (#3065) 2025-05-12 16:10:11 +02:00
whisper.cpp whisper : add stateless VAD detect + explicit state reset for streaming (#3677) 2026-04-17 13:36:27 +02:00