whisper.cpp/ggml
Oliver Simons 75b9543856
CUDA : Fix CUB's argsort when nrows % block_size == 0 CCCL < 3.1 (llama/21181)
* CUDA: Fix CUB's argsort when nrows % block_size == 0 CCCL < 3.1

We wrongly calculated offset_grid as `ceildiv(nrows, block_size)`,
while it must be `ceildiv(nrows + 1, block_size)`. As a consequence, we
had uninitialized values in `offset_iterator[nrows]` for the case when
`nrows % block_size == 0`.

Fixes #21162

* Reduce nrows in test case to 256, don't need 768
2026-04-30 11:28:57 +03:00
..
cmake cmake : remove unused file (ggml/1419) 2026-02-08 09:29:10 +02:00
include llama: fix llama-model-saver (llama/20503) 2026-03-29 15:04:36 +03:00
src CUDA : Fix CUB's argsort when nrows % block_size == 0 CCCL < 3.1 (llama/21181) 2026-04-30 11:28:57 +03:00
.gitignore whisper : reorganize source code + improve CMake (#2256) 2024-06-26 19:34:09 +03:00
CMakeLists.txt ggml : bump version to 0.9.9 (ggml/1449) 2026-04-30 11:28:52 +03:00