whisper.cpp

Commit Graph

Author	SHA1	Message	Date
Georgi Gerganov	2b6d0d2200	rename : ggerganov -> ggml-org (#3005 )	2025-04-04 16:11:52 +03:00
Daniel Bevenius	0b17d4507e	examples : update server.py to match github pages app [no ci] (#3004 ) This commit updates examples/server.py which is used to serve the wasm examples locally. The changes include: - Added a redirect from the root URL to /whisper.cpp. So now accessing http://localhost:8000/ will redirect to http://localhost:8000/whisper.cpp/ which matches the url for the app deployed to github pages. - Custom handling for coi-serviceworker.js to serve it to avoid and error in the console. This file is not strictly necessary for the local server to work as the headers are provided already but it is nice to not have an error in the console. - Fixed the shutdown of the server to ensure it exits cleanly on Ctrl+C. Previously it would continue to hang onto the port even after the processed had exited.	2025-04-04 10:23:53 +02:00
Daniel Bevenius	77e0c86ab6	whisper.wasm : fix unknown language issue (#3000 ) * whisper.wasm : fix unknown language issue This commit addresses an issue with whisper.wasm where the following error was being displayed when running the application in github pages: ``` whisper_lang_id: unknown language 'д=␙c' ``` This turned out to be a memory corruption issue and further details can be found in the reference issue below. Refs: https://github.com/ggerganov/whisper.cpp/issues/2998	2025-04-03 19:50:47 +02:00
Georgi Gerganov	eac1bc9c47	examples : add new sources ggml-ci	2025-04-03 10:30:16 +03:00
Daniel Bevenius	854c0518bc	examples : clarify Core ML encoder model usage [no ci] (#2987 ) This commit clarifies the usage of the Core ML encoder model in the whisper.obj and whisper.swiftui examples. Refs: https://github.com/ggerganov/whisper.cpp/issues/2783	2025-04-02 08:32:14 +02:00
Daniel Bevenius	b358de2458	whisper.objc : fix typo in README.md [no ci] (#2985 ) This commit fixes a typo in the README.md file of the whisper.objc example. Resolves: https://github.com/ggerganov/whisper.cpp/issues/2984	2025-04-02 08:26:57 +02:00
Daniel Bevenius	e153b8eaa2	android.java : re-add ggml source updates (#2975 ) This commit updates the ggml source to include the new unary and binary operations. I merged https://github.com/ggerganov/whisper.cpp/pull/2958 which seems to have overwritten the changes to the ggml source which were added in https://github.com/ggerganov/whisper.cpp/pull/2972. Sorry about this.	2025-03-31 16:14:33 +02:00
Georgi Gerganov	0a40ae9728	android : add new ggml source files ggml-ci	2025-03-31 14:56:53 +03:00
Daniel Bevenius	2d8e40e2a0	examples : update README links to point to pages deployment (#2971 ) This commit updates the README links to point to the pages deployment instead of whisper.ggerganov.com.	2025-03-31 12:32:27 +02:00
Daniel Bevenius	e17af6524f	ci : add github pages workflow for wasm examples (#2969 ) * ci : add github pages workflow for wasm examples This commit adds a github workflow to build and deploy the wasm examples to github pages. The whisper.wasm example is deployed as the main page. This workflow is trigged by a push to master and will deploy the examples to: https://ggerganov.github.io/whisper.cpp/. This requires that the repository has enabled github actions in `Settings` -> `Pages` -> `Build and deployment` -> `Source` be set to `GitHub Actions`. One thing to note is that this commit removes the `talk` example as I'm not sure how this example is built yet. Refs: https://github.com/ggerganov/whisper.cpp/issues/2784	2025-03-31 11:34:40 +02:00
Sacha Arbonel	88d13a17a7	feat: add health check endpoint to server (#2968 )	2025-03-31 11:03:41 +03:00
Lin Xiaodong	1279f0d0bc	examples : support progress_callback API for addon.node (#2941 ) * feat: progress supported * fix: missing params * style: Format the code to improve readability Unified code indentation ensures consistent coding style, enhancing code readability and maintainability. * feat: support prompt api --------- Co-authored-by: linxiaodong <calm.lin@wukongsch.com>	2025-03-28 06:34:26 +01:00
Daniel Bevenius	996581c5e2	whisper.android : add GGML_USE_CPU compile definition (#2945 ) This commit add GGML_USE_CPU to built target library to enable CPU backend. The motivation for this that without the compile definition the CPU backend is not enabled and the app will crash when trying to use it.	2025-03-25 18:01:18 +01:00
Daniel Bevenius	226d344f56	whisper.android.java : update build with ggml source changes (#2942 ) * whisper.android.java : update build with ggml source changes This commit updates the whisper.android.java build to include the new ggml source files and directories. The gradle build configuration is also updated to include the aliyun maven repository.	2025-03-25 16:01:59 +01:00
Daniel Bevenius	30cf30ca82	examples : reduce initial memory to 512MB (#2939 ) * examples : reduce initial memory to 512MB This commit reduces the initial memory size to 512MB. This is done to to avoid WebAssembly memory allocation issues on some platforms. It also adds a flag to allow the memory to grow dynamically (up to the maximum). The motivation for this change is that currently the initial memory is set to 2GB which might be to large for some platforms. This will lead to an error being thrown from the JavaScript code generated by Emscripten when trying to allocate memory. More details can be found in the referenced issue below. * examples : set MAXIMUM_MEMORY instead of TOTAL_MEMORY This commit sets MAXIMUM_MEMORY instead of TOTAL_MEMORY in the whisper.wasm example. The motivation for this is that TOTAL_MEMORY and INITIAL_MEMORY are actually the same thing. Instead we want to set MAXIMUM_MEMORY to 2GB. Refs: https://github.com/ggerganov/whisper.cpp/issues/2920 Refs: https://emscripten.org/docs/tools_reference/settings_reference.html#initial-memory	2025-03-24 14:42:12 +01:00
Daniel Bevenius	ee6286c35d	examples : fix nthread parsing in whisper.wasm (#2938 ) This commit fixes the nthread parsing in the whisper.wasm example when using the `Threads` slider to change the number of threads to be used. Currently this results in the following error: ```console main.js:5597 Uncaught TypeError: Cannot convert "5" to int at checkAssertions (main.js:5597:21) at Object.toWireType (main.js:5611:15) at Object.full_default (eval at new_ (main.js:5292:27), <anonymous>:10:26) at whisper.wasm/:649:42 ```	2025-03-24 14:40:00 +01:00
Daniel Bevenius	c7941d5ccc	examples : fix request path for local worker files (#2937 ) This commit adds a fix to the server.py file to handle requests for web worker files when running the local python server to test the wasm examples. The motivation for this is that currently the server is serving files from the build-em/bin directory which is where the .worker.js files exist. But when examples access these resources they do so with the application context path, for example /whisper.wasm/libmain.worker.js but this will not be found as it currently works.	2025-03-24 14:33:45 +01:00
Peter	edf1ee1ef8	whisper : enhance model download scripts functionality and resolve compiler warning (#2925 ) * whisper : improve whisper-cli executable path detection in model download shell scripts If whisper-cli is found on the path, do not suggest invoking from build directory. This improves flexibility and usability for distribution and packaging scenarios. * whisper : enhance Windows model download batch script to have comparable functionality and behaviour as shell scripts * Download models to the current directory if the script is executed from the \bin\ directory (for future distribution scenarios where the script is in the \bin\ subdirectory of a Windows build) * Add model_path command line argument * If whisper-cli is found on the path, do not suggest invoking from build directory * whisper : resolve compiler warning by removing duplicate definition of NOMINMAX in whisper-cli code	2025-03-24 10:39:50 +02:00
Daniel Bevenius	3fc6ad97a3	whisper.swiftui : Add Core ML support to README [no ci] (#2921 ) This commit updates the README to include instructions on how to use a Core ML model with the example.	2025-03-21 11:38:32 +01:00
Daniel Bevenius	21fb513ef1	examples : update whisper.objc README.md (#2916 ) This commit updates the hisper.objc README.md to reflect the changes of using the xcframework and the new build process. Since whisper.cpp is no longer compiled by the example project, instead the library from the xframework will be used, the build instructions have been removed.	2025-03-21 09:52:53 +01:00
Daniel Bevenius	80dad86b2c	examples : add WHISPER_SDL2 check to deprecation executables (#2911 ) This commit adds a check for `WHISPER_SDL2` to the deprecation warning examples. This is to prevent the examples from being built when WHISPER_SDL2 is not enabled. The motivation for this is that currently these deprecation executables are generate and when run they refer the user to examples with other names, for example `whisper-command` but unless they have built with `WHISPER_SDL2` those executable will not be present: ```console $ ls build/bin/ bench command main quantize stream whisper-bench whisper-cli whisper-server $ ./build/bin/command WARNING: The binary 'command' is deprecated. Please use 'whisper-command' instead. See https://github.com/ggerganov/whisper.cpp/tree/master/examples/deprecation-warning/README.md for more information. ```	2025-03-20 18:36:02 +01:00
Daniel Bevenius	e7d9d8687a	examples : update wasm examples to include server.py [no ci] (#2908 ) This commit updates the README files for the wasm examples to include instructions on how to run the examples using the provided server.py which was included in Commit `6e8242f7fe` ("examples : command.wasm updates (#2904)"). The motivation for this is consistency with the command.wasm example.	2025-03-20 09:07:43 +01:00
Daniel Bevenius	6e8242f7fe	examples : command.wasm updates (#2904 ) This commit updates the command.wasm example by adding a server.py script to make it easy to start a local http server to try out the example, updates the build instructions, and also addresses some of the compiler warnings that were being generated. * emscripten : fix TOTAL_STACK for wasm This commit moves the TOTAL_STACK setting from the compile flags to the linker flags. This is because the TOTAL_STACK setting is a linker setting. The motivation for this change is that currently the following warnings are generated when building: ```console em++: warning: linker setting ignored during compilation: 'TOTAL_STACK' [-Wunused-command-line-argument] em++: warning: linker setting ignored during compilation: 'TOTAL_STACK' [-Wunused-command-line-argument] em++: warning: linker setting ignored during compilation: 'TOTAL_STACK' [-Wunused-command-line-argument] em++: warning: linker setting ignored during compilation: 'TOTAL_STACK' [-Wunused-command-line-argument] em++: warning: linker setting ignored during compilation: 'TOTAL_STACK' [-Wunused-command-line-argument] em++: warning: linker setting ignored during compilation: 'TOTAL_STACK' [-Wunused-command-line-argument] ``` * examples : suppress C++17 deprecation warning for std::codecvt_utf8 This commit suppresses the C++17 deprecation warning for std::codecvt_utf8 similar to what is done in examples/talk-llama/unicode.cpp. The motivation for this change is to suppress these warnings: ```console /Users/danbev/work/ai/whisper-work/examples/common.cpp:251:31: warning: 'codecvt_utf8<wchar_t>' is deprecated [-Wdeprecated-declarations] 251 \| std::wstring_convert<std::codecvt_utf8<wchar_t>> converter; \| ^ /Users/danbev/work/wasm/emsdk/upstream/emscripten/cache/sysroot/include/c++/v1/codecvt:193:28: note: 'codecvt_utf8<wchar_t>' has been explicitly marked deprecated here 193 \| class _LIBCPP_TEMPLATE_VIS _LIBCPP_DEPRECATED_IN_CXX17 codecvt_utf8 : public __codecvt_utf8<_Elem> { \| ^ /Users/danbev/work/wasm/emsdk/upstream/emscripten/cache/sysroot/include/c++/v1/__config:723:41: note: expanded from macro '_LIBCPP_DEPRECATED_IN_CXX17' 723 \| # define _LIBCPP_DEPRECATED_IN_CXX17 _LIBCPP_DEPRECATED \| ^ /Users/danbev/work/wasm/emsdk/upstream/emscripten/cache/sysroot/include/c++/v1/__config:688:49: note: expanded from macro '_LIBCPP_DEPRECATED' 688 \| # define _LIBCPP_DEPRECATED __attribute__((__deprecated__)) \| ^ /Users/danbev/work/ai/whisper-work/examples/common.cpp:251:10: warning: 'wstring_convert<std::codecvt_utf8<wchar_t>>' is deprecated [-Wdeprecated-declarations] 251 \| std::wstring_convert<std::codecvt_utf8<wchar_t>> converter; \| ^ /Users/danbev/work/wasm/emsdk/upstream/emscripten/cache/sysroot/include/c++/v1/locale:3145:28: note: 'wstring_convert<std::codecvt_utf8<wchar_t>>' has been explicitly marked deprecated here 3145 \| class _LIBCPP_TEMPLATE_VIS _LIBCPP_DEPRECATED_IN_CXX17 wstring_convert { \| ^ /Users/danbev/work/wasm/emsdk/upstream/emscripten/cache/sysroot/include/c++/v1/__config:723:41: note: expanded from macro '_LIBCPP_DEPRECATED_IN_CXX17' 723 \| # define _LIBCPP_DEPRECATED_IN_CXX17 _LIBCPP_DEPRECATED \| ^ /Users/danbev/work/wasm/emsdk/upstream/emscripten/cache/sysroot/include/c++/v1/__config:688:49: note: expanded from macro '_LIBCPP_DEPRECATED' 688 \| # define _LIBCPP_DEPRECATED __attribute__((__deprecated__)) \| ^ /Users/danbev/work/ai/whisper-work/examples/common.cpp:257:31: warning: 'codecvt_utf8<wchar_t>' is deprecated [-Wdeprecated-declarations] 257 \| std::wstring_convert<std::codecvt_utf8<wchar_t>> converter; \| ^ /Users/danbev/work/wasm/emsdk/upstream/emscripten/cache/sysroot/include/c++/v1/codecvt:193:28: note: 'codecvt_utf8<wchar_t>' has been explicitly marked deprecated here 193 \| class _LIBCPP_TEMPLATE_VIS _LIBCPP_DEPRECATED_IN_CXX17 codecvt_utf8 : public __codecvt_utf8<_Elem> { \| ^ /Users/danbev/work/wasm/emsdk/upstream/emscripten/cache/sysroot/include/c++/v1/__config:723:41: note: expanded from macro '_LIBCPP_DEPRECATED_IN_CXX17' 723 \| # define _LIBCPP_DEPRECATED_IN_CXX17 _LIBCPP_DEPRECATED \| ^ /Users/danbev/work/wasm/emsdk/upstream/emscripten/cache/sysroot/include/c++/v1/__config:688:49: note: expanded from macro '_LIBCPP_DEPRECATED' 688 \| # define _LIBCPP_DEPRECATED __attribute__((__deprecated__)) \| ^ /Users/danbev/work/ai/whisper-work/examples/common.cpp:257:10: warning: 'wstring_convert<std::codecvt_utf8<wchar_t>>' is deprecated [-Wdeprecated-declarations] 257 \| std::wstring_convert<std::codecvt_utf8<wchar_t>> converter; \| ^ /Users/danbev/work/wasm/emsdk/upstream/emscripten/cache/sysroot/include/c++/v1/locale:3145:28: note: 'wstring_convert<std::codecvt_utf8<wchar_t>>' has been explicitly marked deprecated here 3145 \| class _LIBCPP_TEMPLATE_VIS _LIBCPP_DEPRECATED_IN_CXX17 wstring_convert { \| ^ /Users/danbev/work/wasm/emsdk/upstream/emscripten/cache/sysroot/include/c++/v1/__config:723:41: note: expanded from macro '_LIBCPP_DEPRECATED_IN_CXX17' 723 \| # define _LIBCPP_DEPRECATED_IN_CXX17 _LIBCPP_DEPRECATED \| ^ /Users/danbev/work/wasm/emsdk/upstream/emscripten/cache/sysroot/include/c++/v1/__config:688:49: note: expanded from macro '_LIBCPP_DEPRECATED' 688 \| # define _LIBCPP_DEPRECATED __attribute__((__deprecated__)) \| ^ 4 warnings generated. ``` * ggml : suppress double-promotion warning in GGML_F16x4_REDUCE This commit adds a cast to `ggml_float` in the `GGML_F16x4_REDUCE` macro to suppress a double-promotion warning. Currently the following warning is generated when compiling the command.wasm example: ```console /whisper-work/ggml/src/ggml-cpu/ggml-cpu.c:1592:5: warning: implicit conversion increases floating-point precision: 'float' to 'ggml_float' (aka 'double') [-Wdouble-promotion] 1592 \| GGML_F16_VEC_REDUCE(sumf, sum); \| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /Users/danbev/work/ai/whisper-work/ggml/src/ggml-cpu/ggml-cpu.c:932:37: note: expanded from macro 'GGML_F16_VEC_REDUCE' 932 \| #define GGML_F16_VEC_REDUCE GGML_F16x4_REDUCE \| ^ /Users/danbev/work/ai/whisper-work/ggml/src/ggml-cpu/ggml-cpu.c:920:44: note: expanded from macro 'GGML_F16x4_REDUCE' 918 \| res = wasm_f32x4_extract_lane(x[0], 0) + \ \| ~ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 919 \| wasm_f32x4_extract_lane(x[0], 1) + \ \| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 920 \| wasm_f32x4_extract_lane(x[0], 2) + \ \| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~ 921 \| wasm_f32x4_extract_lane(x[0], 3); \ \| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /whisper-work/ggml/src/ggml-cpu/ggml-cpu.c:1640:9: warning: implicit conversion increases floating-point precision: 'float' to 'ggml_float' (aka 'double') [-Wdouble-promotion] 1640 \| GGML_F16_VEC_REDUCE(sumf[k], sum[k]); \| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /Users/danbev/work/ai/whisper-work/ggml/src/ggml-cpu/ggml-cpu.c:932:37: note: expanded from macro 'GGML_F16_VEC_REDUCE' 932 \| #define GGML_F16_VEC_REDUCE GGML_F16x4_REDUCE \| ^ /Users/danbev/work/ai/whisper-work/ggml/src/ggml-cpu/ggml-cpu.c:920:44: note: expanded from macro 'GGML_F16x4_REDUCE' 918 \| res = wasm_f32x4_extract_lane(x[0], 0) + \ \| ~ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 919 \| wasm_f32x4_extract_lane(x[0], 1) + \ \| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 920 \| wasm_f32x4_extract_lane(x[0], 2) + \ \| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~ 921 \| wasm_f32x4_extract_lane(x[0], 3); \ \| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2 warnings generated. ``` wasm_f32x4_extract_lane returns a 32-bit float and this is what the addition is performed on. But there is an implicit conversion from 32-bit float to 64-bit double when the result is assigned to `res`, which is of type `ggml_float`. My understanding here is that this is intentional and adding a cast to `ggml_float` should suppress the warning. * emscripten : add -Wno-deprecated to for emscripten This commit adds -Wno-deprecated to the CMAKE_CXX_FLAGS for emscripten builds. The motivation for this is that currently there a number of warnings generated like the following: ```console warning: JS library symbol '$print' is deprecated. Please open a bug if you have a continuing need for this symbol [-Wdeprecated] warning: JS library symbol '$printErr' is deprecated. Please open a bug if you have a continuing need for this symbol [-Wdeprecated] em++: warning: warnings in JS library compilation [-Wjs-compiler] em++: warning: linker setting ignored during compilation: 'ENVIRONMENT' [-Wunused-command-line-argument] warning: JS library symbol '$print' is deprecated. Please open a bug if you have a continuing need for this symbol [-Wdeprecated] warning: JS library symbol '$printErr' is deprecated. Please open a bug if you have a continuing need for this symbol [-Wdeprecated] em++: warning: warnings in JS library compilation [-Wjs-compiler] warning: JS library symbol '$print' is deprecated. Please open a bug if you have a continuing need for this symbol [-Wdeprecated] warning: JS library symbol '$printErr' is deprecated. Please open a bug if you have a continuing need for this symbol [-Wdeprecated] em++: warning: warnings in JS library compilation [-Wjs-compiler] em++: warning: linker setting ignored during compilation: 'ENVIRONMENT' [-Wunused-command-line-argument] em++: warning: linker setting ignored during compilation: 'ENVIRONMENT' [-Wunused-command-line-argument] ``` The downside of this is that we might miss other deprecation warnings in the future so I'm not sure if this is acceptable. But it make the wasm examples cleaner without the warnings. * examples : fix tautological-compare warning in stb_vorbis.c [no ci] This commit applies a fix to address a tautological-compare warning in stb_vorbis.c. The motivation for this is that currently the following warning is generated when compiling the commmand-wasm example: ```console /Users/danbev/work/ai/whisper-work/examples/stb_vorbis.c:1404:75: warning: pointer comparison always evaluates to false [-Wtautological-compare] 1404 \| if (f->stream_start + loc >= f->stream_end \|\| f->stream_start + loc < f->stream_start) { \| ^ 1 warning generated. ``` This fix was taken from an open pull request on the stb repository that addreses this issue: https://github.com/nothings/stb/pull/1746 * squash! examples : update command.wasm instructions [no ci] This commit adds a Python script to serve the the wasm examples build in the `build-em` directory. Initially I thought that it would be enough to start a simple python server but I did not notice that there was an error in the browser console when I did that: ```console command.js:1 Uncaught (in promise) DataCloneError: Failed to execute 'postMessage' on 'Worker': SharedArrayBuffer transfer requires self.crossOriginIsolated. at command.js:1:1206224 at new Promise (<anonymous>) at loadWasmModuleToWorker (command.js:1:1204981) at Array.map (<anonymous>) at Object.loadWasmModuleToAllWorkers (command.js:1:1206428) at command.js:1:1204318 at callRuntimeCallbacks (command.js:1:1202062) at preRun (command.js:1:6136) at run (command.js:1:1294094) at removeRunDependency (command.js:1:7046) ``` We need a few CORS headers to be set and in order hopefully make this easy for users a Python script is added to the examples directory. This should be able to server all the wasm examples provided they have been built. command.wasm's README.md is updated to reflect this change. * examples : remove unused functions This commit removed the unused functions convert_to_utf8 and convert_to_wstring from examples/common.cpp. * Revert "examples : fix tautological-compare warning in stb_vorbis.c [no ci]" This reverts commit `8e3c47d961`. We should not make this change here and instead when the upstream PR is merged we can sync with it. Refs: https://github.com/ggerganov/whisper.cpp/issues/2784	2025-03-20 07:02:18 +01:00
Daniel Bevenius	83b14c357c	examples : use xcframework in whisper.objc example (#2882 ) * examples : use xcframework in whisper.objc example This commit updates the whisper.objc example to use the xcframework. The motivation for this to be consistent with the swift example and to also act as a reference for how to use the xcframework in an objc project. Resolves: https://github.com/ggerganov/whisper.cpp/issues/2881 * examples : setup audio session viewDidload This commit adds the setup of the audio session in the viewDidload method of the ViewController.m file. This is necessary to allow the app to record audio. The motivation for this is that without this it was not possible to caputue audio from the microphone. It was possible to click on the Capture button but nothing happened after that, and the button was not marked red indicating that the button could be clicked again to stop capturing. With this change it is possible to capture audio from the microphone and get it transcribed.	2025-03-17 13:01:24 +01:00
Daniel Bevenius	e0f3c9d4dd	examples : add GGML_USE_CPU=ON flag to whisper.objc (#2880 ) This commit adds the GGML_USE_CPU=ON flag to the whisper.objc project in order to enable the CPU backend for the whisper.objc project. The motivation for this change is that currently the following error is generated when running the example: ```console ggml_backend_buffer_type_t ggml_backend_get_default_buffer_type(ggml_backend_t backend) { return ggml_backend_dev_buffer_type(backend->device); <- Thread 1: EXC_BAD_ACCESS (code=1, address=0x70) } ``` If we inspect the `backend` variable we can see that it is a `nullptr`. ```console (lldb) p backend (ggml_backend_t) nullptr ``` When running in a simulator and that automatically means that there will be no gpu as there is a check for this in the code. But the CPU backend should still be present. The objective-c code will compile the whisper sources including the ggml sources. And if `-DGGMLL_USE_CPU` is not defined then there will be no CPU backend, and in this particular case of backend at all. Resolves: https://github.com/ggerganov/whisper.cpp/issues/2870	2025-03-14 15:40:20 +01:00
Daniel Bevenius	d5cc27ee4d	examples : add dl to the list of libraries linked (#2875 ) * examples : add dl to the list of libraries linked This commit adds the dynamic linker library to the list of libraries linked by the examples. The motivation for this change is that when building the examples on ubuntu 20.04, which uses GCC 9.4.0, the dynamic linker requires explicit linking or the following error is generated: ```console [ 64%] Linking CXX executable ../../bin/whisper-cli cd /app/whisper.cpp/build/examples/cli && /usr/bin/cmake -E cmake_link_script CMakeFiles/whisper-cli.dir/link.txt --verbose=1 /usr/bin/c++ -O3 -DNDEBUG CMakeFiles/whisper-cli.dir/cli.cpp.o -o ../../bin/whisper-cli -Wl,-rpath,/app/whisper.cpp/build/src:/app/whisper.cpp/build/ggml/src: ../libcommon.a ../../src/libwhisper.so.1.7.4 -pthread ../../ggml/src/libggml.so ../../ggml/src/libggml-cpu.so ../../ggml/src/libggml-base.so /usr/bin/ld: ../libcommon.a(common-whisper.cpp.o): undefined reference to symbol 'dlclose@@GLIBC_2.2.5' /usr/bin/ld: /lib/x86_64-linux-gnu/libdl.so.2: error adding symbols: DSO missing from command line collect2: error: ld returned 1 exit status make[2]: * [examples/cli/CMakeFiles/whisper-cli.dir/build.make:89: bin/whisper-cli] Error 1 make[2]: Leaving directory '/app/whisper.cpp/build' make[1]: * [CMakeFiles/Makefile2:433: examples/cli/CMakeFiles/whisper-cli.dir/all] Error 2 make[1]: Leaving directory '/app/whisper.cpp/build' make: *** [Makefile:130: all] Error 2 ``` Resolves: https://github.com/ggerganov/whisper.cpp/issues/2854	2025-03-14 04:42:20 +01:00
Martin Destagnol	5bb1d58c6a	whisper: add xcframework build script (#2873 ) * whisper: add xcframework build script * added apple validation scripts * fixed Readme * validation script fix	2025-03-13 13:56:39 +01:00
Georgi Gerganov	7d14005717	objc : fix build, tmp remove GPU support, use C++17	2025-03-08 15:13:01 +02:00
Ivy233	ef40950c4a	common : more general m_audio_len update logic (#2855 ) Co-authored-by: Ivy233 <wangjinrun@uniontech.com>	2025-03-07 10:10:03 +02:00
Dmitry Atamanov	5b481a27a6	common : fix audio loading by miniaudio (#2862 )	2025-03-04 19:05:21 +02:00
Lin Xiaodong	fc7b1ee521	fix: missing include common-whisper (#2858 )	2025-03-02 20:55:11 +02:00
Diego Devesa	339a1cba5d	whisper : support GGML_BACKEND_DL (#2843 ) * whisper : support GGML_BACKEND_DL * fix DTW crash * whisper.objc : fix build - add ggml-cpp.h --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>	2025-02-27 13:35:07 +01:00
Georgi Gerganov	c64f3e8ada	common : separate whisper sources (#2846 ) * common : separate whisper sources * examples : add chrono * examples : add more headers	2025-02-27 12:50:32 +02:00
Georgi Gerganov	9f83f67221	common : fix build min/max (#2845 ) * common : try to fix build * cont : try another fix	2025-02-27 10:39:13 +02:00
Dmitry Atamanov	7d3da68f79	examples : use miniaudio for direct decoding flac, mp3, ogg and wav (#2759 )	2025-02-27 09:06:54 +02:00
petterreinholdtsen	b5d21359c1	stream : stop on ^C when no audio is received (#2822 ) Add check for ctrl-c in potentially endless loop while calling audio.get() to receive sound. Co-authored-by: Petter Reinholdtsen <pere@debian.org>	2025-02-27 08:59:51 +02:00
masahji	dfc6ca62f3	stream : add beam size parameter(#2836 ) * feat: Add beam size parameter to stream.cpp for beam search configuration * feat: Add beam size parameter to whisper full params in stream example * fix: Remove duplicate beam search size assignment in server.cpp	2025-02-25 11:39:33 +02:00
Judd	d682e15090	Fixes for Windows (#2790 ) Fixes for Windows: * MSVC default to utf-8 without BOM. * Console output code page changed to utf-8. --------- Co-authored-by: Judd <foldl@boxvest.com>	2025-02-06 15:37:21 +08:00
billyct	cadfc50eab	node : add max_len params in node addon (#2760 )	2025-02-03 22:49:06 +02:00
Georgi Gerganov	3f91832352	talk-llama : sync llama.cpp	2025-02-03 22:42:26 +02:00
Corey Earwood	7a423f1c00	whisper.objc : fix build and CI	2025-01-18 12:06:06 +02:00
Georgi Gerganov	99b011a9f5	talk-llama : sync llama.cpp	2025-01-14 10:38:01 +02:00
Georgi Gerganov	e940fbf283	server : fix build (#2718 )	2025-01-13 08:57:33 +02:00
Georgi Gerganov	35d0e02c72	talk-llama : sync llama.cpp (#2709 )	2025-01-13 08:55:48 +02:00
NETZkultur GmbH	45d3faf961	server : generate unique tmp filenames (#2718 ) #Summary This Merge Request adds a mechanism to generate unique filenames for FFmpeg conversions in whisper_server.cpp. Previously, a single fixed filename was used (e.g., whisper-server-tmp.wav), which could result in unexpected file overwrites under certain circumstances. By generating a unique filename per request, any risk of overwriting temporary files is eliminated. #Background / Motivation • Problem: Relying on a static filename for temporary audio files may lead to overwrites if multiple operations occur simultaneously or if the same file name is reused. • Goal: Dynamically generate unique filenames, ensuring each request or operation uses an isolated temporary file.	2025-01-13 08:55:21 +02:00
Yusuf Redžić	ece3ff88f6	cli : fix segfault on missing argument (#2700 )	2025-01-04 10:47:41 +02:00
Alter	c81b8b910b	objc : rename ggml-cpu-aarch64.c to .cpp (#2687 )	2025-01-02 12:05:09 +02:00
Georgi Gerganov	5136fd92c2	examples : handle "main.exe" deprecation	2024-12-30 13:00:18 +02:00
Andreas Lubbe	7d55637f0b	cli : add --suppress_nst support (#2664 )	2024-12-24 09:30:07 +02:00
Andreas Lubbe	0994506054	cli : add no_speech_thold (#2663 )	2024-12-24 09:29:19 +02:00
Georgi Gerganov	ed09075ca0	server : fix help print	2024-12-22 15:32:05 +02:00
Sacha Arbonel	4183517076	server : add no-speech threshold parameter and functionality (#2654 )	2024-12-21 17:00:08 +02:00
Georgi Gerganov	f4668169a0	whisper : rename suppress_non_speech_tokens to suppress_nst (#2653 )	2024-12-21 12:54:35 +02:00
Sacha Arbonel	944ce49439	server : add option to suppress non-speech tokens (#2649 ) * The parameter will suppress non-speech tokens like [LAUGH], [SIGH], etc. from the output when enabled. * add to whisper_params_parse * add missing param	2024-12-21 12:05:05 +02:00
Georgi Gerganov	2e59dced12	whisper : rename binaries + fix install (#2648 ) * whisper : rename binaries + fix install * cont : try to fix ci * cont : fix emscripten builds	2024-12-21 09:43:49 +02:00
Georgi Gerganov	ba6c2a8fd9	android : try to fix build	2024-12-18 12:52:16 +02:00
Georgi Gerganov	6576af00d7	files : remove old sources	2024-12-18 12:52:16 +02:00
Georgi Gerganov	61edb117a0	talk-llama : sync llama.cpp	2024-12-18 12:52:16 +02:00
Georgi Gerganov	60dc6d003f	common : remove old types ggml-ci	2024-12-18 12:52:16 +02:00
crummyh	d34445e960	stream : improve consistency in README (#2642 )	2024-12-18 08:43:48 +02:00
Georgi Gerganov	199579652e	common : add cstdio header	2024-12-16 08:57:04 +02:00
Georgi Gerganov	d17e7139d8	stream : update build instructions	2024-12-15 21:55:36 +02:00
Thamster	6a52eaea74	android : fix build and ci (#2624 ) * Adding missing CMakeLists.txt include for ggm-cpu needed by whisper.android * attempt to re-enable CI for JNI android --------- Co-authored-by: Your Name <you@example.com>	2024-12-14 17:25:53 +02:00
Georgi Gerganov	472464453d	ci : disable CUDA and Android builds	2024-12-08 20:14:35 +02:00
Georgi Gerganov	11dddfbc9e	ci : disable Obj-C build + fixes	2024-12-08 20:14:35 +02:00
Georgi Gerganov	f2c680f893	talk-llama : sync llama.cpp	2024-12-08 20:14:35 +02:00
Georgi Gerganov	02c6fcbc2c	common : fix compile warning ggml-ci	2024-12-08 20:14:35 +02:00
Georgi Gerganov	7fd8d9c220	whisper : adapt to new ggml (wip)	2024-11-20 21:00:08 +02:00
Georgi Gerganov	06e059b8f8	talk-llama : sync llama.cpp	2024-11-20 21:00:08 +02:00
Stefan Sydow	d24f981fb2	sycl: fix example build (#2570 )	2024-11-18 14:57:23 +02:00
Jhen-Jie Hong	c4e95fb74d	whisper.swiftui : switch Mac dest to Mac (Designed for iPad) (#2562 )	2024-11-15 15:21:53 +02:00
Georgi Gerganov	6477b84eb6	build : fixes	2024-11-15 15:21:04 +02:00
Georgi Gerganov	24d706774d	talk-llama : sync llama.cpp	2024-11-15 15:21:04 +02:00
Jhen-Jie Hong	5f8a086e22	whisper.swiftui : add model download list & bench methods (#2546 ) * swift : fix resources & exclude build * whisper : impl whisper_timings struct & api * whisper.swiftui : model list & bench methods * whisper : return ptr for whisper_get_timings * revert unnecessary change * whisper : avoid designated initializer * whisper.swiftui: code style changes * whisper.swiftui : get device name / os from UIDevice * whisper.swiftui : fix UIDevice usage * whisper.swiftui : add memcpy and ggml_mul_mat (commented)	2024-11-13 21:51:34 +02:00
Stefan Sydow	300c07b94d	examples : fix ffmpeg v5 build (#2543 ) remove call to 'av_register_all()' which does not exist in ffmpeg v5 anymore.	2024-11-13 21:41:52 +02:00
Georgi Gerganov	c65d0fd3c8	talk-llama : sync llama.cpp	2024-11-01 10:19:05 +02:00
Rotem Dan	b6049060dd	whisper : add dtw preset for large-v3-turbo (#2481 )	2024-10-15 21:00:21 +03:00
Georgi Gerganov	6e40108a59	objc : fix build	2024-10-05 15:23:51 +03:00
Georgi Gerganov	941912467d	whisper : adapt to latest ggml (skip) (#0 )	2024-10-05 15:23:51 +03:00
Rahul Vadhyar	2944cb72d9	examples : update dr_wav.h to newer version (#2449 )	2024-10-04 11:04:51 +03:00
Georgi Gerganov	ccc2547210	talk-llama : sync llama.cpp	2024-10-03 12:22:17 +03:00
gilbertgong	ede1718f6d	server : ffmpeg overwrite leftover temp file (#2431 ) * Remove possible leftover ffmpeg temp file from a previous failed conversion * Revert "Remove possible leftover ffmpeg temp file from a previous failed conversion" This reverts commit `00797403bd`. * Flag to force ffmpeg to overwrite output file if it exists	2024-10-02 15:06:40 +03:00
Georgi Gerganov	2ef717b293	whisper : add large-v3-turbo (#2440 )	2024-10-01 15:57:06 +03:00
Georgi Gerganov	451e9ee92c	make : remove "talk" target until updated	2024-09-24 19:45:08 +03:00
Georgi Gerganov	fe18c29ab8	talk-llama : sync llama.cpp	2024-09-24 19:45:08 +03:00
Georgi Gerganov	54e5095765	examples : adapt to ggml.h changes (ggml/0) ggml-ci	2024-09-24 19:45:08 +03:00
Toliver	5b1ce40fa8	server : use OS-generated temp file name for converted files (#2419 )	2024-09-17 15:56:32 +03:00
UsernamesLame	9600fc3eb1	readme : remove invalid flag from Python example (#2396 ) * Update README.md Fix broken C-style API link * Update whisper_processor.py Update examples/python/whisper_processor.py to remove nonexistent flag "-np" from subprocess.Popen call. * Add pywhispercpp to the Pybind11 Python wrapper list abdeladim-s/pywhispercpp wasn't added to the list / was removed at some point (?) It was referenced in issue #9, so I feel like it's worthy of being added as it's the first if not one of the first Python wrappers for whisper.cpp	2024-08-30 14:00:38 +03:00
Georgi Gerganov	da9809f243	talk-llama : sync llama.cpp	2024-08-28 13:22:20 +03:00
Justine Tunney	7f78675008	examples : use colorblind friendly TTY color scheme (#2360 ) This change updates the -pc flag, so that a new xterm256 color scheme is used. This color scheme is believed to be better for three reasons: 1. It should be friendlier to the colorblind. The scheme was designed by Paul Tol (see: https://personal.sron.nl/~pault/). TensorBoard uses it since 2017, so it's already popular in the machine learning community 2. It should appear to be the same colors as before to people who aren't i.e. it's still a red-green spectrum like before but lightly modified 3. It is readable in both white and black background terminals. The neon colors before were probably a bit too intense for white backgrounds.	2024-08-20 10:49:10 +03:00
Georgi Gerganov	58323bf8ed	build : fix aarch64 (#0 )	2024-08-08 22:48:46 +03:00
Georgi Gerganov	22058f2dbc	talk-llama : sync llama.cpp	2024-08-08 22:48:46 +03:00
Georgi Gerganov	c7ea4fd235	common : handle new quant types (ggml/0)	2024-08-08 22:48:46 +03:00
Georgi Gerganov	dbf9c15e30	talk-llama : sync llama.cpp	2024-07-08 14:53:55 +03:00
Georgi Gerganov	d3f6c34976	examples : fix compile warnings [no ci] (#0 )	2024-07-08 14:53:55 +03:00
Emmanuel Schmidbauer	bec9836849	server : add inference path to make OAI API compatible (#2270 )	2024-07-08 14:24:58 +03:00
Georgi Gerganov	4a62efbb95	cmake : minor fixes	2024-06-26 21:42:39 +03:00
Georgi Gerganov	dc8cc2dd6f	whisper : disable CUDA mel + fix FFMPEG	2024-06-26 20:11:38 +03:00
Georgi Gerganov	e30c679928	whisper : reorganize source code + improve CMake (#2256 ) * scripts : update sync [no ci] * files : reorganize [no ci] * sync : llama.cpp * cmake : link math library * cmake : build normal ggml library * files : move headers to include * objc : fix path to ggml-metal.h * ci : fix WHISPER_CUDA -> GGML_CUDA * scripts : sync LICENSE [no ci]	2024-06-26 19:34:09 +03:00
Georgi Gerganov	e293f17d34	talk-llama : sync llama.cpp	2024-06-18 09:45:37 +03:00
slaren	de29b193f6	move BLAS to a separate backend (cont) (llama/6210) ggml-ci	2024-06-18 09:39:40 +03:00
Georgi Gerganov	3b1ac03828	ggml : remove OpenCL (#0 )	2024-06-16 18:19:48 +03:00
Georgi Gerganov	061eeb9f61	talk-llama : sync llama.cpp	2024-06-16 18:19:48 +03:00
Borislav Stanimirov	af5833e298	whisper : remove `speed_up` and `phase_vocoder` functions (#2198 ) whisper : fix cast warning * whisper : remove phase_vocoder functions, ref #2195 * whisper : remove speed_up from whisper_full_params, closes #2195	2024-05-31 11:37:29 +03:00
Daniel Valdivia	a7dc2aab16	server : fix typo (#2181 ) A simple comment typo, PR can be dismissed	2024-05-25 10:46:22 +03:00
William Tambellini	1b51fdf170	examples : add support for decoding input with ffmpeg (Linux) (#2133 ) - search for ffmpeg libs/headers at cmake time - added ffmpeg-transcode.cpp into libcommon if ffmpeg on - hooked ffmpeg trancoding in common read_wav(...) - passed test: ./main -m ggml-base.en.bin -f samples/jfk.mp3	2024-05-21 18:31:41 +03:00
Pedro Probst	adee3f9c1f	node : add flash_attn param (#2170 )	2024-05-20 09:08:48 +03:00
Georgi Gerganov	7094ea5e75	whisper : use flash attention (#2152 ) * whisper : use flash attention in the encoder * whisper : add kv_pad * whisper : remove extra backend instance (huh?) * whisper : use FA for cross-attention * whisper : use FA for self-attention * whisper : simplify encoder FA * whisper : add flash_attn runtime parameter * scripts : add bench log * scripts : add M1 Pro bench log	2024-05-15 09:38:19 +03:00
petterreinholdtsen	9d5771ae43	talk-llama : reject runs without required arguments (#2153 ) * Extended talk-llama example to reject runs without required arguments. Print warning and exit if models are not specified on the command line. * Update examples/talk-llama/talk-llama.cpp * Update examples/talk-llama/talk-llama.cpp --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>	2024-05-14 21:32:41 +03:00
Georgi Gerganov	4ef8d9f44e	server : return utf-8 (#2138 )	2024-05-13 15:33:46 +03:00
Pedro Probst	3928dbd206	node : add audio_ctx and audio buffer params (#2123 ) * node : add audio_ctx param * node : support passing audio buffer directly * node : parse audio_ctx in index.js --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>	2024-05-13 15:22:23 +03:00
valVk	30f73109b8	node : add additional params (#2000 ) * Add additional params to addon.node * Add comma_in_time as parameter * Fix tests	2024-05-13 15:15:43 +03:00
Mark Karpelès	17fa62d3d3	js : remove un-needed request header from fetchRemote (#2119 )	2024-05-13 15:13:19 +03:00
Daniel Ziegenberg	0bb05b113d	main : dont print timings with --no-prints (#2108 ) Signed-off-by: Daniel Ziegenberg <daniel@ziegenberg.at>	2024-05-13 15:00:19 +03:00
Daniel Ziegenberg	f141b2b938	main : add options for temperature control (#2088 ) Add two options: ``` -tp, --temperature N [0.00 ] The sampling temperature, between 0 and 1 -tpi, --temperature-inc N [0.20 ] The increment of temperature, between 0 and 1 ``` The sampling temperature, between 0 and 1. Higher values like 0.8 will make the output more random, while lower values like 0.2 will make it more focused and deterministic. If set to 0, the model will use log probability to automatically increase the temperature until certain thresholds are hit. Signed-off-by: Daniel Ziegenberg <daniel@ziegenberg.at>	2024-05-13 14:59:44 +03:00
zhangjixiong	e93081f83f	whisper.android : update example, add field to print timestamp (#2072 )	2024-05-13 14:30:03 +03:00
Xingchen Song(宋星辰)	b6bbce4ae9	cmake : fix json INTERFACE library (#2069 )	2024-05-13 14:29:39 +03:00
mashizora	7705dc52da	main : fix double quote escaping in csv output (#2090 )	2024-05-13 11:55:32 +03:00
Georgi Gerganov	3fa7d29876	talk-llama : sync llama.cpp	2024-05-13 11:02:26 +03:00
Georgi Gerganov	accada542a	ggml : resolve merge (ggml/0) ggml-ci	2024-05-13 11:02:26 +03:00
Pedro Probst	58210d6a76	examples : fix node compilation (#2115 ) * node : fix compilation and update examples * node : fix readme * Update addon.node test	2024-05-02 22:52:55 +01:00
Georgi Gerganov	b0c3cbf2e8	main : pass nullptr when regex is empty (#2070 )	2024-04-17 12:23:47 +03:00
Emmanuel Schmidbauer	9fab28135c	server : add dtw (#2044 ) * server.cpp: add dtw * Update examples/server/server.cpp --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>	2024-04-15 22:16:58 +03:00
Pedro Probst	1b5439a6c2	node : support no timestamps (#2048 ) * fix: node: do not compute timestamps if you do not need them * feat: add no_timestamps parameter to node addon	2024-04-15 20:03:34 +03:00
Kendrick Taylor	5c554c04ff	whisper.nvim : fix missing reference to "model" variable (#2049 )	2024-04-15 19:41:28 +03:00
Ikko Eltociear Ashimine	c383f091a1	whisper : update grammar-parser.cpp (#2058 ) preceeding -> preceding	2024-04-15 19:40:27 +03:00
ulatekh	c15b4cda7d	common : fix file-handle leak in read_wav() (#2026 ) Now it cleans up in case of error.	2024-04-09 18:34:34 +03:00
Rotem Dan	d3cfb6ca2b	main : set stdin to binary mode on Windows (#2025 )	2024-04-09 18:33:32 +03:00
ulatekh	671b4bde6c	main : allow a response-file as the sole parameter (#2019 ) * The "main" example now allows a response-file as the sole parameter. A response-file is a text file with command-line parameters, one per line. Prefix the name of the response-file with "@" to identify it as such. It's used under MS Windows to work around command-line length limits. It may be useful under other platforms to simplify character-escaping. * minor : style --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>	2024-04-09 18:31:16 +03:00
ulatekh	c8eeb93a6a	whisper : suppress tokens with a regex (#1997 ) * Allow a regular expression to describe tokens to suppress. Example: --suppress-tokens-re "[,\.]\|[ ]?[0-9]+" will suppress commas, periods, and numeric tokens. Technique inspired by https://github.com/openai/whisper/discussions/1041 Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> * Blind change to fix Java test. --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>	2024-04-09 18:27:28 +03:00
ulatekh	319fe5146e	cmake : create solution folders (#2004 ) * Create solution folders in the CMake build. * Fixed non-SDL2 build. * Fixed emscripten build.	2024-04-09 18:23:33 +03:00
Georgi Gerganov	81a3c41aa0	talk-llama : sync llama.cpp	2024-04-07 16:21:08 +03:00
ulatekh	fc366b807a	main : add command-style grammar (#1998 ) * Implemented command-style grammar in the main example. Mostly just copied the relevant parts from the command example. * main : code style --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>	2024-03-28 12:02:10 +02:00
Georgi Gerganov	9fb308d90f	make : add grammar parser to common objects	2024-03-28 11:59:48 +02:00
Georgi Gerganov	2948c740a2	sync : ggml (#2001 ) * sync : update scripts * sync : ggml * talk-llama : sync llama.cpp * make : WHISPER_CUBLAS -> WHISPER_CUDA * ci : try to fix sycl build * talk-llama : fix make build	2024-03-27 18:55:10 +02:00
Georgi Gerganov	1558ec5a16	whisper : improve handling of prompts (#1981 ) * whisper : improve handling of prompts * whisper : add whisper_token_count helper	2024-03-25 14:48:19 +02:00
Mohammadreza Hendiani	04e48094e4	readme : add Fedora dependencies (#1970 ) * README.md fix documentaion and added fedora liunx dependencies for stream build * fix documentaion and added fedora liunx dependencies for command build * fix documentaion and added fedora liunx dependencies for talk build * fix documentaion and added fedora liunx dependencies for talk-llama build * reverted back mistakenly removed MacOS documentaion	2024-03-20 18:42:11 +02:00
denersc	741abb162c	whisper : token-level timestamps with DTW (#1485 ) * whisper.cpp: impl dtw algo * WIP: producing and placing DTW timestamps on tokens * Fix compile and assertion errors. Attempt to DTW timestamp with single_segment=false. * Fix mistake causing incorrect alignment of dtw timestamps * implement N_TOP_MOST and CUSTOM alignment heads setting * whisper: fix typo on alignment heads enum * Fix issues related to changes in whisper.cpp * Fixed excessive memory use when using DTW timestamps. Other minor fixes to DTW timestamping function * decoder: save cross QKs only if requested * Calling median filter with ggml_map_custom1 * Reimpl aheads n_top_most and custom. Sanity checks on chosen aheads * Copying cross QKs from decoder backend correctly * dtw: cleanup * Fix incorrect n_frames passed to dtw when near end of audio * Fix aheads_masks_init for backend != CPU * whisper : minor style * main : add dtw (wip) * whisper: fix invalid memory access in aheads_masks_init * main : add dtw (cont) * whisper : minor --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>	2024-03-20 18:25:26 +02:00
Jo Liss	e7794a868f	examples : rename --audio-context to --audio-ctx per help text (#1953 )	2024-03-18 17:53:33 +02:00
Georgi Gerganov	de4d067f1e	talk-llama : sync llama.cpp	2024-03-15 14:21:59 +02:00
slaren	f60ccfd83b	update examples and tests	2024-03-15 14:01:14 +02:00
Georgi Gerganov	2f5a5a66dd	talk-llama : use llama_decode instead of llama_eval	2024-03-08 12:04:43 +02:00
Georgi Gerganov	8e409d1113	talk-llama : sync llama.cpp	2024-03-08 11:55:50 +02:00
Georgi Gerganov	05d1b61af4	talk-llama : sync llama.cpp	2024-03-08 11:52:47 +02:00
F1L1P	2e2626b167	examples : Auto lowercase language parameter in main.cpp (#1928 ) * Auto lowercase language parameter * Update examples/main/main.cpp Co-authored-by: bobqianic <129547291+bobqianic@users.noreply.github.com> --------- Co-authored-by: bobqianic <129547291+bobqianic@users.noreply.github.com>	2024-03-06 22:25:10 +00:00
zhouwg	c0c0ae2dea	examples : fix typo in bench.cpp (#1933 )	2024-03-06 22:21:44 +00:00
zhouwg	f22d27a385	whisper.android.java : fix returns in JNI (#1929 )	2024-03-05 15:59:26 +02:00
Georgi Gerganov	25d313b38b	talk-llama : sync llama.cpp	2024-02-28 13:04:05 +02:00
Georgi Gerganov	1711bb3881	sync : llama.cpp (ggml/0)	2024-02-28 13:00:30 +02:00
Andrew S	0d8fd8483a	stream.wasm : fix invalid memory access when no segments (#1902 ) No segments may be returned when a smaller sample buffer (EG 2048 samples) is sent to the worker.	2024-02-26 10:12:35 +02:00

1 2 3 4 5 ...

584 Commits