whisper.cpp

Commit Graph

Author	SHA1	Message	Date
stormofice	c0fd64a9c0	readme : fix broken links in implementation details section (#2382 )	2024-08-28 11:41:51 +03:00
Eric Curtin	d65786ea54	readme : fix broken links (#2358 ) For whisper.cpp and whisper.h files	2024-08-20 10:57:45 +03:00
Georgi Gerganov	e30c679928	whisper : reorganize source code + improve CMake (#2256 ) * scripts : update sync [no ci] * files : reorganize [no ci] * sync : llama.cpp * cmake : link math library * cmake : build normal ggml library * files : move headers to include * objc : fix path to ggml-metal.h * ci : fix WHISPER_CUDA -> GGML_CUDA * scripts : sync LICENSE [no ci]	2024-06-26 19:34:09 +03:00
Georgi Gerganov	3b1ac03828	ggml : remove OpenCL (#0 )	2024-06-16 18:19:48 +03:00
Martin Delille	b87494bb8f	readme : add conan badge (#2196 ) * Add conan badge * Fix markdown formating	2024-05-30 15:43:28 +03:00
Carlos Zoido	ad130431aa	readme : add install instructions for Conan (#2189 )	2024-05-30 15:06:15 +03:00
Georgi Gerganov	c7b6988678	release : v1.6.2	2024-05-27 10:35:09 +03:00
Georgi Gerganov	08981d1bac	release : v1.6.0	2024-05-15 09:59:48 +03:00
AIWintermuteAI	a750868428	readme : add up-to-date repository for Python bindings (#2063 ) README	2024-04-16 14:15:52 +03:00
Georgi Gerganov	7395c70a74	release : v1.5.5	2024-04-16 14:08:31 +03:00
Georgi Gerganov	52ccd4a3a8	files : rename ./extra to ./scripts	2024-04-09 20:13:41 +03:00
slashlib	956ef860bc	cmake : support for CPU BLAS build via Intel MKL (#2024 )	2024-04-09 18:32:46 +03:00
Georgi Gerganov	2948c740a2	sync : ggml (#2001 ) * sync : update scripts * sync : ggml * talk-llama : sync llama.cpp * make : WHISPER_CUBLAS -> WHISPER_CUDA * ci : try to fix sycl build * talk-llama : fix make build	2024-03-27 18:55:10 +02:00
Gavin Cai	c713eb5e2a	readme : recommend MacOS Sonoma for Core ML (#1917 )	2024-03-04 21:16:13 +02:00
Jumper775	917c56ded4	models : fix openvino setup info (#1874 )	2024-02-19 02:19:47 +00:00
Michael Rienstra	4bbb60efce	docs : make model options / model install methods clearer (#1806 ) * Make models more "discoverable" * Clean up code block language identifiers * make 3 options clearer * undo Prettier formatter change * docs: `$` shell prompt, consistently * docs: minor changes	2024-01-26 17:39:54 +02:00
Georgi Gerganov	0b9af32a8b	release : v1.5.4	2024-01-05 17:11:27 +02:00
Georgi Gerganov	9962371f71	release : v1.5.3	2024-01-03 19:36:33 +02:00
Chaoqun	d2ee117a0a	docker : Dockerize whisper.cpp (#1674 ) * build: add dockerfile for ci * ci: add action to build/push docker image * fix: lowercase repository to fix ci * ci: update cuBLAS flag * build: install curl and ffmped in image * docs: add docker section * fix: improve args check when download model	2023-12-22 11:16:02 +00:00
Georgi Gerganov	88112c8afb	release : v1.5.2	2023-12-14 17:56:39 +02:00
fraxy-v	fd99ece8e3	wchess : whisper assisted chess (#1595 ) * wchess: whisper assisted chess * wchess: fix allowed moves in check * wchess: touchstart, touchend events * wchess: css, disabled button * wchess : html touches * wchess : minor fixes and code style * wchess : bump encoder context to 1280 * wchess : index.html * wchess : fix CI warnings * wchess : add array header * wchess : build static library * wchess : display grammar * wchess : update UX * wchess : add comment * wchess : add README --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>	2023-12-14 15:58:26 +02:00
Hang	641f2f4282	readme : update help (#1560 )	2023-11-27 12:04:08 +02:00
Georgi Gerganov	9d6ebd877c	release : v1.5.1	2023-11-24 12:41:55 +02:00
Georgi Gerganov	34209a37a2	readme : add server example	2023-11-23 17:20:33 +02:00
Georgi Gerganov	d38af151a1	release : v1.5.0	2023-11-15 21:02:52 +02:00
Georgi Gerganov	bfbaa4dce5	whisper : make large version explicit + fix data size units (#1493 )	2023-11-15 19:42:25 +02:00
Georgi Gerganov	9f8bbd3fee	readme : update comment about source code	2023-11-12 17:47:37 +02:00
Georgi Gerganov	684bc8bd70	readme : update GPU / CUDA	2023-11-12 15:40:37 +02:00
Georgi Gerganov	6a5d195109	release : v1.4.3	2023-11-07 16:15:48 +02:00
Georgi Gerganov	2cdfc4e025	whisper : add support for large v3 (#1444 ) * whisper : add support for large v3 * bench : fix build + fix go bindings * bench : fix n_mels * models : update readme	2023-11-07 15:30:18 +02:00
jorismertz	9a7074d4aa	README : fix typo (#1362 )	2023-10-13 16:53:23 +01:00
Neil Chudleigh	9edbd0a204	extra: Add benchmark script implemented in Python (#1298 ) * Create bench.py * Various benchmark results * Update benchmark script with hardware name, and file checks * Remove old benchmark results * Add git shorthash * Round to 2 digits on calculated floats * Fix the header reference when sorting results * FIx order of models * Parse file name * Simplify filecheck * Improve print run print statement * Use simplified model name * Update benchmark_results.csv * Process single or lists of processors and threads * Ignore benchmark results, dont check in * Move bench.py to extra folder * Readme section on how to use * Move command to correct location * Use separate list for models that exist * Handle subprocess error in git short hash check * Fix filtered models list initialization	2023-09-25 23:45:15 +08:00
JJ	7e1592d2cd	readme: Fix spelling error (#1290 ) Fixed branding error: Javascript to JavaScript	2023-09-21 15:55:33 +08:00
Artyom Mezin	903c9579b8	examples: Update README.md of main.cpp (#1306 )	2023-09-18 22:14:36 +08:00
Georgi Gerganov	93935980f8	whisper : Metal and ggml-alloc support (#1270 ) * metal : init * whisper : factor out graph builds * whisper : allocate encoder and decoder using ggml-alloc * whisper : ggml-alloc is now supported * whisper : CoreML support ggml-alloc * build : fix ggml-alloc * ios : update submodule * extra : update sync-ggml.sh script to also sync ggml-alloc * ci : see if this is causing the crash * whisper : refactor ggml-alloc init * whisper.android : try to fix build * whisper : initial Metal version * ci : try to debug vmem issue * metal : decoder works on GPU! * metal : add multi-decoder support * ggml : fix ggml_nbytes (probably temp solution) * metal : run "cross" step on the GPU * whisper : remove ggml_repeat in the encoder * whisper : offload the Encoder to Metal * ggml : use simpler ggml_bytes() implementation * ggml-alloc : try to make CI happy by reducing vram to 128GB * whisper : add whisper_allocr to wrap ggml_allocr * whisper : factor out alloc init in a function * cmake : update to support Metal build * whisper : add <functional> header * objc : fix build (no Metal yet) * ios : add Metal support * swiftui : fix build * metal : speed-up KQ multiplication * metal : sync latest llama.cpp kernels * readme : add Metal info * ios : update submodule * coreml : add code to toggle Core ML config (CPU, ANE, GPU) * bench : fix timings by running a pre-heat * bench : start benching the decoder * whisper : add ggml_mul_mat_pad * bench : fix uninitialized vars * whisper : add comment for disabling mul-mat padding * whisper : add description of ggml_mul_mat_pad * whisper : clean-up ggml_mul_mat_pad * metal : remove the "concurrent" flag * bench : variable n_past * ios : update SPM package	2023-09-15 12:18:18 +03:00
布客飞龙	6780c98e19	readme : update CMake build commands (#1231 ) * Update README.md * Update README.md: `vcpkg install opencl clblast` * readme : update build commands --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>	2023-09-05 13:53:34 +03:00
Fangjun Kuang	aad2dad38a	whisper : minor fixes (#1154 )	2023-08-27 19:02:00 +03:00
Ryan Metcalfe	1fa360fc6e	readme : add OpenVINO support details (#1112 )	2023-07-25 19:07:59 +03:00
Martin Warnaar	176d7e4e7b	readme : better wording (#1064 )	2023-07-04 15:30:31 +03:00
Georgi Gerganov	70e6fcd78b	readme : add tinydiarize instructions (#1058 )	2023-07-04 09:51:22 +03:00
GiviMAD	bc2dcf85fe	readme : add java alternative binding (#1029 ) Signed-off-by: Miguel Álvarez <miguelwork92@gmail.com>	2023-06-25 14:46:07 +03:00
Larry Battle	a7f822ef59	readme : corrected syntax for markdown link (#995 )	2023-06-25 13:46:44 +03:00
0xsourcecode	4e16a8fb63	readme : highlight OpenBLAS support (#956 ) * highlight openblas support * Update README.md	2023-05-24 11:23:51 +03:00
Nicholas Albion	bc89f285d8	bindings : add java bindings (#931 ) * WIP - java bindings * updated README * failed attempt at JNI * fullTranscribe() test passes * tested on Ubuntu 20 * link to Java bindings	2023-05-20 18:25:02 +03:00
Georgi Gerganov	a5defbc1b9	release : v1.4.2	2023-05-14 19:06:45 +03:00
Jhen-Jie Hong	16564f554f	readme : improve Core ML model conversion guidance (#915 )	2023-05-14 18:11:08 +03:00
Clifford Heath	9931d66400	readme : add instructions on converting to GGML + "--no-config" to wget (#874 )	2023-05-08 20:58:36 +03:00
Vulcan	919e58b96a	readme : partial OpenCL GPU support via CLBlast (#863 ) * ggml : CLBlast support as in llama.cpp Building with CLBlast speeds up whisper.cpp ~2x on low end / older AMD APUs (CPU with integrated GPU) such as the A9. Usage: WHISPER_CLBLAST=1 make * CMake/Makefile : CLBlast support as in llama.cpp Building with CLBlast speeds up whisper.cpp ~2x on low end / older AMD APUs (CPU with integrated GPU) such as the A9. Usage: ``` Makefile: cd whisper.cpp WHISPER_CLBLAST=1 make CMake: cd whisper.cpp ; mkdir build ; cd build cmake -DWHISPER_CLBLAST=ON .. make ``` * Update README.md Added OpenCL Build Instructions * Instruction: Partial OpenCL GPU support via CLBlast Added build instructions and examples for Make and CMake to support OpenCL enabled GPUs.	2023-05-03 19:24:43 +03:00
Georgi Gerganov	9c61f5f585	release : v1.4.1	2023-04-30 22:57:42 +03:00
Georgi Gerganov	fa8dbdc888	release : v1.4.0	2023-04-30 19:23:37 +03:00
Georgi Gerganov	794b162a46	whisper : add integer quantization support (#540 ) * whisper : add integer quantization support * examples : add common-ggml + prepare to add "quantize" tool * whisper : quantization tool ready * whisper : fix F32 support * whisper : try to fix shared lib linkage * wasm : update quantized models to Q5 * bench.wasm : remove "medium" button * bench.wasm : fix custom model button * ggml : add Q5_0 and Q5_1 WASM SIMD * wasm : add quantized models to all WASM examples * wasm : bump DB version number to 2 * talk-llama : update example to latest llama.cpp * node : increase test timeout to 10s * readme : add information for model quantization * wasm : add links to other examples	2023-04-30 18:51:57 +03:00
Georgi Gerganov	5fd1bdd7fc	whisper : add GPU support via cuBLAS (#834 ) * make : add WHISPER_CUBLAS * make : fix CUBLAS build * whisper : disable Flash Attention + adjust memory buffers * whisper : remove old commented code * readme : add cuBLAS instructions * cmake : add WHISPER_CUBLAS option * gitignore : ignore build-cublas	2023-04-30 12:14:33 +03:00
Georgi Gerganov	4d89ee2e59	readme : add logo	2023-04-28 22:41:29 +03:00
Georgi Gerganov	c23588cc4b	release : v1.3.0	2023-04-15 17:30:44 +03:00
Georgi Gerganov	355da83690	readme : fix link	2023-04-15 13:30:36 +03:00
Georgi Gerganov	3e5c49e59a	readme : add usage instructions for Core ML	2023-04-15 13:30:07 +03:00
Aaron Taylor	1c5edc3cb3	readme : add SwiftWhisper to listed bindings (#755 )	2023-04-14 20:24:00 +03:00
Alex Evgrashin	674a8e579b	readme : add unity bindings (#733 )	2023-04-14 19:59:44 +03:00
Sam	b73a4638ac	readme : make the quick start instructions clearer. (#716 ) Users wanting to make use of this implementation of the whisper model with no prior knowledge of C/C++ may download the Whisper model but fail to use of the "make" command as specified given that they forgot or didn't know they needed to clone the repository first. Hope this modification clears things up.	2023-04-14 19:33:06 +03:00
bocytko	ccb47e7e10	readme : add shell command example for --print-colors (#710 ) The section of the readme file explaining `--print-colors` includes only a screenshot with directories that are inconsistent with other examples. This commit adds an example shell command, consistent with the remaining examples.	2023-04-14 19:25:23 +03:00
Zigfrid Zvezdin	859ffc994e	misc : typo (#688 )	2023-03-30 07:51:33 +03:00
Georgi Gerganov	82637b8e9f	readme : add talk-llama example to the table	2023-03-27 21:02:35 +03:00
jwijffels	aec01bb337	Include link to R wrapper in README (#626 )	2023-03-22 22:28:22 +02:00
Jhen-Jie Hong	a5e60c019d	readme : add react-native bindings (#619 )	2023-03-22 21:39:02 +02:00
Georgi Gerganov	1beff6f66d	models : change HF hosting from dataset to model	2023-03-22 20:44:56 +02:00
Georgi Gerganov	fa9d43181f	readme : add bench-wts.sh demo	2023-03-06 21:06:27 +02:00
Georgi Gerganov	ad1389003d	release : v1.2.1	2023-02-28 22:29:12 +02:00
Aaron Pham	d176160f6f	readme : add pybind11 bindings (#538 )	2023-02-27 21:02:11 +02:00
Georgi Gerganov	ca21f7ab16	readme : add cython bindings (#9 )	2023-02-24 08:46:06 +02:00
Georgi Gerganov	2407ae8ef0	readme : add Ruby discussion + update .NET discussion	2023-02-15 19:51:54 +02:00
Georgi Gerganov	9764782bd9	readme : add another .NET repo (#303 )	2023-02-14 20:04:03 +02:00
Georgi Gerganov	3b010f9bed	readme : add .NET repo (#303 )	2023-02-11 17:35:33 +02:00
Georgi Gerganov	b2083c5d02	release : v1.2.0	2023-02-04 09:49:49 +02:00
Georgi Gerganov	f3ee4a9673	whisper : reduce memory usage during inference (#431 ) * ggml : add "scratch" buffer support * ggml : support for scratch ring-buffer * ggml : bug fix in ggml_repeat() * ggml : error on scratch buffer overflow * whisper : use scratch buffers during inference (base model only) * whisper : update memory usage for all models * whisper : fix encoder memory usage * whisper : use whisper_context functions instead of macros * whisper : fix FF + remove it from README * ggml : reuse ggml_new_i32 * ggml : refactor the scratch buffer storage * whisper : reorder scratch buffers in the decoder * main : add option to disable temp fallback * Update README.md	2023-02-04 09:45:52 +02:00
Georgi Gerganov	2c3f50a021	release : v1.1.1	2023-01-23 20:23:44 +02:00
Georgi Gerganov	874bde887e	Update README.md	2023-01-16 18:47:31 +02:00
Georgi Gerganov	8738427dd6	cmake : bump version to 1.1.0	2023-01-15 14:33:13 +02:00
Georgi Gerganov	0b85e8c401	Update README.md	2023-01-15 11:36:20 +02:00
Georgi Gerganov	8de452c18b	Improve decoding (#291 ) * whisper : prepare infra for new decoding strategies * whisper : apply logit filters and compute logprobs * whisper : add whisper_get_logits() * whisper : separate self and cross attention memory Initial step needed for supporting parallel decoders * whisper : move probs_id buffer to whisper_context * whisper : refactor kv cache into separate struct * whisper : move self-attention kv cache to whisper_decoder * whisper : wip decoding parameters + strategies * whisper : wip decoding parameters + strategies (part 2) * whisper : wip decoding parameters + strategies (part 3) * whisper : wip decoding parameters + strategies (part 4) * whisper : fix prompt_past update to not include prompt_init * whisper : temperature + best_of support * whisper : support for compression_ration_threshold We actually use entropy, but it is similar * command : fix example to use logits instead of obsolete probs * whisper : handle empty sequence ranking * whisper : add WHISPER_DEBUG + diagnostic prints + new main args * whisper : minor fixes * whisper : add beam-search support * whisper : bug fix when there no previous context * whisper : add comments * stream : disable temperature fallback For real-time processing, we always want a single decoder running at T=0 * whisper.swiftui : update example - fix paths + add empty folders	2023-01-15 11:29:57 +02:00
Ian Bicking	5e9f33596f	readme : clarify main and stream usage (#391 ) Give an example of ./main that uses a sample file that's already there, and make the stream example clarify you need `make stream`	2023-01-08 20:18:41 +02:00
Thomas Fitzsimmons	1944e7c33e	whisper : document POWER VSX support	2023-01-05 23:53:00 +02:00
Georgi Gerganov	1480a5f1af	Update README.md Add SwiftUI example links	2022-12-23 11:02:46 +02:00
Georgi Gerganov	4c1fe0c813	Update README.md Add bindings links / discussions	2022-12-22 18:22:58 +02:00
Georgi Gerganov	afe2db0fe2	Add Roadmap	2022-12-16 23:41:57 +02:00
Georgi Gerganov	ea19ed33f1	Update README.md (#46 ) Add references to the new Android app	2022-12-16 19:28:51 +02:00
Georgi Gerganov	c37c2443c1	Update README.md (#56 )	2022-12-16 18:01:05 +02:00
Georgi Gerganov	812ae3ffbd	Update README.md	2022-12-12 20:20:51 +02:00
Georgi Gerganov	fcf515de60	bench.wasm : same as "bench" but runs in the browser (#89 )	2022-12-11 11:09:10 +02:00
Georgi Gerganov	3b1aacbe6d	talk : talk with AI in the terminal	2022-12-10 16:51:58 +02:00
Georgi Gerganov	3996ecc156	Update README.md	2022-12-07 05:15:46 +02:00
Georgi Gerganov	9fe7306f4b	models : add the new "large" model release by OpenAI The old "large" model is now renamed "large-v1". If you have been using it, make sure to rename it and download the new "large" model for best results.	2022-12-06 18:48:57 +02:00
Georgi Gerganov	6fd5358dd0	Update README.md	2022-11-27 11:30:32 +02:00
Georgi Gerganov	67e819baf4	minor : remove "examples/" prefix from the README	2022-11-26 13:07:54 +02:00
Georgi Gerganov	a425365b82	yt-wsp.sh : script to easily transcribe VODs Thanks to @DaniruKun ref: https://gist.github.com/DaniruKun/96f763ec1a037cc92fe1a059b643b818 Usage: cd whisper.cpp make ./examples/yt-wsp.sh <video-url>	2022-11-26 12:54:42 +02:00
Georgi Gerganov	e0e864d9ca	Update README.md	2022-11-26 11:56:55 +02:00
Georgi Gerganov	68ecadbbc9	command.wasm : add voice assistant example for the Web (#171 ) Same as the command-line tool "command", but runs in the browser Also, added helper script "extra/deploy-wasm.sh" and fixed some timing constants for the WASM examples.	2022-11-26 11:40:06 +02:00
Georgi Gerganov	1246dd023e	command : add demonstration video	2022-11-25 20:23:58 +02:00
Georgi Gerganov	bc88eb13c6	examples : add "command" tool (#171 )	2022-11-25 19:36:57 +02:00
Georgi Gerganov	b8ce25dec1	refactoring : more readable code	2022-11-25 19:28:04 +02:00
Georgi Gerganov	2c0501b38a	Update README.md	2022-11-24 20:06:51 +02:00

1 2 3 4 5

205 Commits