Releases: RWKV/rwkv.cpp
Releases · RWKV/rwkv.cpp
master-1198892
Add support for Q5_0, Q5_1 and Q8_0 formats; remove Q4_1_O format (#44) * Remove Q4_3 support * Add Q5_0, Q5_1, Q8_0 support * Add more clear message when loading Q4_3 model * Remove Q4_1_O format * Fix indentation in .gitmodules * Simplify sanitizer matrix
master-06dac0f
Use main ggml repo (#45)
master-c736ef5
Improve chat_with_bot.py script (#39)
master-3587ff9
Sync ggml with upstream (#38) * Sync ggml with upstream * Remove file filters from Actions triggers * Update ggml * Add Q4_2 and Q4_3 support * Improve output of perplexity measuring script * Add tests for new formats * Add token limit argument to perplexity measuring script * Update README * Update README * Update ggml * Use master branch of ggml
master-1be9fda
Add robust automatic testing (#33)
master-7b28076
Fix Q4_1_O optimization
master-2ef7ee0
Optimize Q4_1_O by moving outlier multiplication out of the dequantiz… …e+dot loop
master-0a8157d
Merge pull request #28 from saharNooby/ggml-to-submodule Move ggml to submodule
master-84e0698
Merge pull request #16 from saharNooby/outliers-preserving-quantizati… …on-PR Add Q4_1_O quantization format that preserves outliers in weights and does dot in FP32
master-5d99741
Merge pull request #18 from yorkzero831/master Update github action to support linux and macos asset uploading