Skip to content

Commit

Permalink
Fix Q4_1_O optimization
Browse files Browse the repository at this point in the history
  • Loading branch information
saharNooby committed Apr 18, 2023
1 parent 2ef7ee0 commit 7b28076
Show file tree
Hide file tree
Showing 2 changed files with 2 additions and 2 deletions.
2 changes: 1 addition & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -90,7 +90,7 @@ python rwkv/quantize.py ~/Downloads/rwkv.cpp-169M.bin ~/Downloads/rwkv.cpp-169M-

Formats available:

- `4`: `Q4_1_O`, OK quality, fast (comparable to `FP16`).
- `4`: `Q4_1_O`, OK quality, moderately fast (20% slower than `FP16`).
- `3`: `Q4_1`, worst quality, fast (comparable to `FP16`).
- `2`: `Q4_0`, poor quality, very fast.

Expand Down
2 changes: 1 addition & 1 deletion ggml
Submodule ggml updated from fbf4d6 to 538e51

0 comments on commit 7b28076

Please sign in to comment.