(Ubuntu x86_64) Segmentation Fault Running Q4_1_O Model #32

cryscan · 2023-04-18T14:32:10Z

System: Ubuntu 20.04.6 LTS
GCC: 9.4.0
CPU: Intel(R) Xeon(R) Platinum 8358P

Issue:

$ python rwkv/chat_with_bot.py /path/to/models/Raven-14B-v9-Q4.bin 
Loading 20B tokenizer
System info: AVX = 1 | AVX2 = 1 | AVX512 = 0 | FMA = 1 | NEON = 0 | ARM_FMA = 0 | F16C = 1 | FP16_VA = 0 | WASM_SIMD = 0 | BLAS = 0 | SSE3 = 1 | VSX = 0 | 
Loading RWKV model
Processing 92 prompt tokens, may take a while
Segmentation fault (core dumped)

saharNooby · 2023-04-18T15:52:22Z

Can it be reproduced with 169M model?

BuilderGuy1 · 2023-04-18T22:47:21Z

I'm having the same issue on Apple Silicon. I tested the latest 14B & 7B, each with all 3 quantization options. I also tried 3B v10 Q4_1_0.

dmahurin · 2023-04-19T04:19:29Z

Same issue on Apple M2, with all revisions, using Q4_1_0. Tried 3B and 169M.

poisson-fish · 2023-04-19T18:23:20Z

I can confirm this issue under Arch on WSL with RWKV-4-Raven-14B-v9-Eng99%-Other1%-20230412-ctx8192_q4_1_0.bin

L-M-Sherlock · 2023-04-20T04:29:41Z

I also encountered this problem on my M1 Mac. Tried 7B and 3B. It occurred before the "Processing 92 prompt tokens" message.

saharNooby · 2023-04-20T05:35:59Z

Probably address misalignment. I'm working on it in #33
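
For readers unfamiliar with the term: an address is misaligned when it is not a multiple of the boundary that a load or store instruction expects, which can crash on some code paths. The sketch below only illustrates the arithmetic of checking and rounding up to a boundary. It is not the change made in #33; the helper names and the 32-byte boundary are assumptions chosen for illustration.

```python
def is_aligned(address: int, alignment: int) -> bool:
    """Return True if address is a multiple of alignment (a power of two)."""
    return (address & (alignment - 1)) == 0


def align_up(address: int, alignment: int) -> int:
    """Round address up to the next multiple of alignment (a power of two)."""
    if alignment <= 0 or (alignment & (alignment - 1)) != 0:
        raise ValueError("alignment must be a positive power of two")
    return (address + alignment - 1) & ~(alignment - 1)


# Hypothetical values: a 32-byte boundary and an arbitrary byte offset.
ALIGNMENT = 32
offset = 100

padded = align_up(offset, ALIGNMENT)
print(is_aligned(offset, ALIGNMENT), padded, is_aligned(padded, ALIGNMENT))
```

With these assumed values, offset 100 is not on a 32-byte boundary, and rounding up gives 128, which is.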

saharNooby · 2023-04-20T06:04:59Z

Alignment fix merged. Please clone the repo from scratch and try again:

git clone --recursive https://github.com/saharNooby/rwkv.cpp.git

L-M-Sherlock · 2023-04-20T06:17:03Z

Thanks! #33 solved my problem, but another bug appeared. The bot only repeats: > Bob: Hello, Bob..

python rwkv/convert_pytorch_to_ggml.py ./RWKV-4-Raven-3B-v9x-Eng49%-Chn50%-Other1%-20230417-ctx4096.pth ./rwkv.cpp-3B.bin float16
python rwkv/quantize.py ./rwkv.cpp-3B.bin ./rwkv.cpp-3B-Q4_1_0.bin 4
python rwkv/chat_with_bot.py ./rwkv.cpp-3B-Q4_1_0.bin

saharNooby · 2023-04-20T06:36:52Z

@BuilderGuy1 @poisson-fish @dmahurin If possible, can you also confirm that the segfault is fixed? (Please clone from scratch, or don't forget to update git submodules.)

@L-M-Sherlock Thanks for the input. Looks like the default prompt is not a good fit for Raven; the related issue is #22.

dmahurin · 2023-04-20T06:51:54Z

Thanks @saharNooby. It works now on Apple M2 with 3B and 169M.

It also works with rwkv-4_raven-7b-v9 and rwkv-4_raven-14b-v9, though 14b is slow on M2.

saharNooby · 2023-04-20T10:47:08Z

Thanks for testing it!

poisson-fish · 2023-04-20T18:35:26Z

Sorry for the late reply @saharNooby. The previous crash is fixed; however, now I get SIGSEGV:

python rwkv/chat_with_bot.py ./build/models/RWKV/RWKV-4-Raven-14B-v9-Eng99\%-Other1\%-20230412-ctx8192_q4_1_0.bin
Loading 20B tokenizer
System info: AVX = 1 | AVX2 = 1 | AVX512 = 0 | FMA = 1 | NEON = 0 | ARM_FMA = 0 | F16C = 1 | FP16_VA = 0 | WASM_SIMD = 0 | BLAS = 0 | SSE3 = 1 | VSX = 0 |
Loading RWKV model
Processing 92 prompt tokens, may take a while
~/Documents/Projects/cpp/llamapi/rwkv.cpp/rwkv/rwkv_cpp_model.py:100: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly.  To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage()
  state_out.storage().data_ptr(),
~/Documents/Projects/cpp/llamapi/rwkv.cpp/rwkv/rwkv_cpp_model.py:101: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly.  To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage()
  logits_out.storage().data_ptr()
fish: Job 1, 'python rwkv/chat_with_bot.py ./…' terminated by signal SIGSEGV (Address boundary error)

I can open a new issue if necessary.

Edit: disregard, this required a submodule update and works now.

saharNooby closed this as completed Apr 20, 2023
