Fix dtype parsing #97

Merged (2 commits, Jun 18, 2023)
Conversation

@minosvasilias (Contributor) commented Jun 16, 2023

This fixes the issue noted in #94, as well as other --dtype arguments that were not being parsed correctly.

  • Ignore the dtype model arg if 8bit is specified as the --dtype argument (only set load_in_8bit instead)
  • Use getattr to parse the string value (turning "float16" into torch.float16, etc.)
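The two points above can be sketched roughly as follows. This is a minimal illustration of the parsing idea, not the PR's actual code: the function name `parse_dtype_args` is hypothetical, and a `SimpleNamespace` stands in for the `torch` module so the sketch runs without torch installed.

```python
from types import SimpleNamespace

def parse_dtype_args(dtype, torch_module):
    # Hypothetical sketch: map a --dtype string to model-loading kwargs.
    if dtype == "8bit":
        # Don't set a torch_dtype at all; only request 8-bit loading.
        return {"load_in_8bit": True}
    # Resolve e.g. "float16" to torch.float16 via getattr on the torch module.
    return {"torch_dtype": getattr(torch_module, dtype)}

# Stand-in for the real torch module (assumption for this sketch).
fake_torch = SimpleNamespace(float16="torch.float16", bfloat16="torch.bfloat16")

print(parse_dtype_args("8bit", fake_torch))     # {'load_in_8bit': True}
print(parse_dtype_args("float16", fake_torch))  # {'torch_dtype': 'torch.float16'}
```

With the real `torch` module in place of `fake_torch`, the same `getattr` call yields the actual `torch.float16` dtype object.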

- Ignore dtype model arg if 8bit specified
- Use getattr to parse string argument
@minosvasilias (Contributor, Author)
I also added support for the load_in_4bit argument introduced recently: huggingface/transformers#23479
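The 4-bit path presumably follows the same pattern as the 8-bit one: the quantized choices map to loader flags rather than to a torch dtype. A hedged sketch, with the helper name `quantization_kwargs` invented for illustration:

```python
def quantization_kwargs(dtype):
    # Hypothetical helper: map quantized --dtype choices to the
    # corresponding from_pretrained() keyword arguments.
    if dtype == "8bit":
        return {"load_in_8bit": True}
    if dtype == "4bit":
        # load_in_4bit was introduced in huggingface/transformers#23479
        return {"load_in_4bit": True}
    return {}

print(quantization_kwargs("4bit"))  # {'load_in_4bit': True}
```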

@lbeurerkellner (Collaborator)
Awesome, thanks a lot.

@lbeurerkellner lbeurerkellner merged commit 6aca8e5 into eth-sri:main Jun 18, 2023