Skip to content

SujanNeupane42/LLM_Quantization

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 

Repository files navigation

LLM_Quantization

This repo contains a jupyter notebook that will utilize the GPTQ technique to quantize LLMs. An in-depth explanation combined with examples is included in the notebook which you can follow to quantize any of the LLMs. For simplicity purposes, I have quantized an open-source language model from huggingface called dlite-v2-355m.