Skip to content

Add Q4_1_O quantization format that preserves outliers in weights and does dot in FP32 #21

Add Q4_1_O quantization format that preserves outliers in weights and does dot in FP32

Add Q4_1_O quantization format that preserves outliers in weights and does dot in FP32 #21