Support bf16/fp16 per layer convert for WOQ #1802

Kaihui-intel · 2024-05-17T06:44:41Z

Type of Change

feature

Description

detail description

Expected Behavior & Potential Risk

the expected behavior that triggered by this PR

How has this PR been tested?

how to reproduce the test (including hardware information)

Dependency Change?

any library dependency introduced or removed

Signed-off-by: Kaihui-intel <kaihui.tang@intel.com>

github-actions · 2024-05-20T05:11:31Z

⛈️ Required checks status: Has failure 🔴

Warning
If you do not have the access to re-run the Probot, please contact XuehaoSun for help. If you push a new commit, all of the workflow will be re-triggered.

Groups summary

🟢 Code Scan Tests workflow

Check ID	Status
Code-Scan	success	✅
Code-Scan (Bandit Code Scan Bandit)	success	✅
Code-Scan (DocStyle Code Scan DocStyle)	success	✅
Code-Scan (Pylint Code Scan Pylint)	success	✅

These checks are required after the changes to neural_compressor/torch/algorithms/weight_only/utility.py, neural_compressor/torch/quantization/algorithm_entry.py, neural_compressor/torch/utils/environ.py.

🟢 Model Tests 3x workflow

Check ID	Status
Model-Test-3x	success	✅
Model-Test-3x (Generate Report GenerateReport)	success	✅
Model-Test-3x (Run PyTorch Model opt_125m_woq_gptq_int4)	success	✅
Model-Test-3x (Run PyTorch Model opt_125m_woq_gptq_int4_dq_bnb)	success	✅
Model-Test-3x (Run PyTorch Model opt_125m_woq_gptq_int4_dq_ggml)	success	✅

These checks are required after the changes to neural_compressor/torch/algorithms/weight_only/utility.py, neural_compressor/torch/quantization/algorithm_entry.py, neural_compressor/torch/utils/environ.py.

🔴 Unit Tests 3x-PyTorch workflow

Check ID	Status	Error details
UT-3x-Torch	failure		❌
UT-3x-Torch (Coverage Compare CollectDatafiles)	failure	download	❌
UT-3x-Torch (Unit Test 3x Torch Unit Test 3x Torch)	success		✅
UT-3x-Torch (Unit Test 3x Torch baseline Unit Test 3x Torch baseline)	success		✅

These checks are required after the changes to neural_compressor/torch/algorithms/weight_only/utility.py, neural_compressor/torch/quantization/algorithm_entry.py, neural_compressor/torch/utils/environ.py, test/3x/torch/quantization/weight_only/test_rtn.py.

Thank you for your contribution! 💜

Note
This comment is automatically generated and will be updates every 180 seconds within the next 6 hours. If you have any other questions, contact chensuyue or XuehaoSun for help.

for more information, see https://pre-commit.ci

chensuyue · 2024-05-20T09:19:42Z

Not need anymore

Kaihui-intel added 2 commits March 14, 2024 15:39

support rtn fp16

433d189

Signed-off-by: Kaihui-intel <kaihui.tang@intel.com>

minor fix

fbc8954

Signed-off-by: Kaihui-intel <kaihui.tang@intel.com>

chensuyue added the WIP label May 17, 2024

chensuyue added this to the v2.6 milestone May 17, 2024

rebase

b992f3a

Signed-off-by: Kaihui-intel <kaihui.tang@intel.com>

[pre-commit.ci] auto fixes from pre-commit.com hooks

e5d2975

for more information, see https://pre-commit.ci

chensuyue closed this May 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support bf16/fp16 per layer convert for WOQ #1802

Support bf16/fp16 per layer convert for WOQ #1802

Kaihui-intel commented May 17, 2024

github-actions bot commented May 20, 2024 •

edited

chensuyue commented May 20, 2024

Support bf16/fp16 per layer convert for WOQ #1802

Support bf16/fp16 per layer convert for WOQ #1802

Conversation

Kaihui-intel commented May 17, 2024

Type of Change

Description

Expected Behavior & Potential Risk

How has this PR been tested?

Dependency Change?

github-actions bot commented May 20, 2024 • edited

⛈️ Required checks status: Has failure 🔴

Groups summary

chensuyue commented May 20, 2024

github-actions bot commented May 20, 2024 •

edited