OVEP: Bug Fixes, Refactoring, and Contrib Ops Update #23742

saurabhkale17 · 2025-02-18T17:06:49Z

Description

This pull request combines multiple improvements and bug fixes for the OpenVINO Execution Provider (OVEP). The changes are summarized as follows:

1. Support for various contrib Ops in OVEP.
2. Dimension Check Fixes for Greater, Pad, and MAX Ops: Fixed dimension check failures for the Greater, Pad, and MAX ops in OVEP, so that they now pass validation for all supported models.
3. Refactor Core and Shared Context Lifetimes: Refactored the lifetimes of the OpenVINO core and shared context to remove the dependency on shutdown calls. This change avoids relying on static lifetime management and improves stability and resource cleanup.
4. Fix for Duplicate DQ Node Removal: Addressed an issue where duplicate DequantizeLinear (DQ) nodes that were initializers were incorrectly removed. Initializers should always be preserved, and this fix ensures that all duplicate DQ nodes that are initializers are retained.
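To make the duplicate-DQ rule concrete, here is a minimal sketch. It assumes that "duplicate" means two DQ nodes reading the same input tensor, and that a node counts as initializer-backed when that input is a constant initializer. The `DQNode` struct and `RemoveDuplicateDQ` function are hypothetical illustrations, not ONNX Runtime or OVEP types.

```cpp
#include <set>
#include <string>
#include <vector>

// Hypothetical, simplified stand-in for a DequantizeLinear node.
// This is not an ONNX Runtime type.
struct DQNode {
  std::string name;
  std::string input;          // name of the quantized input tensor
  bool input_is_initializer;  // assumption: true when the input is a constant initializer
};

// Returns the nodes that survive de-duplication.
// Initializer-backed nodes are always kept, even when they look like duplicates.
std::vector<DQNode> RemoveDuplicateDQ(const std::vector<DQNode>& nodes) {
  std::set<std::string> seen_inputs;
  std::vector<DQNode> kept;
  for (const auto& node : nodes) {
    if (node.input_is_initializer) {
      kept.push_back(node);  // never drop initializer-backed DQ nodes
      continue;
    }
    // Keep only the first non-initializer DQ node for each input tensor.
    if (seen_inputs.insert(node.input).second) {
      kept.push_back(node);
    }
  }
  return kept;
}
```

With two activation-fed DQ nodes on the same input and two initializer-backed DQ nodes on the same weight, only the second activation-fed node is dropped.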

* update: Update MSFT Contrib Ops from OV
* modified data_ops.cc to remove unsupported ops
* disabled tests for EmbedLayerNormalisation and MatMulNBits

---------

Co-authored-by: n1harika <niharika.sathish@intel.com>

* Internal ci for PTL 1.1 (#523)
* update: Update MSFT Contrib Ops in OVEP (#521)
* update: Update MSFT Contrib Ops from OV
* modified data_ops.cc to remove unsupported ops
* disabled tests for EmbedLayerNormalisation and MatMulNBits

---------

Co-authored-by: n1harika <niharika.sathish@intel.com>

* Add Max op to no_dimension_supported list

---------

Co-authored-by: jatinwadhwa921 <110383850+jatinwadhwa921@users.noreply.github.com>
Co-authored-by: Ankit Maheshkar <ankit.maheshkar@intel.com>
Co-authored-by: n1harika <niharika.sathish@intel.com>
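The "no_dimension_supported list" mentioned above can be pictured as an allow-list consulted during the dimension check. The sketch below assumes the check rejects an op when one of its inputs has no dimensions (a scalar) unless the op is on that list. The function names, the `Shape` alias, and the list contents (only the two ops named in these commits) are illustrative assumptions, not the contents of data_ops.cc.

```cpp
#include <set>
#include <string>
#include <vector>

// Assumption: an empty vector models a tensor with no dimensions (a scalar).
using Shape = std::vector<long long>;

// Hypothetical allow-list of ops accepted with dimensionless inputs.
// Only the ops named in these commits are listed here.
const std::set<std::string>& NoDimensionSupported() {
  static const std::set<std::string> ops = {"Greater", "Max"};
  return ops;
}

// Returns false when any input is dimensionless and the op is not allow-listed.
bool PassesNoDimensionCheck(const std::string& op_type,
                            const std::vector<Shape>& input_shapes) {
  for (const auto& shape : input_shapes) {
    if (shape.empty() && NoDimensionSupported().count(op_type) == 0) {
      return false;
    }
  }
  return true;
}
```

Under these assumptions, adding an op name to the set is enough to let it through the check when it receives a scalar input.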

The Python bindings don't call the provider factory shutdown method, which we relied on to avoid destruction-order issues with statically scoped ov::Core objects. Refactor the ov::Core and shared context lifetimes so that we no longer rely on shutdown calls to manage lifetimes and the core no longer has a static lifetime.

Co-authored-by: Eric Crawford <eric.r.crawford@intel.com>
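One common way to share an expensive object without giving it a static lifetime is to cache only a `std::weak_ptr` to it, so the object is destroyed when its last user releases it rather than during static destruction. The sketch below illustrates that general pattern. `FakeCore` and `AcquireCore` are hypothetical names, and this is not a claim about how the commit itself is implemented.

```cpp
#include <memory>
#include <mutex>

// Hypothetical stand-in for an expensive runtime object such as ov::Core.
struct FakeCore {
  static int live_count;  // number of FakeCore objects currently alive
  FakeCore() { ++live_count; }
  ~FakeCore() { --live_count; }
};
int FakeCore::live_count = 0;

// Hands out a shared instance without keeping it alive itself.
// The static weak_ptr only observes the object, so the object's destructor runs
// as soon as the last shared_ptr owner goes away, with no shutdown call needed.
std::shared_ptr<FakeCore> AcquireCore() {
  static std::mutex mutex;
  static std::weak_ptr<FakeCore> cached;
  std::lock_guard<std::mutex> lock(mutex);
  std::shared_ptr<FakeCore> core = cached.lock();
  if (!core) {
    core = std::make_shared<FakeCore>();  // first user, or previous instance expired
    cached = core;
  }
  return core;
}
```

Callers that hold the returned `std::shared_ptr` as a member keep the object alive exactly as long as they exist; a later call after all owners are gone creates a fresh instance.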

…582)

This change addresses an issue where a Pad op in quantized models fails due to an unsupported dimension input. The fix adds logic to detect if a Pad op is part of a quantized model by checking for a DequantizeLinear input. If found, the op is marked as quantized and the unsupported dimension check is bypassed, ensuring that the pad_value remains constant as required by the VPUX compiler.

Related-to: EISW-152222

Co-authored-by: Surendar Rama Sitaraman <surendar.rama.sitaraman@intel.com>
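The detection logic described in this commit message can be sketched roughly as follows. The `Node` struct is a hypothetical, simplified graph model rather than the ONNX Runtime graph API, and the `has_unsupported_dim_input` flag stands in for whatever the real dimension check computes.

```cpp
#include <string>
#include <vector>

// Hypothetical, simplified node model; not the ONNX Runtime graph API.
struct Node {
  std::string op_type;
  std::vector<const Node*> input_producers;  // nodes that produce this node's inputs
  bool has_unsupported_dim_input;            // assumption: result of the dimension check
};

// True when at least one input of the node is produced by a DequantizeLinear node.
bool FedByDequantizeLinear(const Node& node) {
  for (const Node* producer : node.input_producers) {
    if (producer != nullptr && producer->op_type == "DequantizeLinear") {
      return true;
    }
  }
  return false;
}

// A Pad op that is part of a quantized model bypasses the dimension check;
// every other node is accepted only if the check found nothing unsupported.
bool IsNodeAccepted(const Node& node) {
  const bool is_quantized_pad = node.op_type == "Pad" && FedByDequantizeLinear(node);
  if (is_quantized_pad) {
    return true;
  }
  return !node.has_unsupported_dim_input;
}
```

In this model, a Pad node fed by a DequantizeLinear node is accepted even when the dimension check would flag it, while the same Pad node fed by any other producer is still rejected.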

saurabhkale17 · 2025-02-18T17:39:12Z

@jywu-msft

jywu-msft · 2025-02-18T18:21:44Z

/azp run Linux OpenVINO CI Pipeline

azure-pipelines · 2025-02-18T18:21:56Z

Azure Pipelines successfully started running 1 pipeline(s).

yihonglyu · 2025-02-19T15:52:31Z

/azp run Linux CPU CI Pipeline, Linux CPU Minimal Build E2E CI Pipeline, Linux GPU CI Pipeline, Linux GPU TensorRT CI Pipeline, MacOS CI Pipeline, ONNX Runtime Web CI Pipeline, onnxruntime-binary-size-checks-ci-pipeline, Linux QNN CI Pipeline

azure-pipelines · 2025-02-19T15:53:10Z

Azure Pipelines successfully started running 8 pipeline(s).

yihonglyu · 2025-02-19T19:55:16Z

/azp run Windows CPU CI Pipeline, Windows GPU CI Pipeline, Windows GPU TensorRT CI Pipeline, Windows ARM64 QNN CI Pipeline, Windows x64 QNN CI Pipeline, Big Models

azure-pipelines · 2025-02-19T19:55:42Z

Azure Pipelines successfully started running 5 pipeline(s).

yihonglyu · 2025-02-19T22:13:56Z

/azp run Linux Android Emulator QNN CI Pipeline, Windows GPU CUDA CI Pipeline, Windows GPU DML CI Pipeline, Windows GPU Doc Gen CI Pipeline

azure-pipelines · 2025-02-19T22:14:15Z

Azure Pipelines successfully started running 4 pipeline(s).

yihonglyu · 2025-02-19T22:15:28Z

/azp run Win TRT Minimal CUDA Test CI Pipeline

azure-pipelines · 2025-02-19T22:15:36Z

No pipelines are associated with this pull request.

yihonglyu · 2025-02-19T22:16:52Z

/azp run Windows TRT Minimal CUDA Test CI Pipeline

azure-pipelines · 2025-02-19T22:16:59Z

No pipelines are associated with this pull request.

yihonglyu · 2025-02-19T22:17:27Z

/azp run Windows TensorRT Minimal CUDA Test CI Pipeline

azure-pipelines · 2025-02-19T22:17:33Z

No pipelines are associated with this pull request.

yihonglyu · 2025-02-19T22:18:29Z

/azp run Win_TRT_Minimal_CUDA_Test_CI

azure-pipelines · 2025-02-19T22:18:41Z

Azure Pipelines successfully started running 1 pipeline(s).

yihonglyu · 2025-02-19T23:49:16Z

Could you list the contrib ops supported by this PR?

saurabhkale17 · 2025-02-20T04:57:17Z

These are the contrib ops:

* SkipLayerNormalization
* MatMulNBits
* FusedGemm
* FusedConv
* EmbedLayerNormalization
* BiasGelu
* Attention

@yihonglyu

jywu-msft · 2025-02-20T05:05:42Z

/azp run Linux GPU CI Pipeline

azure-pipelines · 2025-02-20T05:05:55Z

Azure Pipelines successfully started running 1 pipeline(s).

yihonglyu · 2025-02-20T15:22:07Z

@saurabhkale17 Please list them in the git commit message.

yihonglyu

Could you enable tests for the contrib ops added in this PR?

onnxruntime/test/contrib_ops/embed_layer_norm_op_test.cc

onnxruntime/core/providers/openvino/ov_versions/data_ops.cc

yihonglyu

Please add tests for:

* Dimension check fixes for Greater, Pad, and MAX ops.
* Fix for duplicate DQ node removal.

in the following PR.

### Description

This pull request combines multiple improvements, bug fixes for the OpenVINO Execution Provider (OVEP). The changes are summarized as follows:

1. Support for various contrib Ops in OVEP.
2. Dimension Check Fixes for Greater, Pad, and MAX Ops: Fixed dimension check failures for the Greater, Pad, and MAX ops in OVEP, ensuring they now pass validation for all supported models.
3. Refactor Core and Shared Context Lifetimes: Refactored the lifetimes of the OpenVINO core and shared context to remove dependency on shutdown calls. This change avoids relying on static lifetime management and improves stability and resource cleanup.
4. Fix for Duplicate DQ Node Removal: Addressed an issue where duplicate Dequantize (DQ) nodes that were initializers were incorrectly removed. Initializers should always be preserved, and this fix ensures that all duplicate DQ nodes that are initializers are retained.

---------

Co-authored-by: Ankit Maheshkar <ankit.maheshkar@intel.com>
Co-authored-by: n1harika <niharika.sathish@intel.com>
Co-authored-by: rayngun <103146671+rayngun@users.noreply.github.com>
Co-authored-by: jatinwadhwa921 <110383850+jatinwadhwa921@users.noreply.github.com>
Co-authored-by: Eric Crawford <eric.r.crawford@intel.com>
Co-authored-by: Surendar Rama Sitaraman <surendar.rama.sitaraman@intel.com>

ankitm3k and others added 7 commits February 17, 2025 14:55

update: Update MSFT Contrib Ops in OVEP (#521)

dae8097

* update: Update MSFT Contrib Ops from OV
* modified data_ops.cc to remove unsupported ops
* disabled tests for EmbedLayerNormalisation and MatMulNBits

---------

Co-authored-by: n1harika <niharika.sathish@intel.com>

Add Greater Op to dimension check (#560)

41ef60e

handling duplicate dq which are initializers (#575)

9ad8e3d

fix lint issues

b73284e

jywu-msft requested a review from yihonglyu February 19, 2025 00:03

yihonglyu reviewed Feb 20, 2025

onnxruntime/test/contrib_ops/embed_layer_norm_op_test.cc

ankitm3k mentioned this pull request Feb 20, 2025

Additional Contrib Ops Support from OV 2025.0 Onwards intel/onnxruntime#585

Closed

yihonglyu reviewed Feb 20, 2025

onnxruntime/core/providers/openvino/ov_versions/data_ops.cc

yihonglyu reviewed Feb 20, 2025

jywu-msft approved these changes Feb 21, 2025

jywu-msft merged commit 754ee21 into microsoft:main Feb 21, 2025
76 checks passed
