
Use deformable_detr kernel from the Hub #36853

Merged

3 commits merged into huggingface:main from kernels-deformable-detr on Mar 21, 2025

Conversation

danieldk (Member)

What does this PR do?

Remove the deformable_detr kernel from kernels/ and use the pre-built kernel from the Hub instead.
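For context, a minimal sketch of what fetching a pre-built kernel from the Hub looks like with the kernels package; the get_kernel call is the package's loading helper, and the kernels-community/deformable-detr repo id is written here for illustration rather than quoted from this PR.

# Sketch (not the PR's exact code): fetch a pre-built kernel from the Hub
# with the `kernels` package instead of compiling sources under kernels/.
from kernels import get_kernel

# Downloads and caches a binary build matching the local torch/CUDA setup,
# then loads it as a regular Python module.
deformable_detr = get_kernel("kernels-community/deformable-detr")

# The module exposes the compiled ops; exact attribute names depend on the repo.
print(dir(deformable_detr))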

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a Github issue or the forum? Please add a link
    to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the
    documentation guidelines, and
    here are tips on formatting docstrings.
  • Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@ArthurZucker @LysandreJik


github-actions bot marked this pull request as draft on March 20, 2025 12:10

Hi 👋, thank you for opening this pull request! The pull request is converted to draft by default. When it is ready for review, please click the Ready for review button (at the bottom of the PR page).

danieldk force-pushed the kernels-deformable-detr branch 5 times, most recently from d6bef46 to 05b6ee7, on March 20, 2025 12:49
ArthurZucker (Collaborator) left a comment

Super super super nice! 🚀 Let's just wait for @LysandreJik's opinion on adding a new core dep!

setup.py Outdated
@@ -432,6 +434,7 @@ def run(self):
install_requires = [
deps["filelock"], # filesystem locks, e.g., to prevent parallel downloads
deps["huggingface-hub"],
deps["kernels"], # download kernels from the Hub
Collaborator

That's the only place I'd want to wait for @LysandreJik's input on! IMO this is the long-term plan for sure; wondering if we are gonna do that slowly (first as an optional soft dep) or not!
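For readers not steeped in setup.py, the trade-off being weighed is roughly the following; the extra name "hub-kernels" is made up for the example.

# Illustration only: "core dep" vs "optional soft dep" in setup.py terms.
# Core dependency: installed with every `pip install transformers`.
install_requires = [
    "kernels",  # download kernels from the Hub
]
# Soft dependency: only installed via an extra, e.g.
# `pip install transformers[hub-kernels]`, with a try/except ImportError
# fallback in the code. The extra name "hub-kernels" is hypothetical.
extras_require = {
    "hub-kernels": ["kernels"],
}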

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

danieldk marked this pull request as ready for review on March 21, 2025 11:06
ArthurZucker (Collaborator) left a comment

Let's GO!!! 🤗


register_kernel_mapping(_KERNEL_MAPPING)

except ImportError:
Collaborator

perfect!
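For readers who want the surrounding context of this hunk, the guarded registration has roughly this shape; LayerRepository, the repo id, and the layer name are written from memory and should be treated as illustrative rather than the PR's exact code.

# Sketch of the guarded registration around the hunk above: if the optional
# `kernels` package is importable, map a layer name to a Hub repository;
# otherwise fall back silently to the plain PyTorch implementation.
try:
    from kernels import LayerRepository, register_kernel_mapping

    _KERNEL_MAPPING = {
        "MultiScaleDeformableAttention": {
            "cuda": LayerRepository(
                repo_id="kernels-community/deformable-detr",
                layer_name="MultiScaleDeformableAttention",
            )
        }
    }

    register_kernel_mapping(_KERNEL_MAPPING)

except ImportError:
    # `kernels` is not installed; models keep using the pure PyTorch path.
    pass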

Remove the `deformable_detr` kernel from `kernels/` and use the
pre-built kernel from the Hub instead.
Also add it to `testing`, so that the kernel replacement gets tested
when using CUDA in CI.
danieldk force-pushed the kernels-deformable-detr branch from 837ac4a to 96ae475 on March 21, 2025 11:37
ArthurZucker merged commit f94b0c5 into huggingface:main on Mar 21, 2025
21 of 23 checks passed
qubvel (Member) commented Mar 21, 2025

Hey @danieldk! Super nice to see kernels integrated! Is there a way to disable the kernel path dynamically? RT-DETR's full-graph compile and torch.export are broken now; they should go through the torch path.

ArthurZucker (Collaborator)

Ah, normally if CUDA is not available it should fall back to the normal forward, and if the kernel is not available it should do nothing.

ArthurZucker (Collaborator)

(@qubvel export can be a fast test, no? Sorry that it broke)

qubvel (Member) commented Mar 21, 2025

It's too slow, even though it's on a small model.

qubvel (Member) commented Mar 21, 2025

We discussed this with @danieldk on Slack; the idea is to have a way to control which path to go through, something like:

if self.disable_custom_kernels or is_torchdynamo_compiling():
    with use_kernel_mapping({}, inherit_mapping=False): # inherit_mapping would be a new argument
        output = self.attn(
            value,
            spatial_shapes,
            spatial_shapes_list,
            level_start_index,
            sampling_locations,
            attention_weights,
            self.im2col_step,
        )
else:
    ...
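A slightly fuller sketch of that idea follows, treating inherit_mapping as the proposed (not yet existing) argument, using torch.compiler.is_compiling() in place of the is_torchdynamo_compiling() helper, and with the wrapper name made up for illustration.

# Hypothetical sketch: temporarily clear the kernel mapping so that
# torch.compile / torch.export trace the pure PyTorch forward instead of
# dispatching to the kernel fetched from the Hub.
import torch
from kernels import use_kernel_mapping

def run_deformable_attention(self, *attn_args):
    if self.disable_custom_kernels or torch.compiler.is_compiling():
        # Empty mapping + no inheritance: kernelized layers fall back to
        # their regular forward inside this block.
        with use_kernel_mapping({}, inherit_mapping=False):
            return self.attn(*attn_args)
    # Outside the block the globally registered Hub kernel (if any) is used.
    return self.attn(*attn_args)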

ArthurZucker (Collaborator)

Or we register the kernel with the torch API to allow compile?

qubvel (Member) commented Mar 24, 2025

Not aware of this, but it can be an option, yes. Anyway, it would be nice to be able to disable it manually in case there are any issues with the kernel, and to understand which path is executed (e.g. to export RT-DETR for CoreML we need 5D tensors instead of 6D passed into the kernel, so I refactored the PyTorch path of deformable attention in this PR).
