Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use 8x SIMD conversions when appropriate #66

Closed
starkat99 opened this issue Dec 31, 2022 · 2 comments
Closed

Use 8x SIMD conversions when appropriate #66

starkat99 opened this issue Dec 31, 2022 · 2 comments

Comments

@starkat99
Copy link
Owner

Use the 8x x86 SIMD conversions when appropriate, not just the 4x. These are _mm256_cvtph_ps and _mm256_cvtps_ph. Needs some profiling to determine which to use over the other and in what circumstances, possible that it might be better to use 8x always.

@Shnatsel
Copy link
Contributor

Shnatsel commented Jan 7, 2023

The commit that closed this also sped up the fallback codepath, presumably because it's a loop instead of 4 independent calls to the fallback function, so there's less code bloat.

When will a new release with this change ship? We'd like to use it in the exr crate: johannesvollmer/exrs#191

@starkat99
Copy link
Owner Author

Was going to wait until doing some more ARM work, but went ahead and did a 2.2.1 release just now for you.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants