Optimize Vector64.Create for constants #101662

EgorBo · 2024-04-28T13:40:30Z

EgorBo · 2024-04-29T15:31:38Z

PTAL @kunalspathak @tannergooding cc @dotnet/jit-contrib

Empy size diffs, 283 contexts with text diffs (becuase size is the same)

tannergooding · 2024-04-29T16:13:31Z

src/coreclr/jit/codegenarm64.cpp

@@ -2406,14 +2406,36 @@ void CodeGen::genSetRegToConst(regNumber targetReg, var_types targetType, GenTre
                    }
                    else
                    {
-                        // Get a temp integer register to compute long address.
-                        regNumber addrReg = tree->GetSingleTempReg();
+                        simd8_t val = vecCon->gtSimd8Val;


The same handling should presumably be added for TYP_SIMD16 as well, yes?

It should also be fine to expose for TYP_SIMD12, since any operations are already required to mask out the unused bit when its impactful.

I also wonder if this is missing some cases or potentially conflicting with the IsValidConstForMovImm in LowerHWIntrinsicCreate handling

That check is supposed to recognize and catch this case, lowering it instead to Arm64.DuplicateToVectorXXX, which is itself supposed to emit the more optimized mov instruction

It might be a case where we need both since sometimes we'll have Create(...) and sometimes we'll have CNS_VEC isntead.

Oh, good point, I'll check what we generate for other VectorX.Create

Added Vector128 as well. I think I'll leave SIMD12 unchanged since I think we probably don't want to have a garbage value in the last 4 bytes of it and prefer it being zero.

I'm fine with not doing TYP_SIMD12 in this PR, but I think we probably want to handle it as well.

The general logic for TYP_SIMD12 is explicitly setup to allow for the last element to be undefined for perf reasons. Most places it doesn't matter and the places where it does, values like 0 are often just as bad as denormals or other random values (as it frequently causes NaN to be produced).

x64, for example, will optimize it to a broadcast in many cases so that the denser codegen can be emitted

Ok if you think it's fine I pushed the SIMD12 as well - it allowed to unify them all

tannergooding

Changes as is look correct, there's just likely more we could do here to ensure other cases are handled

dotnet-issue-labeler bot added the area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI label Apr 28, 2024

dotnet-policy-service bot assigned EgorBo Apr 28, 2024

EgorBo marked this pull request as ready for review April 28, 2024 19:25

build-analysis bot mentioned this pull request Apr 28, 2024

slow macOS - "##[error]The job running on agent Azure Pipelines 9 ran longer than the maximum time of 60 minutes." dotnet/dnceng#1883

Open

3 tasks

EgorBo requested review from tannergooding and kunalspathak April 29, 2024 15:30

tannergooding reviewed Apr 29, 2024

View reviewed changes

tannergooding approved these changes Apr 29, 2024

View reviewed changes

Handle SIMD16 as well

8185bfe

EgorBo force-pushed the vec64-cns branch from dc6a12a to 8185bfe Compare May 16, 2024 21:44

EgorBo added 2 commits May 17, 2024 00:36

handle SIMD12 as well

265819a

Update codegenarm64.cpp

8aab6ea

EgorBo merged commit 91518ca into dotnet:main May 17, 2024
107 checks passed

EgorBo deleted the vec64-cns branch May 17, 2024 08:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize Vector64.Create for constants #101662

Optimize Vector64.Create for constants #101662

EgorBo commented Apr 28, 2024

EgorBo commented Apr 29, 2024

tannergooding Apr 29, 2024

tannergooding Apr 29, 2024 •

edited

EgorBo Apr 29, 2024

EgorBo May 16, 2024

tannergooding May 16, 2024

EgorBo May 16, 2024

tannergooding left a comment

Optimize Vector64.Create for constants #101662

Optimize Vector64.Create for constants #101662

Conversation

EgorBo commented Apr 28, 2024

EgorBo commented Apr 29, 2024

tannergooding Apr 29, 2024

Choose a reason for hiding this comment

tannergooding Apr 29, 2024 • edited

Choose a reason for hiding this comment

EgorBo Apr 29, 2024

Choose a reason for hiding this comment

EgorBo May 16, 2024

Choose a reason for hiding this comment

tannergooding May 16, 2024

Choose a reason for hiding this comment

EgorBo May 16, 2024

Choose a reason for hiding this comment

tannergooding left a comment

Choose a reason for hiding this comment

tannergooding Apr 29, 2024 •

edited