Clang generates worse code than GCC for a simple case #92649

KanRobert · 2024-05-18T13:29:15Z

int f(int *a) {
  if (*a & 1234)
    return 0;
  return 1;
}

bash$ gcc -O2 -S 1.c -o -
        .file   "1.c"
        .text
        .p2align 4
        .globl  f
        .type   f, @function
f:
.LFB0:
        .cfi_startproc
        xorl    %eax, %eax
        testl   $1234, (%rdi)
        sete    %al
        ret

bash$ clang -O2 -S 1.c -o -
        .text
        .file   "1.c"
        .globl  f                               # -- Begin function f
        .p2align        4, 0x90
        .type   f,@function
f:                                      # @f
        .cfi_startproc
# %bb.0:                                # %entry
        movzwl  (%rdi), %ecx
        xorl    %eax, %eax
        testl   $1234, %ecx                     # imm = 0x4D2
        sete    %al
        retq

https://www.godbolt.org/z/he4cj4a8G

The text was updated successfully, but these errors were encountered:

llvmbot · 2024-05-18T13:29:31Z

@llvm/issue-subscribers-backend-x86

Author: Shengchen Kan (KanRobert)

``` int f(int *a) { if (*a & 1234) return 0; return 1; } ```

bash$ gcc -O2 -S 1.c -o -
        .file   "1.c"
        .text
        .p2align 4
        .globl  f
        .type   f, @<!-- -->function
f:
.LFB0:
        .cfi_startproc
        xorl    %eax, %eax
        testl   $1234, (%rdi)
        sete    %al
        ret

bash$ clang -O2 -S 1.c -o -
        .text
        .file   "1.c"
        .globl  f                               # -- Begin function f
        .p2align        4, 0x90
        .type   f,@<!-- -->function
f:                                      # @<!-- -->f
        .cfi_startproc
# %bb.0:                                # %entry
        movzwl  (%rdi), %ecx
        xorl    %eax, %eax
        testl   $1234, %ecx                     # imm = 0x4D2
        sete    %al
        retq

https://www.godbolt.org/z/he4cj4a8G

KanRobert · 2024-05-18T13:31:14Z

CC @phoebewang @RKSimon @topperc b/c I'm not if it's by design.

phoebewang · 2024-05-18T13:35:38Z

Maybe similar to #92251

topperc · 2024-05-18T20:05:06Z

I don't think this is intentional. It looks like TargetLowering::SimplifySetCC is narrowing the setcc+and+load to i16 here.

    // If the LHS is '(and load, const)', the RHS is 0, the test is for          
    // equality or unsigned, and all 1 bits of the const are in the same         
    // partial word, see if we can shorten the load.                             
    if (DCI.isBeforeLegalize() &&                                                
        !ISD::isSignedIntSetCC(Cond) &&                                          
        N0.getOpcode() == ISD::AND && C1 == 0 &&                                 
        N0.getNode()->hasOneUse() &&                                             
        isa<LoadSDNode>(N0.getOperand(0)) &&                                     
        N0.getOperand(0).getNode()->hasOneUse() &&                               
        isa<ConstantSDNode>(N0.getOperand(1))) {

The and gets promoted to i32 later due to isTypeDesirableForOp. That creates an anyext load which later becomes a zextload.

We could probably add a DAGCombine to promote the load back to i32 for the AND based on alignment.

KanRobert added backend:X86 llvm:codegen labels May 18, 2024

EugeneZelenko added the missed-optimization label May 18, 2024

KanRobert mentioned this issue May 18, 2024

[X86][CodeGen] Support lowering for CCMP/CTEST #91747

Merged

KanRobert self-assigned this May 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Clang generates worse code than GCC for a simple case #92649

Clang generates worse code than GCC for a simple case #92649

KanRobert commented May 18, 2024

llvmbot commented May 18, 2024

KanRobert commented May 18, 2024

phoebewang commented May 18, 2024

topperc commented May 18, 2024

Clang generates worse code than GCC for a simple case #92649

Clang generates worse code than GCC for a simple case #92649

Comments

KanRobert commented May 18, 2024

llvmbot commented May 18, 2024

KanRobert commented May 18, 2024

phoebewang commented May 18, 2024

topperc commented May 18, 2024