Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Lowered LLVM is preventing inlining #1478

Open
bcardosolopes opened this issue Mar 13, 2025 · 5 comments
Open

Lowered LLVM is preventing inlining #1478

bcardosolopes opened this issue Mar 13, 2025 · 5 comments
Assignees
Labels
good first issue Good for newcomers IR difference A difference in ClangIR-generated LLVM IR that could complicate reusing original CodeGen tests

Comments

@bcardosolopes
Copy link
Member

In clang/test/CIR/CodeGen/tbaa-vptr.cpp:

The LLVM IR produced by OG is as follows:

define dso_local void @_ZN1BD0Ev(ptr noundef nonnull align 8 dereferenceable(9) initializes((0, 8)) %this) unnamed_addr #0 align 2 {
entry:
  store ptr getelementptr inbounds nuw inrange(-16, 16) (i8, ptr @_ZTV1B, i64 16), ptr %this, align 8, !tbaa !5
  %0 = getelementptr inbounds nuw i8, ptr %this, i64 8
  tail call void @_ZN6MemberD1Ev(ptr noundef nonnull align 1 dereferenceable(1) %0) #3
  tail call void @_ZN1AD2Ev(ptr noundef nonnull align 8 dereferenceable(9) %this) #3
  tail call void @_ZdlPvm(ptr noundef %this, i64 noundef 16) #4
  ret void
}

In contrast, the LLVM IR generated by ClangIR is as follows:

define dso_local void @_ZN1BD0Ev(ptr %0) #1 {
  tail call void @_ZN1BD1Ev(ptr %0) #0
  tail call void @_ZdlPvm(ptr %0, i64 16)
  ret void
}

Is there something missing in the ClangIR?

Originally posted by @PikachuHyA in #1463 (comment)

@bcardosolopes bcardosolopes self-assigned this Mar 13, 2025
@AdUhTkJm
Copy link
Contributor

AdUhTkJm commented Mar 14, 2025

The behaviour seems to be correct, just that the destructor is not inlined. It might be more obvious if we demangle the names.

This is simplified output from OG with -O0:

; non-deleting destructor
define dso_local void @B::~B()(ptr %0) {
  %2 = alloca ptr, align 8
  store ptr %0, ptr %2, align 8
  %3 = load ptr, ptr %2, align 8
  store ptr getelementptr ({ [4 x ptr] }, ptr @vtable for B, i32 0, i32 0, i32 2), ptr %3
  %4 = getelementptr i8, ptr %3, i64 8
  call void @Member::~Member()(ptr %4)
  call void @A::~A()(ptr %3)
  ret void
}

; deleting destructor
define dso_local void @B::~B()(ptr %0) {
  %2 = alloca ptr, align 8
  store ptr %0, ptr %2, align 8
  %3 = load ptr, ptr %2, align 8
  call void @B::~B()(ptr %3) ; this is the non-deleting one
  call void @operator delete(void*, unsigned long)(ptr %3, i64 16)
  ret void
}

As we can see, the OG output in the example comes from inlining the non-deleting destructor into the deleting one:

define dso_local void @_ZN1BD0Ev(ptr %this) {
entry:
  store ptr getelementptr (i8, ptr @_ZTV1B, i64 16), ptr %this, align 8
  %0 = getelementptr inbounds nuw i8, ptr %this, i64 8
  tail call void @_ZN6MemberD1Ev(ptr %0) ; Member::~Member()
  tail call void @_ZN1AD2Ev(ptr %this) ; A::~A()
  tail call void @_ZdlPvm(ptr %this, i64 16) ; operator delete(void *, unsigned long)
  ret void
}

While the Clang IR one does not perform the inline:

define dso_local void @_ZN1BD0Ev(ptr %0) #1 {
  tail call void @_ZN1BD1Ev(ptr %0) ; B::~B(), which is the non-deleting one
  tail call void @_ZdlPvm(ptr %0, i64 16) ; operator delete(void *, unsigned long)
  ret void
}

(cc @PikachuHyA)

@bcardosolopes
Copy link
Member Author

Thanks for checking. Seems like CIR -> LLVM lowering is probably lacking some attributes that would make the inliner kick in - the idea is that LLVM should be able to continue doing same opts after lowered to CIR.

@bcardosolopes bcardosolopes changed the title Missing dtor call Lowered LLVM is preventing inlining Mar 17, 2025
@bcardosolopes bcardosolopes added IR difference A difference in ClangIR-generated LLVM IR that could complicate reusing original CodeGen tests good first issue Good for newcomers labels Mar 17, 2025
@yimdx
Copy link

yimdx commented Mar 18, 2025

I would like to work on this to learn ClangIR!

@siya100
Copy link

siya100 commented Mar 26, 2025

hey @bcardosolopes i want to work on this issue to delve into ClangIR

@bcardosolopes bcardosolopes assigned siya100 and yimdx and unassigned bcardosolopes Mar 29, 2025
@bcardosolopes
Copy link
Member Author

I'm fine with whoever submits the PR, thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
good first issue Good for newcomers IR difference A difference in ClangIR-generated LLVM IR that could complicate reusing original CodeGen tests
Projects
None yet
Development

No branches or pull requests

4 participants