[SYCL] Stop emission of modf intrinsic for NVPTX/AMDGCN #17958

frasercrmck · 2025-04-10T14:57:43Z

A recent change in #126750 changed the lowering of the modf builtin(s) to lower to the llvm.modf intrinsic. Certain SYCL targets such as NVPTX and AMDGCN cannot currently handle this intrinsic in the backend because they don't have the necessary target library info. Even if they did, the SYCL device libraries are linked in earlier at the IR level so we'd be left with an unresolved symbol.

We have been working around this at clang codegen time by avoiding the emission of intrinsics altogether by (ab)using a guard on math intrinsics.

The problem with us hooking into GenerateIntrinsics like this is that SYCL is wanting to use it as a guarantee that (a certain subset of) intrinsics won't be emitted if GenerateIntrinsics is false, whereas upstream does not make this guarantee: it's an opt-in toggle for a more optimal lowering strategy. Thus we're always going to be liable to this sort of upstream change.

This doesn't feel like the right mechanism for SYCL to handle these builtins (or intrinsics) long term, but this fix restores the previous behaviour without making things much worse.

Fixes the AMDGCN/NVPTX aspects of #17813

A recent change in #126750 changed the lowering of the modf builtin(s) to lower to the llvm.modf intrinsic. Certain SYCL targets such as NVPTX and AMDGCN cannot currently handle this intrinsic in the backend because they don't have the necessary target library info. Even if they did, the SYCL device libraries are linked in earlier at the IR level so we'd be left with an unresolved symbol. We have been working around this at clang codegen time by avoiding the emission of intrinsics altogether by (ab)using a guard on math intrinsics. The problem with us hooking into GenerateIntrinsics like this is that SYCL is wanting to use it as a guarantee that (a certain subset of) intrinsics won't be emitted if GenerateIntrinsics is false, whereas upstream does not make this guarantee: it's an opt-in toggle for a more optimal lowering strategy. Thus we're always going to be liable to this sort of upstream change. This doesn't feel like the right mechanism for SYCL to handle these builtins (or intrinsics) long term, but this fix restores the previous behaviour without making things much worse. Fixes intel#17813

bader · 2025-04-10T16:55:19Z

AMDGCN cannot currently handle this intrinsic

And still Matt Arsenault has approved this change. I suppose that change is safe for the clang (i.e. OpenCL/HIP/CUDA targeting AMDGCN).

If I get it right, this the trend in community to lower all math functions builtins into LLVM instrinsics. Long term we should align DPC++ with the community approach.

frasercrmck · 2025-04-14T09:50:28Z

AMDGCN cannot currently handle this intrinsic

And still Matt Arsenault has approved this change. I suppose that change is safe for the clang (i.e. OpenCL/HIP/CUDA targeting AMDGCN).

If I get it right, this the trend in community to lower all math functions builtins into LLVM instrinsics. Long term we should align DPC++ with the community approach.

Yep I'll admit I don't fully know what's going on and what the intention and trend is. But we should try and get a handle on where things are heading and, yes, align with that approach.

I'd like to remove this workaround so we can at least align AMDGCN/NVPTX paths with the SPIR-V and other SYCL targets. If we are left with codegen intrinsics we could perhaps have an IR-level pass which expands intrinsics (with/without specific flags or ULP requirements) to libdevice/libclc function calls. I'm assume we are ruling out binary-level linking.

frasercrmck · 2025-04-15T11:15:12Z

Tracker #17813 is quite coarse. It says CUDA in the title but also says the spirv backend is failing. It doesn't mention HIP.

Indeed with this patch, all the same tests are failing with the same problem, but only for the SPIR-V backend.

I was hoping to keep the tracker in place and retitle/reword it to reflect that fact that it's still failing for the SPIR-V backend, rather than closing it and opening another almost identical one. Is that okay?

Fznamznon · 2025-04-17T10:57:14Z

I was hoping to keep the tracker in place and retitle/reword it to reflect that fact that it's still failing for the SPIR-V backend, rather than closing it and opening another almost identical one. Is that okay?

I'm not following . The original change broke NVPTX and AMDGCN, why SPIR-V backend is failing?

frasercrmck · 2025-04-17T11:27:05Z

I was hoping to keep the tracker in place and retitle/reword it to reflect that fact that it's still failing for the SPIR-V backend, rather than closing it and opening another almost identical one. Is that okay?

I'm not following . The original change broke NVPTX and AMDGCN, why SPIR-V backend is failing?

The SPIR-V backend can't handle the llvm.modf intrinsic, so has been failing after that original change. My best guess is the NVPTX/AMDGCN failures were obscuring the SPIR-V backend failures, and the ticket isn't complete in its assessment. Note the original fix was to add UNSUPPORTED: true to the tests so they haven't been running for any target, and have been obscuring the SPIR-V backend issue. It's fundamentally the same issue though.

I've been made vaguely aware of changes in the closed source repo that align this GenerateIntrinsics code for the SPIR-V backend too. So I would imagine eventually the three targets will be aligned in some fashion.

frasercrmck requested review from a team as code owners April 10, 2025 14:57

frasercrmck requested a review from sergey-semenov April 10, 2025 14:57

frasercrmck had a problem deploying to WindowsCILock April 10, 2025 14:58 — with GitHub Actions Failure

frasercrmck requested review from npmiller and Naghasan April 10, 2025 14:58

npmiller approved these changes Apr 10, 2025

View reviewed changes

frasercrmck temporarily deployed to WindowsCILock April 10, 2025 15:43 — with GitHub Actions Inactive

frasercrmck temporarily deployed to WindowsCILock April 10, 2025 16:11 — with GitHub Actions Inactive

fix clang

f84b710

frasercrmck had a problem deploying to WindowsCILock April 14, 2025 09:44 — with GitHub Actions Error

Merge remote-tracking branch 'origin/sycl' into fix-sycl-modf-intrinsic

71ce31b

frasercrmck had a problem deploying to WindowsCILock April 14, 2025 09:52 — with GitHub Actions Error

fix check

37399f7

frasercrmck temporarily deployed to WindowsCILock April 14, 2025 09:55 — with GitHub Actions Inactive

frasercrmck temporarily deployed to WindowsCILock April 14, 2025 10:16 — with GitHub Actions Inactive

frasercrmck temporarily deployed to WindowsCILock April 14, 2025 10:45 — with GitHub Actions Inactive

fix gpu intrinsic selection

86cd4f8

frasercrmck temporarily deployed to WindowsCILock April 14, 2025 15:17 — with GitHub Actions Inactive

frasercrmck temporarily deployed to WindowsCILock April 14, 2025 16:00 — with GitHub Actions Inactive

frasercrmck temporarily deployed to WindowsCILock April 14, 2025 16:14 — with GitHub Actions Inactive

change unsupported

35dbd87

frasercrmck temporarily deployed to WindowsCILock April 15, 2025 09:02 — with GitHub Actions Inactive

frasercrmck had a problem deploying to WindowsCILock April 15, 2025 09:45 — with GitHub Actions Error

re-add unsupported tracker

9a974b5

frasercrmck temporarily deployed to WindowsCILock April 15, 2025 10:09 — with GitHub Actions Inactive

Merge remote-tracking branch 'origin/sycl' into fix-sycl-modf-intrinsic

9ad9723

frasercrmck temporarily deployed to WindowsCILock April 15, 2025 11:01 — with GitHub Actions Inactive

frasercrmck had a problem deploying to WindowsCILock April 15, 2025 11:01 — with GitHub Actions Error

frasercrmck temporarily deployed to WindowsCILock April 15, 2025 11:14 — with GitHub Actions Inactive

re-enable three other tests

6ecf34c

frasercrmck temporarily deployed to WindowsCILock April 15, 2025 14:09 — with GitHub Actions Inactive

frasercrmck temporarily deployed to WindowsCILock April 15, 2025 14:29 — with GitHub Actions Inactive

frasercrmck temporarily deployed to WindowsCILock April 15, 2025 14:49 — with GitHub Actions Inactive

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SYCL] Stop emission of modf intrinsic for NVPTX/AMDGCN #17958

[SYCL] Stop emission of modf intrinsic for NVPTX/AMDGCN #17958

frasercrmck commented Apr 10, 2025 •

edited

Loading

bader commented Apr 10, 2025

frasercrmck commented Apr 14, 2025

frasercrmck commented Apr 15, 2025

Fznamznon commented Apr 17, 2025

frasercrmck commented Apr 17, 2025

[SYCL] Stop emission of modf intrinsic for NVPTX/AMDGCN #17958

Are you sure you want to change the base?

[SYCL] Stop emission of modf intrinsic for NVPTX/AMDGCN #17958

Conversation

frasercrmck commented Apr 10, 2025 • edited Loading

bader commented Apr 10, 2025

frasercrmck commented Apr 14, 2025

frasercrmck commented Apr 15, 2025

Fznamznon commented Apr 17, 2025

frasercrmck commented Apr 17, 2025

frasercrmck commented Apr 10, 2025 •

edited

Loading