Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

WorkGroupScratchMemory/copy_dynamic_size.cpp Fails on cuda when built for multiple triples #16968

Open
ayylol opened this issue Feb 11, 2025 · 0 comments
Labels
bug Something isn't working cuda CUDA back-end

Comments

@ayylol
Copy link
Contributor

ayylol commented Feb 11, 2025

Describe the bug

WorkGroupScratchMemory/copy_dynamic_size.cpp will fail executing on cuda devices when it is built for more than just the nvptx triple. The test passes fine when it is built only for nvptx.

To reproduce

To reproduce first build the test for spir and nvptx:

llvm-lit WorkGroupScratchMemory/copy_dynamic_size.cpp --param test-mode=build-only --param sycl_build_targets="spir;nvidia" -a

Then run for cuda:gpu

llvm-lit WorkGroupScratchMemory/copy_dynamic_size.cpp --param test-mode=run-only --param sycl_devices=cuda:gpu -a

I got the following error:

-- Testing: 1 tests, 1 workers --
FAIL: SYCL :: WorkGroupScratchMemory/copy_dynamic_size.cpp (1 of 1)
******************** TEST 'SYCL :: WorkGroupScratchMemory/copy_dynamic_size.cpp' FAILED ********************
Exit Code: 134

Command Output (stderr):
--
RUN: at line 2: env UR_CUDA_ENABLE_IMAGE_SUPPORT=1 ONEAPI_DEVICE_SELECTOR=cuda:gpu  /iusers/dgarciao/llvm/build/tools/sycl/test-e2e/WorkGroupScratchMemory/Output/copy_dynamic_size.cpp.tmp.out
+ env UR_CUDA_ENABLE_IMAGE_SUPPORT=1 ONEAPI_DEVICE_SELECTOR=cuda:gpu /iusers/dgarciao/llvm/build/tools/sycl/test-e2e/WorkGroupScratchMemory/Output/copy_dynamic_size.cpp.tmp.out
<CUDA>[ERROR]:
UR CUDA ERROR:
        Value:           700
        Name:            CUDA_ERROR_ILLEGAL_ADDRESS
        Description:     an illegal memory access was encountered
        Function:        wait
        Source Location: /iusers/dgarciao/llvm/build/_deps/unified-runtime-src/source/adapters/cuda/event.cpp:134

terminate called after throwing an instance of 'sycl::_V1::exception'
  what():  Native API failed. Native API returns: 2147483646 (UR_RESULT_ERROR_UNKNOWN)
/iusers/dgarciao/llvm/build/tools/sycl/test-e2e/WorkGroupScratchMemory/Output/copy_dynamic_size.cpp.script: line 1: 4046523 Aborted                 (core dumped) env UR_CUDA_ENABLE_IMAGE_SUPPORT=1 ONEAPI_DEVICE_SELECTOR=cuda:gpu /iusers/dgarciao/llvm/build/tools/sycl/test-e2e/WorkGroupScratchMemory/Output/copy_dynamic_size.cpp.tmp.out

--

If instead we only build for nvptx in the first step, the test will pass

Environment

  • OS: Linux
  • Target device and vendor: cuda:gpu
  • DPC++ version: b8a2ab7

Additional context

No response

@ayylol ayylol added bug Something isn't working cuda CUDA back-end labels Feb 11, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working cuda CUDA back-end
Projects
None yet
Development

No branches or pull requests

1 participant