Skip to content

[SYCL][CUDA] Bug fix: non-uniform work-groups error on CUDA #2857

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
wants to merge 1 commit into from

Conversation

huanghua1994
Copy link

Bug fix for issue #2821

@huanghua1994 huanghua1994 requested a review from a team as a code owner December 3, 2020 15:33
Copy link
Contributor

@alexbatashev alexbatashev left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, but I'd like @v-klochkov to review. Also a test would be great.

@bader bader requested a review from Pennycook December 3, 2020 15:38
@bader bader linked an issue Dec 3, 2020 that may be closed by this pull request
@bader
Copy link
Contributor

bader commented Dec 3, 2020

@huanghua1994, could add a regression test following this guide, please?

@huanghua1994
Copy link
Author

@bader Sure, will do

@huanghua1994
Copy link
Author

Tested on vs-sycl-2.hd.intel.com.
Configuration: python3 buildbot/configure.py --cuda
Compiling: python3 buildbot/compile.py
Testing: python3 buildbot/check.py

Test results:
Unsupported : 17902
Passed : 51284
Expectedly Failed: 86

@bader
Copy link
Contributor

bader commented Dec 3, 2020

Sorry for confusion. What I meant is that we should add a new regression test to validate that issue #2821 is fixed.
We have CI system, which runs existing regression tests to verify that new patches do not break existing functionality.
I'd like to have a similar test covering non-uniform work-groups on CUDA.

@v-klochkov
Copy link
Contributor

Hi, I am in process of debugging a bit bigger and more complete patch for exactly same issue. The intention was to upload it for review yesterday, but the new LIT test added by me there showed some new unexpected issues with work-group size for Aux kernels with reductions. I'll take some more time to analyze it.

@v-klochkov
Copy link
Contributor

I uploaded a fix that includes similar code as this patch, but is a bit bigger patch and includes a LIT test: #2859

@huanghua1994
Copy link
Author

@v-klochkov Thanks! That's a better fix and should be merged. @Pennycook please review Vyecheslav's PR first, and close my PR after passing his PR.

@bader
Copy link
Contributor

bader commented Dec 6, 2020

The bug is fixed by #2859.

@bader bader closed this Dec 6, 2020
@bader bader added the cuda CUDA back-end label Apr 20, 2021
jsji pushed a commit that referenced this pull request Dec 14, 2024
Ensure that we do correct translation of matrices (SPV_KHR_cooperative_matrix and SPV_INTEL_joint_matrix extensions) when untyped pointers are enabled.
This mainly fixes mangling issues in reverse translation for the untyped pointer.
Also added handling for typed and untyped SPIR-V friendly access chain instructions in forward translation (a point to review and discuss).

Original commit:
KhronosGroup/SPIRV-LLVM-Translator@a1f6742cd6a2ce2
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
cuda CUDA back-end
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[SYCL] DPC++ reduction library non-uniform work-groups error
4 participants