-
Notifications
You must be signed in to change notification settings - Fork 125
Report device fp support via config rather than extension string. #2231
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Report device fp support via config rather than extension string. #2231
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I might just be out of the loop but I'm not sure why this is being done. I'm not sure how to review it if I don't understand the rationale.
5c227f9
to
c740185
Compare
My bad, updated the commit message and description. Basically I think returning hard coded opencl extension strings from every adapter instead of reporting stuff via device info queries is heinous and I'm trying get rid of it. |
llvm PR intel/llvm#15811 |
88ef7a8
to
5d3b502
Compare
5d3b502
to
febe542
Compare
ping @oneapi-src/unified-runtime-native-cpu-write @oneapi-src/unified-runtime-opencl-write @oneapi-src/unified-runtime-level-zero-write @oneapi-src/unified-runtime-cuda-write @oneapi-src/unified-runtime-hip-write |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM for L0
febe542
to
b982b09
Compare
ping @oneapi-src/unified-runtime-native-cpu-write @oneapi-src/unified-runtime-opencl-write @oneapi-src/unified-runtime-cuda-write @oneapi-src/unified-runtime-hip-write |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
CUDA/HIP LGTM
b982b09
to
fabf436
Compare
case UR_DEVICE_INFO_DOUBLE_FP_CONFIG: { | ||
ur_device_fp_capability_flags_t DoubleFPValue = 0; | ||
return ReturnValue(DoubleFPValue); | ||
// All fp types are supported, return minimum flags to indicate support. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Did we falsely say before we didn't support double etc?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
no the adapter was reporting "cl_khr_fp16, cl_khr_fp64 " for UR_DEVICE_INFO_EXTENSIONS, which I've removed to replace with this
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
so this bit didn't get called for the extensions then, but was still wrong?
ur_device_fp_capability_flags_t SingleFPValue = 0; | ||
return ReturnValue(SingleFPValue); | ||
} | ||
case UR_DEVICE_INFO_HALF_FP_CONFIG: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think half support will depend on the architecture - although x86 seems to emulate - still this may still be better than what we had before, and we don't officially support the "bad" architectures I think yet.
Unified Runtime -> intel/llvm Repo Move NoticeInformationThe source code of Unified Runtime has been moved to intel/llvm under the unified-runtime top-level directory, The code will be mirrored to oneapi-src/unified-runtime and the specification will continue to be hosted at oneapi-src.github.io/unified-runtime. The contribution guide has been updated with new instructions for contributing to Unified Runtime. PR MigrationAll open PRs including this one will be labelled auto-close and shall be automatically closed after 30 days. Should you wish to continue with your PR you will need to migrate it to intel/llvm. This is an automated comment. |
We're trying to move the UR adapters away from returning hard coded OpenCL extension strings to report device capabilities, this is the first change in that direction.
fabf436
to
94ea653
Compare
We're trying to move the UR adapters away from returning hard coded OpenCL
extension strings to report device capabilities, this is the first change
in that direction.
See #1374