Skip to content

Commit 498d56c

Browse files
authored
[SYCL][CUDA] Implement the Intel USM extension for the CUDA backend (#1241)
Device - USM-related calls to piDeviceGetInfo Kernel - piextKernelSetArgPointer USM - piextUSMHostAlloc - piextUSMDeviceAlloc - piextUSMSharedAlloc - piextUSMFree - piextUSMEnqueueMemset - piextUSMEnqueueMemcpy - piextUSMEnqueuePrefetch (*) - piextUSMEnqueueMemAdvise (*) - piextUSMGetMemAllocInfo (*) due to the incomplete documentation of the USM extension: - piextUSMEnqueuePrefetch ignores the "flags" argument; - piextUSMEnqueueMemAdvise does nothing. * Fix CUDA version conversion * Mark LIT tests for supported USM features Signed-off-by: Andrea Bocci <[email protected]>
1 parent 0d56408 commit 498d56c

18 files changed

+457
-23
lines changed

sycl/plugins/cuda/pi_cuda.cpp

Lines changed: 453 additions & 3 deletions
Large diffs are not rendered by default.

sycl/test/usm/allocator_vector.cpp

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,6 @@
22
// RUN: env SYCL_DEVICE_TYPE=HOST %t1.out
33
// RUN: %CPU_RUN_PLACEHOLDER %t1.out
44
// RUN: %GPU_RUN_PLACEHOLDER %t1.out
5-
// XFAIL: cuda
65

76
//==---- allocator_vector.cpp - Allocator Container test -------------------==//
87
//

sycl/test/usm/allocator_vector_fail.cpp

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,6 @@
22
// RUN: env SYCL_DEVICE_TYPE=HOST %t1.out
33
// RUN: %CPU_RUN_PLACEHOLDER %t1.out
44
// RUN: %GPU_RUN_PLACEHOLDER %t1.out
5-
// XFAIL: cuda
65

76
//==-- allocator_vector_fail.cpp - Device Memory Allocator fail test -------==//
87
//

sycl/test/usm/allocatorll.cpp

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,6 @@
22
// RUN: env SYCL_DEVICE_TYPE=HOST %t1.out
33
// RUN: %CPU_RUN_PLACEHOLDER %t1.out
44
// RUN: %GPU_RUN_PLACEHOLDER %t1.out
5-
// XFAIL: cuda
65

76
//==---- allocatorll.cpp - Device Memory Linked List Allocator test --------==//
87
//

sycl/test/usm/badmalloc.cpp

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,6 @@
44
// RUN: %GPU_RUN_PLACEHOLDER %t1.out
55

66
// UNSUPPORTED: windows
7-
// XFAIL: cuda
87

98
//==----------------- badmalloc.cpp - Bad Mallocs test ---------------------==//
109
//

sycl/test/usm/depends_on.cpp

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,6 @@
22
// RUN: env SYCL_DEVICE_TYPE=HOST %t1.out
33
// RUN: %CPU_RUN_PLACEHOLDER %t1.out
44
// RUN: %GPU_RUN_PLACEHOLDER %t1.out
5-
// XFAIL: cuda
65

76
//==----------------- depends_on.cpp - depends_on test ---------------------==//
87
//

sycl/test/usm/dmemll.cpp

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,6 @@
22
// RUN: env SYCL_DEVICE_TYPE=HOST %t1.out
33
// RUN: %CPU_RUN_PLACEHOLDER %t1.out
44
// RUN: %GPU_RUN_PLACEHOLDER %t1.out
5-
// XFAIL: cuda
65

76
//==------------------- dmemll.cpp - Device Memory Linked List test --------==//
87
//

sycl/test/usm/dmemllaligned.cpp

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,6 @@
22
// RUN: env SYCL_DEVICE_TYPE=HOST %t1.out
33
// RUN: %CPU_RUN_PLACEHOLDER %t1.out
44
// RUN: %GPU_RUN_PLACEHOLDER %t1.out
5-
// XFAIL: cuda
65

76
//==---- dmemllaligned.cpp - Aligned Device Memory Linked List test --------==//
87
//

sycl/test/usm/hmemll.cpp

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,6 @@
22
// RUN: env SYCL_DEVICE_TYPE=HOST %t1.out
33
// RUN: %CPU_RUN_PLACEHOLDER %t1.out
44
// RUN: %GPU_RUN_PLACEHOLDER %t1.out
5-
// XFAIL: cuda
65

76
//==------------------- hmemll.cpp - Host Memory Linked List test ----------==//
87
//

sycl/test/usm/hmemllaligned.cpp

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,6 @@
22
// RUN: env SYCL_DEVICE_TYPE=HOST %t1.out
33
// RUN: %CPU_RUN_PLACEHOLDER %t1.out
44
// RUN: %GPU_RUN_PLACEHOLDER %t1.out
5-
// XFAIL: cuda
65

76
//==---- hmemllaligned.cpp - Aligned Host Memory Linked List test ----------==//
87
//

0 commit comments

Comments
 (0)