CUDA: add mean operation #14313

am17an · 2025-06-21T06:00:03Z

Refactor sum_rows to use also do norm. Added a performance test as well. Sum-rows and mean can be abstracted even more but I think it's a cleaner API to keep them like this.

Backend	Device	us/run	Bandwidth	Speedup
CPU	Ryzen 3800XT 8-core	116.68	6.30	1.00
GPU	RTX 3090	2.72 us	270.31	42.9

JohannesGaessler

Since you're already working on reduction ops, you could take a look at the discussion in ggml-org/ggml#1005 . The person who said they'd do it has so far not delivered anything so I think it's safe to say they won't in the future.

For the CUDA code specifically my preference would be to have the reduction ops in a single file so that the template and the code using it is close together but this is a minor issue.

ggml/src/ggml-cuda/common.cuh

github-actions bot added testing Everything test related Nvidia GPU Issues specific to Nvidia GPUs ggml changes relating to the ggml tensor library for machine learning labels Jun 21, 2025

am17an force-pushed the cuda_add_mean branch from 30cb3f3 to 402858f Compare June 21, 2025 06:41

am17an requested a review from JohannesGaessler June 21, 2025 07:48

JohannesGaessler approved these changes Jun 21, 2025

View reviewed changes

ggml/src/ggml-cuda/common.cuh Outdated Show resolved Hide resolved

am17an added 3 commits June 22, 2025 02:28

CUDA: add mean operation

5c745ae

add back sum_rows_f32_cuda

866cdfe

Review: early exit if col!=0

9334357

am17an force-pushed the cuda_add_mean branch from 05affaa to 9334357 Compare June 21, 2025 18:28

am17an merged commit aa064b2 into ggml-org:master Jun 22, 2025
87 of 88 checks passed

am17an deleted the cuda_add_mean branch June 22, 2025 04:39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

CUDA: add mean operation #14313

CUDA: add mean operation #14313

am17an commented Jun 21, 2025 •

edited

Loading

Uh oh!

JohannesGaessler left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

CUDA: add mean operation #14313

CUDA: add mean operation #14313

Conversation

am17an commented Jun 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

JohannesGaessler left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

am17an commented Jun 21, 2025 •

edited

Loading