[BUGFIX] Excessive number of copy kernel calls #795

Merged: oleksandr-pavlyk merged 1 commit into master from avoid-extra-copy-call on Mar 21, 2022

oleksandr-pavlyk (Contributor) commented Mar 21, 2022

ze_tracer (from the pti-gpu repo) reported two copy kernel calls for every setitem call, twice as many as expected.

A missing return statement was the culprit.
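
For readers unfamiliar with this class of bug, here is a minimal sketch of how a missing return can double the number of kernel submissions. It is not the dpctl code touched by this PR: every name in it (submit_copy_kernel, setitem_buggy, setitem_fixed, count_calls) is hypothetical, and plain Python lists stand in for device arrays. The assumption is that a fast-path branch did its copy but then fell through into the general path, which copied again.

```python
# Hypothetical illustration only; none of these names come from dpctl.
kernel_calls = []


def submit_copy_kernel(dst, src):
    """Stand-in for submitting a copy kernel; records each submission."""
    kernel_calls.append("copy")
    dst[:] = src


def setitem_buggy(dst, src):
    if len(dst) == len(src):
        # Fast path: shapes match, copy directly.
        submit_copy_kernel(dst, src)
        # BUG: no return here, so execution falls through to the general path.
    # General path: truncate the source to fit, then copy.
    submit_copy_kernel(dst, list(src)[: len(dst)])


def setitem_fixed(dst, src):
    if len(dst) == len(src):
        submit_copy_kernel(dst, src)
        return  # the fix: stop after the fast path
    submit_copy_kernel(dst, list(src)[: len(dst)])


def count_calls(fn):
    """Run one setitem and report (number of copy kernels, resulting data)."""
    kernel_calls.clear()
    dst = [0, 0, 0]
    fn(dst, [1, 2, 3])
    return len(kernel_calls), dst


print(count_calls(setitem_buggy))  # two copy submissions, same final data
print(count_calls(setitem_fixed))  # one copy submission
```

Both variants leave the same data in the destination, which is why a bug like this tends to show up only in a tracer's kernel counts and not in functional tests.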

oleksandr-pavlyk changed the title from "onetrace revealed twice as many copy kernel calls as expected" to "[BUGFIX] Excessive number of copy kernel calls" on Mar 21, 2022
coveralls (Collaborator) commented

Coverage decreased (-0.01%) to 81.87% when pulling 60e06dd on avoid-extra-copy-call into 8e06a6c on master.

oleksandr-pavlyk merged commit c274840 into master on Mar 21, 2022
oleksandr-pavlyk deleted the avoid-extra-copy-call branch on March 21, 2022 19:39
github-actions commented

Deleted rendered PR docs from intelpython.github.com/dpctl, latest should be updated shortly. 🤞
