Fix #1638: Faster returning of results by mdboom · Pull Request #1647 · NVIDIA/cuda-python

mdboom · 2026-02-18T16:35:49Z

See #1638 for details as to why this works.

On my machine with the #659 benchmark, I see a reduction from 3.02 µs to 2.54 µs per iteration.

copy-pr-bot · 2026-02-18T16:35:54Z

Auto-sync is disabled for ready for review pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

mdboom · 2026-02-18T17:50:02Z

/ok to test

cpcloud

Seems legit!

github-actions · 2026-02-18T19:02:28Z

Doc Preview CI
Preview removed because the pull request was closed or merged.

leofang · 2026-02-18T23:42:47Z

@mdboom sorry for noticing this late, but I am confused by the diff -- it seems like this PR just fixes a recently introduced (I think) regression? We fixed this enum issue a long time ago with a fast dict lookup (#546), but that fix seems to be gone (replaced in this PR by a presumably even faster memoization).

leofang · 2026-02-19T15:24:31Z

> it seems like this PR just fixes a recently-introduced (I think) regression?

Ah, I see. The "regression" was introduced in the fast enum refactoring (#1581). So this PR means that, regardless of how fast the enum implementation (builtin or custom) is, the memoization is always needed... 🙂

Fix NVIDIA#1638: Faster returning of results

6872165

mdboom self-assigned this Feb 18, 2026

Merge remote-tracking branch 'upstream/main' into fast-result

d4223bd

cpcloud approved these changes Feb 18, 2026

mdboom added the cuda.bindings and performance labels Feb 18, 2026

mdboom merged commit 7a73c60 into NVIDIA:main Feb 18, 2026
93 checks passed

mdboom mentioned this pull request Feb 19, 2026

[PERF]: Add one more fast success path #1656

Merged

mdboom merged 2 commits into NVIDIA:main from mdboom:fast-result

3 participants
