Skip to content

Support prefetch pipeline in bounds_check_indices #3923

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 3 commits into
base: main
Choose a base branch
from

Conversation

sryap
Copy link
Contributor

@sryap sryap commented Apr 3, 2025

Summary:
Backend of D72365505

This diff reduces the grid dimension of the bounds_check_indices
kernel when pipeline prefetching is used (in embedding memory
offloading). We need to use the v2 kernel since v1 does not support
grid dimension reduction.

Reviewed By: q10

Differential Revision: D72343128

sryap added 3 commits April 2, 2025 16:16
Differential Revision: D72334377
Differential Revision: D72345566
Summary:
Backend of D72365505

This diff reduces the grid dimension of the bounds_check_indices
kernel when pipeline prefetching is used (in embedding memory
offloading).  We need to use the v2 kernel since v1 does not support
grid dimension reduction.

Reviewed By: q10

Differential Revision: D72343128
Copy link

netlify bot commented Apr 3, 2025

Deploy Preview for pytorch-fbgemm-docs ready!

Name Link
🔨 Latest commit e9795b0
🔍 Latest deploy log https://app.netlify.com/sites/pytorch-fbgemm-docs/deploys/67ee2b1031d6890008cdab2f
😎 Deploy Preview https://deploy-preview-3923--pytorch-fbgemm-docs.netlify.app
📱 Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site configuration.

@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D72343128

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants