-
-
Notifications
You must be signed in to change notification settings - Fork 5.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
nvshmem 3.1.7 #28647
base: main
Are you sure you want to change the base?
nvshmem 3.1.7 #28647
Conversation
Hi! This is the staged-recipes linter and your PR looks excellent! 🚀 |
Hi! This is the friendly automated conda-forge-linting service. I wanted to let you know that I linted all conda-recipes in your PR ( Here's what I've got... For recipes/libnvshmem/meta.yaml:
For recipes/libnvshmem/meta.yaml:
This message was generated by GitHub Actions workflow run https://github.com/conda-forge/conda-forge-webservices/actions/runs/12416875254. Examine the logs at this URL for more detail. |
c71139b
to
19e4343
Compare
Hi! This is the friendly automated conda-forge-linting service. I just wanted to let you know that I linted all conda-recipes in your PR ( I do have some suggestions for making it better though... For recipes/libnvshmem/meta.yaml:
This message was generated by GitHub Actions workflow run https://github.com/conda-forge/conda-forge-webservices/actions/runs/12835109483. Examine the logs at this URL for more detail. |
19e4343
to
40d4c1f
Compare
40d4c1f
to
a82dd91
Compare
rdma-core: MOFED provides it openmpi: DOE/DOD supercomputing clusters have their own versions of MPI installed already libfabric: Slingshot and EFA NIC have a custom version and plugin
recipes/libnvshmem/meta.yaml
Outdated
test: | ||
commands: | ||
- test -f $PREFIX/lib/libnvshmem_device.a | ||
- test -f $PREFIX/lib/libnvshmem.a |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If there are any size constraints on the package, we can leave libnvshmem.a out
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can we also add the bitcode library now? That might be easier than doing another pass.The first 3.2 builds are in URM now:
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Size is about 130MB and I understand conda generally discourage static packages so I guess I'll take 'em out. About the bitcode - this PR is for nvshmem 3.1.7, so once this has gone through vetting and is released for conda, I can work on the next version and include bitcode at that time.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sorry, just libnvshmem.a. libnvshmem_device.a likely has to stay for jit compiling. This is something that's been communicated to me from cuBLASMp.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I see. Anyway, I just realized that out of 130MB, the static conda package only takes up 12MB so it is inconsequential either way :) For now I'll take out libnvshmem.a
name: libnvshmem-split | ||
version: {{ version }} | ||
|
||
source: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I have checked to make sure the files are in the required places.
Let me run an additional test to make sure the binaries can find the libraries correctly.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I have confirmed the following:
Various NVSHMEM Bootstraps can be used, as long as the LD_LIBRARY_PATH is properly specified to include the desired packages.
The Bootstraps can be found automatically by the library
The performance tests in src can be compiled against NVSHMEM. The kitmaker libraries work properly.
From my end, the configuration is good.
@conda-forge/cuda please review |
Checklist
url
) rather than a repo (e.g.git_url
) is used in your recipe (see here for more details).Xref: #28111
Please also refer to https://docs.nvidia.com/nvshmem/release-notes-install-guide/install-guide/abstract.html for NVSHMEM requirements and how much this package were able/unable to meet.