nvshmem 3.1.7 #28647

billysuh7 · 2024-12-19T16:35:10Z

Checklist

Please also refer to https://docs.nvidia.com/nvshmem/release-notes-install-guide/install-guide/abstract.html for NVSHMEM requirements and how much this package were able/unable to meet.

github-actions · 2024-12-19T16:36:33Z

Hi! This is the staged-recipes linter and your PR looks excellent! 🚀

conda-forge-admin · 2024-12-19T16:36:49Z

Hi! This is the friendly automated conda-forge-linting service.

I wanted to let you know that I linted all conda-recipes in your PR (recipes/nvshmem/meta.yaml, recipes/libnvshmem/meta.yaml) and found some lint.

Here's what I've got...

For recipes/libnvshmem/meta.yaml:

❌ The license item is expected in the about section.

For recipes/libnvshmem/meta.yaml:

ℹ️ The recipe is not parsable by parser conda-souschef (grayskull). This parser is not currently used by conda-forge, but may be in the future. We are collecting information to see which recipes are compatible with grayskull.

_{This message was generated by GitHub Actions workflow run https://github.com/conda-forge/conda-forge-webservices/actions/runs/12416875254. Examine the logs at this URL for more detail.}

conda-forge-admin · 2024-12-19T16:42:07Z

Hi! This is the friendly automated conda-forge-linting service.

I just wanted to let you know that I linted all conda-recipes in your PR (recipes/nvshmem/meta.yaml, recipes/libnvshmem/meta.yaml) and found it was in an excellent condition.

I do have some suggestions for making it better though...

For recipes/libnvshmem/meta.yaml:

ℹ️ The recipe is not parsable by parser conda-souschef (grayskull). This parser is not currently used by conda-forge, but may be in the future. We are collecting information to see which recipes are compatible with grayskull.

_{This message was generated by GitHub Actions workflow run https://github.com/conda-forge/conda-forge-webservices/actions/runs/12835109483. Examine the logs at this URL for more detail.}

rdma-core: MOFED provides it openmpi: DOE/DOD supercomputing clusters have their own versions of MPI installed already libfabric: Slingshot and EFA NIC have a custom version and plugin

…whitelist

seth-howell · 2025-01-16T00:10:11Z

recipes/libnvshmem/meta.yaml

+    test:
+      commands:
+        - test -f $PREFIX/lib/libnvshmem_device.a
+        - test -f $PREFIX/lib/libnvshmem.a


If there are any size constraints on the package, we can leave libnvshmem.a out

Can we also add the bitcode library now? That might be easier than doing another pass.The first 3.2 builds are in URM now:

https://urm.nvidia.com/ui/repos/tree/General/sw-nvshmem-generic-local/NVSHMEM/gpu_comms_cuda12.0_compatibility/3.2.2/libnvshmem_cuda12-linux-x86_64-3.2.2.tar.gz

Size is about 130MB and I understand conda generally discourage static packages so I guess I'll take 'em out. About the bitcode - this PR is for nvshmem 3.1.7, so once this has gone through vetting and is released for conda, I can work on the next version and include bitcode at that time.

Sorry, just libnvshmem.a. libnvshmem_device.a likely has to stay for jit compiling. This is something that's been communicated to me from cuBLASMp.

I see. Anyway, I just realized that out of 130MB, the static conda package only takes up 12MB so it is inconsequential either way :) For now I'll take out libnvshmem.a

seth-howell · 2025-01-16T01:08:43Z

recipes/libnvshmem/meta.yaml

+  name: libnvshmem-split
+  version: {{ version }}
+
+source:


I have checked to make sure the files are in the required places.

Let me run an additional test to make sure the binaries can find the libraries correctly.

I have confirmed the following:

Various NVSHMEM Bootstraps can be used, as long as the LD_LIBRARY_PATH is properly specified to include the desired packages.
The Bootstraps can be found automatically by the library
The performance tests in src can be compiled against NVSHMEM. The kitmaker libraries work properly.

From my end, the configuration is good.

billysuh7 · 2025-01-17T19:44:41Z

@conda-forge/cuda please review

billysuh7 force-pushed the topic/bsuh/libnvshmem branch from c71139b to 19e4343 Compare December 19, 2024 16:40

billysuh7 force-pushed the topic/bsuh/libnvshmem branch from 19e4343 to 40d4c1f Compare December 19, 2024 21:17

libnvshmem 3.1.7

a82dd91

billysuh7 force-pushed the topic/bsuh/libnvshmem branch from 40d4c1f to a82dd91 Compare December 19, 2024 21:54

billysuh7 added 2 commits January 14, 2025 23:04

Take out rdma-core, openmpi and libfabric libs.

4ea677d

rdma-core: MOFED provides it openmpi: DOE/DOD supercomputing clusters have their own versions of MPI installed already libfabric: Slingshot and EFA NIC have a custom version and plugin

Add UCX to host dependency so we could remove libucs/libucp from DSO …

6a4b86d

…whitelist

seth-howell reviewed Jan 16, 2025

View reviewed changes

take out libnvshmem.a

da59307

billysuh7 marked this pull request as ready for review January 17, 2025 19:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

nvshmem 3.1.7 #28647

nvshmem 3.1.7 #28647

billysuh7 commented Dec 19, 2024 •

edited

Loading

github-actions bot commented Dec 19, 2024

conda-forge-admin commented Dec 19, 2024

conda-forge-admin commented Dec 19, 2024 •

edited

Loading

seth-howell Jan 16, 2025

seth-howell Jan 16, 2025

billysuh7 Jan 16, 2025 •

edited

Loading

seth-howell Jan 16, 2025

billysuh7 Jan 16, 2025

seth-howell Jan 16, 2025

seth-howell Jan 17, 2025

billysuh7 commented Jan 17, 2025

nvshmem 3.1.7 #28647

Are you sure you want to change the base?

nvshmem 3.1.7 #28647

Conversation

billysuh7 commented Dec 19, 2024 • edited Loading

github-actions bot commented Dec 19, 2024

conda-forge-admin commented Dec 19, 2024

conda-forge-admin commented Dec 19, 2024 • edited Loading

seth-howell Jan 16, 2025

Choose a reason for hiding this comment

seth-howell Jan 16, 2025

Choose a reason for hiding this comment

billysuh7 Jan 16, 2025 • edited Loading

Choose a reason for hiding this comment

seth-howell Jan 16, 2025

Choose a reason for hiding this comment

billysuh7 Jan 16, 2025

Choose a reason for hiding this comment

seth-howell Jan 16, 2025

Choose a reason for hiding this comment

seth-howell Jan 17, 2025

Choose a reason for hiding this comment

billysuh7 commented Jan 17, 2025

billysuh7 commented Dec 19, 2024 •

edited

Loading

conda-forge-admin commented Dec 19, 2024 •

edited

Loading

billysuh7 Jan 16, 2025 •

edited

Loading