Skip to content

Commit d97a32f

Browse files
committedApr 18, 2023
2.18.1-1
Add support for IB SHARP to NVLS (NVLink SHARP algorithm). Add NVLS+Tree algorithm. Add support for memory management using cuMem* functions. Use all NICs for Send/Receive operations on systems with more than one NIC per GPU (NVIDIA#804). Add ncclCommSplit primitive, with resource sharing option in config. Fix alltoallv hang (NVIDIA#788) Increase number of channels on H100 when we're not limited by NVLink. Improve error reporting in case of IB failure, printing local and remote ID (NVIDIA#779). Add build option to allow compilation against RDMA includes instead of dynamically loading IB verbs symbols (NVIDIA#802). Fix context creation for progress thread (NVIDIA#803). NET/IB: add option to use multiple QPs in round-robin mode. Fix tree performance issue when NVB is disabled on HCM topologies.
1 parent 9b7d5ed commit d97a32f

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

64 files changed

+4752
-3125
lines changed
 

‎makefiles/common.mk

+5
Original file line numberDiff line numberDiff line change
@@ -12,6 +12,7 @@ DEBUG ?= 0
1212
TRACE ?= 0
1313
PROFAPI ?= 1
1414
NVTX ?= 1
15+
RDMA_CORE ?= 0
1516

1617
NVCC = $(CUDA_HOME)/bin/nvcc
1718

@@ -106,3 +107,7 @@ endif
106107
ifneq ($(PROFAPI), 0)
107108
CXXFLAGS += -DPROFAPI
108109
endif
110+
111+
ifneq ($(RDMA_CORE), 0)
112+
CXXFLAGS += -DNCCL_BUILD_RDMA_CORE=1
113+
endif

‎makefiles/version.mk

+1-1
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
##### version
22
NCCL_MAJOR := 2
3-
NCCL_MINOR := 17
3+
NCCL_MINOR := 18
44
NCCL_PATCH := 1
55
NCCL_SUFFIX :=
66
PKG_REVISION := 1

0 commit comments

Comments
 (0)
Please sign in to comment.