Skip to content

Add TransformerEngine test workflow #4070

Add TransformerEngine test workflow

Add TransformerEngine test workflow #4070

Triggered via pull request March 12, 2025 12:24
Status Startup failure
Total duration
Artifacts

ci.yaml

on: pull_request
metadata
metadata
bump-manifest
bump-manifest
Matrix: amd64 / test-distribution
Waiting for pending jobs
Matrix: amd64 / test-te-unit-a100 / run-unit-test
Waiting for pending jobs
Matrix: arm64 / test-distribution
Waiting for pending jobs
Matrix: arm64 / test-te-unit-a100 / run-unit-test
Waiting for pending jobs
amd64  /  ...  /  launch-slurm-runner
amd64 / test-te-unit-a100 / runner / launch-slurm-runner
arm64  /  ...  /  launch-slurm-runner
arm64 / test-te-unit-a100 / runner / launch-slurm-runner
amd64  /  ...  /  build-base
amd64 / build-base / build-base
arm64  /  ...  /  build-base
arm64 / build-base / build-base
amd64  /  ...  /  build
amd64 / build-jax / build
arm64  /  ...  /  build
arm64 / build-jax / build
Matrix: amd64 / test-jax / run-unit-test
Waiting for pending jobs
Matrix: amd64 / test-te-h100 / transformer-engine-test-eks
Waiting for pending jobs
amd64  /  ...  /  launch-slurm-runner
amd64 / test-jax / runner / launch-slurm-runner
amd64  /  test-nsys-jax-eks
amd64 / test-nsys-jax-eks
amd64  /  ...  /  build
amd64 / build-gemma / build
amd64  /  ...  /  build
amd64 / build-levanter / build
amd64  /  ...  /  build
amd64 / build-maxtext / build
amd64  /  ...  /  build
amd64 / build-triton / build
amd64  /  ...  /  build
amd64 / build-upstream-t5x / build
Matrix: amd64 / test-nsys-jax / run-unit-test
Waiting for pending jobs
Matrix: amd64 / test-te-multigpu-a100 / te-multi-gpu
Waiting for pending jobs
amd64  /  ...  /  build
amd64 / build-equinox / build
amd64  /  ...  /  launch-slurm-runner
amd64 / test-nsys-jax / runner / launch-slurm-runner
Matrix: arm64 / test-jax / run-unit-test
Waiting for pending jobs
Matrix: arm64 / test-te-h100 / transformer-engine-test-eks
Waiting for pending jobs
arm64  /  ...  /  launch-slurm-runner
arm64 / test-jax / runner / launch-slurm-runner
arm64  /  test-nsys-jax-eks
arm64 / test-nsys-jax-eks
arm64  /  ...  /  build
arm64 / build-gemma / build
arm64  /  ...  /  build
arm64 / build-levanter / build
arm64  /  ...  /  build
arm64 / build-maxtext / build
arm64  /  ...  /  build
arm64 / build-triton / build
arm64  /  ...  /  build
arm64 / build-upstream-t5x / build
Matrix: arm64 / test-nsys-jax / run-unit-test
Waiting for pending jobs
Matrix: arm64 / test-te-multigpu-a100 / te-multi-gpu
Waiting for pending jobs
arm64  /  ...  /  build
arm64 / build-equinox / build
arm64  /  ...  /  launch-slurm-runner
arm64 / test-nsys-jax / runner / launch-slurm-runner
Matrix: amd64 / test-gemma / run-unit-test
Waiting for pending jobs
amd64  /  ...  /  launch-slurm-runner
amd64 / test-gemma / runner / launch-slurm-runner
Matrix: amd64 / test-levanter / run-unit-test
Waiting for pending jobs
amd64  /  ...  /  launch-slurm-runner
amd64 / test-levanter / runner / launch-slurm-runner
Matrix: amd64 / test-maxtext / maxtext-multinode
Waiting for pending jobs
Matrix: amd64 / test-maxtext / single-process-multi-device
Waiting for pending jobs
Matrix: amd64 / test-triton / run-unit-test
Waiting for pending jobs
amd64  /  ...  /  launch-slurm-runner
amd64 / test-triton / runner / launch-slurm-runner
amd64  /  ...  /  build-rosetta
amd64 / build-rosetta-t5x / build-rosetta
Matrix: amd64 / test-upstream-t5x / t5x-multi-gpu
Waiting for pending jobs
amd64  /  ...  /  sitrep
amd64 / test-te-multigpu-a100 / sitrep
Matrix: amd64 / test-nsys-jax-archive
Waiting for pending jobs
Matrix: arm64 / test-gemma / run-unit-test
Waiting for pending jobs
arm64  /  ...  /  launch-slurm-runner
arm64 / test-gemma / runner / launch-slurm-runner
Matrix: arm64 / test-levanter / run-unit-test
Waiting for pending jobs
arm64  /  ...  /  launch-slurm-runner
arm64 / test-levanter / runner / launch-slurm-runner
Matrix: arm64 / test-maxtext / maxtext-multinode
Waiting for pending jobs
Matrix: arm64 / test-maxtext / single-process-multi-device
Waiting for pending jobs
Matrix: arm64 / test-triton / run-unit-test
Waiting for pending jobs
arm64  /  ...  /  launch-slurm-runner
arm64 / test-triton / runner / launch-slurm-runner
arm64  /  ...  /  build-rosetta
arm64 / build-rosetta-t5x / build-rosetta
Matrix: arm64 / test-upstream-t5x / t5x-multi-gpu
Waiting for pending jobs
arm64  /  ...  /  sitrep
arm64 / test-te-multigpu-a100 / sitrep
Matrix: arm64 / test-nsys-jax-archive
Waiting for pending jobs
amd64  /  ...  /  test-maxtext-summary
amd64 / test-maxtext / test-maxtext-summary
amd64  /  ...  /  test-maxtext-metrics
amd64 / test-maxtext / test-maxtext-metrics
amd64  /  collect-docker-tags
amd64 / collect-docker-tags
Matrix: amd64 / test-rosetta-t5x / single-process-multi-device
Waiting for pending jobs
Matrix: amd64 / test-rosetta-t5x / vit-multi-gpu-multi-node
Waiting for pending jobs
amd64  /  ...  /  test-upstream-t5x-summary
amd64 / test-upstream-t5x / test-upstream-t5x-summary
amd64  /  ...  /  test-upstream-t5x-metrics
amd64 / test-upstream-t5x / test-upstream-t5x-metrics
arm64  /  ...  /  test-maxtext-summary
arm64 / test-maxtext / test-maxtext-summary
arm64  /  ...  /  test-maxtext-metrics
arm64 / test-maxtext / test-maxtext-metrics
arm64  /  collect-docker-tags
arm64 / collect-docker-tags
Matrix: arm64 / test-rosetta-t5x / single-process-multi-device
Waiting for pending jobs
Matrix: arm64 / test-rosetta-t5x / vit-multi-gpu-multi-node
Waiting for pending jobs
arm64  /  ...  /  test-upstream-t5x-summary
arm64 / test-upstream-t5x / test-upstream-t5x-summary
arm64  /  ...  /  test-upstream-t5x-metrics
arm64 / test-upstream-t5x / test-upstream-t5x-metrics
amd64  /  ...  /  sitrep
amd64 / test-maxtext / test-maxtext-sitrep / sitrep
amd64  /  ...  /  test-t5x-rosetta-summary
amd64 / test-rosetta-t5x / test-t5x-rosetta-summary
amd64  /  ...  /  test-t5x-rosetta-metrics
amd64 / test-rosetta-t5x / test-t5x-rosetta-metrics
amd64  /  ...  /  sitrep
amd64 / test-upstream-t5x / test-upstream-t5x-sitrep / sitrep
arm64  /  ...  /  sitrep
arm64 / test-maxtext / test-maxtext-sitrep / sitrep
arm64  /  ...  /  test-t5x-rosetta-summary
arm64 / test-rosetta-t5x / test-t5x-rosetta-summary
arm64  /  ...  /  test-t5x-rosetta-metrics
arm64 / test-rosetta-t5x / test-t5x-rosetta-metrics
arm64  /  ...  /  sitrep
arm64 / test-upstream-t5x / test-upstream-t5x-sitrep / sitrep
amd64  /  ...  /  test-maxtext-outcome
amd64 / test-maxtext / test-maxtext-outcome
amd64  /  ...  /  sitrep
amd64 / test-rosetta-t5x / test-t5x-rosetta-sitrep / sitrep
amd64  /  ...  /  test-upstream-t5x-outcome
amd64 / test-upstream-t5x / test-upstream-t5x-outcome
arm64  /  ...  /  test-maxtext-outcome
arm64 / test-maxtext / test-maxtext-outcome
arm64  /  ...  /  sitrep
arm64 / test-rosetta-t5x / test-t5x-rosetta-sitrep / sitrep
arm64  /  ...  /  test-upstream-t5x-outcome
arm64 / test-upstream-t5x / test-upstream-t5x-outcome
amd64  /  ...  /  test-t5x-rosetta-outcome
amd64 / test-rosetta-t5x / test-t5x-rosetta-outcome
arm64  /  ...  /  test-t5x-rosetta-outcome
arm64 / test-rosetta-t5x / test-t5x-rosetta-outcome
make-publish-configs
make-publish-configs
merge-new-manifest
merge-new-manifest
Matrix: publish-containers
Waiting for pending jobs
finalize  /  workflow-badge
finalize / workflow-badge
finalize  /  report
finalize / report
finalize  /  upload-badge
finalize / upload-badge
finalize  /  publish-badge
finalize / publish-badge
Fit to window
Zoom out
Zoom in

Annotations

1 error
Invalid workflow file: .github/workflows/ci.yaml#L494
The workflow is not valid. NVIDIA/JAX-Toolbox/.github/workflows/_ci.yaml@beba3aa7d47d8f59ed50dc8f81cd3958511e5d21 (Line: 494, Col: 11): Input JAX_DOCKER_IMAGE is required, but not provided while calling. NVIDIA/JAX-Toolbox/.github/workflows/_ci.yaml@beba3aa7d47d8f59ed50dc8f81cd3958511e5d21 (Line: 496, Col: 18): Invalid input, JAX_IMAGE is not defined in the referenced workflow.