Skip to content

Synchronization of Trigger JAX Toolbox action jobs from XLA CI. #1676

Synchronization of Trigger JAX Toolbox action jobs from XLA CI.

Synchronization of Trigger JAX Toolbox action jobs from XLA CI. #1676

Triggered via pull request January 22, 2024 22:50
Status Failure
Total duration 1d 14h 35m 4s
Artifacts 40
This run and associated checks have been archived and are scheduled for deletion. Learn more about checks retention

ci.yaml

on: pull_request
metadata
0s
metadata
Matrix: amd64 / test-distribution / test-create-distribution
amd64  /  ...  /  build-base
5m 51s
amd64 / build-base / build-base
Matrix: arm64 / test-distribution / test-create-distribution
arm64  /  ...  /  build-base
5m 14s
arm64 / build-base / build-base
amd64  /  ...  /  build-jax
8m 57s
amd64 / build-jax / build-jax
arm64  /  ...  /  build-jax
12m 30s
arm64 / build-jax / build-jax
amd64  /  ...  /  build-upstream-pax
6m 48s
amd64 / build-pax / build-upstream-pax
amd64  /  ...  /  build-upstream-t5x
8m 21s
amd64 / build-t5x / build-upstream-t5x
Matrix: amd64 / test-jax / jax-unit-test
amd64  /  ...  /  launch-slurm-runner
5h 52m
amd64 / test-jax / runner / launch-slurm-runner
amd64  /  ...  /  build-pallas
21m 36s
amd64 / build-pallas / build-pallas
amd64  /  ...  /  build-maxtext
5m 11s
amd64 / build-maxtext / build-maxtext
arm64  /  ...  /  build-upstream-pax
25m 16s
arm64 / build-pax / build-upstream-pax
arm64  /  ...  /  build
arm64 / build-t5x / build
Matrix: arm64 / test-jax / jax-unit-test
Waiting for pending jobs
arm64  /  ...  /  launch-slurm-runner
arm64 / test-jax / runner / launch-slurm-runner
arm64  /  ...  /  build
arm64 / build-pallas / build
arm64  /  ...  /  build
arm64 / build-maxtext / build
amd64  /  ...  /  build-rosetta
6m 43s
amd64 / build-rosetta-pax / build-rosetta
Matrix: amd64 / test-upstream-pax / pax-multi-node
Matrix: amd64 / test-upstream-pax / single-process-evaluation
Matrix: amd64 / test-upstream-pax / single-process-multi-device
Matrix: amd64 / test-te / te-multi-gpu
amd64  /  ...  /  te-unit-tests
22m 11s
amd64 / test-te / te-unit-tests
amd64  /  ...  /  build-rosetta
11m 8s
amd64 / build-rosetta-t5x / build-rosetta
Matrix: amd64 / test-upstream-t5x / t5x-multi-gpu
Matrix: amd64 / test-upstream-t5x / t5x-multi-node
Matrix: amd64 / test-pallas / pallas-unit-test
amd64  /  ...  /  launch-slurm-runner
6h 0m
amd64 / test-pallas / runner / launch-slurm-runner
arm64  /  ...  /  build-rosetta
16m 35s
arm64 / build-rosetta-pax / build-rosetta
Matrix: arm64 / test-upstream-pax / pax-multi-node
Waiting for pending jobs
Matrix: arm64 / test-upstream-pax / single-process-evaluation
Waiting for pending jobs
Matrix: arm64 / test-upstream-pax / single-process-multi-device
Waiting for pending jobs
Matrix: arm64 / test-te / te-multi-gpu
Waiting for pending jobs
arm64  /  ...  /  te-unit-tests
arm64 / test-te / te-unit-tests
arm64  /  ...  /  build-rosetta
arm64 / build-rosetta-t5x / build-rosetta
Matrix: arm64 / test-upstream-t5x / t5x-multi-gpu
Waiting for pending jobs
Matrix: arm64 / test-upstream-t5x / t5x-multi-node
Waiting for pending jobs
Matrix: arm64 / test-pallas / pallas-unit-test
Waiting for pending jobs
arm64  /  ...  /  launch-slurm-runner
arm64 / test-pallas / runner / launch-slurm-runner
Matrix: amd64 / test-rosetta-pax / rosetta-pax-multi-node-te
Matrix: amd64 / test-rosetta-pax / rosetta-pax-multi-node
Matrix: amd64 / test-rosetta-pax / rosetta-pax-single-node-dropout-te
Matrix: amd64 / test-rosetta-pax / single-process-evaluation-te
Matrix: amd64 / test-rosetta-pax / single-process-multi-device-te
amd64  /  ...  /  summary
0s
amd64 / test-upstream-pax / summary
amd64  /  ...  /  metrics
13s
amd64 / test-upstream-pax / metrics
Matrix: amd64 / test-rosetta-t5x / multi-gpu-multi-node
Matrix: amd64 / test-rosetta-t5x / single-process-multi-device
Matrix: amd64 / test-rosetta-t5x / vit-multi-gpu-multi-node
Matrix: amd64 / test-rosetta-t5x / vit-single-process-multi-device
amd64  /  ...  /  summary
0s
amd64 / test-upstream-t5x / summary
amd64  /  ...  /  metrics
0s
amd64 / test-upstream-t5x / metrics
Matrix: arm64 / test-rosetta-pax / rosetta-pax-multi-node-te
Waiting for pending jobs
Matrix: arm64 / test-rosetta-pax / rosetta-pax-multi-node
Waiting for pending jobs
Matrix: arm64 / test-rosetta-pax / rosetta-pax-single-node-dropout-te
Waiting for pending jobs
Matrix: arm64 / test-rosetta-pax / single-process-evaluation-te
Waiting for pending jobs
Matrix: arm64 / test-rosetta-pax / single-process-multi-device-te
Waiting for pending jobs
arm64  /  ...  /  summary
arm64 / test-upstream-pax / summary
arm64  /  ...  /  metrics
arm64 / test-upstream-pax / metrics
Matrix: arm64 / test-rosetta-t5x / multi-gpu-multi-node
Waiting for pending jobs
Matrix: arm64 / test-rosetta-t5x / single-process-multi-device
Waiting for pending jobs
Matrix: arm64 / test-rosetta-t5x / vit-multi-gpu-multi-node
Waiting for pending jobs
Matrix: arm64 / test-rosetta-t5x / vit-single-process-multi-device
Waiting for pending jobs
arm64  /  ...  /  summary
arm64 / test-upstream-t5x / summary
arm64  /  ...  /  metrics
arm64 / test-upstream-t5x / metrics
amd64  /  ...  /  summary
0s
amd64 / test-rosetta-pax / summary
amd64  /  ...  /  metrics
0s
amd64 / test-rosetta-pax / metrics
amd64  /  ...  /  sitrep
12s
amd64 / test-upstream-pax / sitrep / sitrep
amd64  /  ...  /  summary
0s
amd64 / test-rosetta-t5x / summary
amd64  /  ...  /  publish
34s
amd64 / test-rosetta-t5x / publish-test / publish
amd64  /  ...  /  sitrep
32s
amd64 / test-upstream-t5x / sitrep / sitrep
arm64  /  ...  /  summary
arm64 / test-rosetta-pax / summary
arm64  /  ...  /  metrics
arm64 / test-rosetta-pax / metrics
arm64  /  ...  /  sitrep
arm64 / test-upstream-pax / sitrep / sitrep
arm64  /  ...  /  summary
arm64 / test-rosetta-t5x / summary
arm64  /  ...  /  publish
arm64 / test-rosetta-t5x / publish-test / publish
arm64  /  ...  /  sitrep
arm64 / test-upstream-t5x / sitrep / sitrep
amd64  /  ...  /  publish
5s
amd64 / test-rosetta-pax / publish-test / publish
amd64  /  ...  /  outcome
0s
amd64 / test-upstream-pax / outcome
amd64  /  ...  /  outcome
0s
amd64 / test-rosetta-t5x / outcome
amd64  /  ...  /  outcome
0s
amd64 / test-upstream-t5x / outcome
arm64  /  ...  /  publish
arm64 / test-rosetta-pax / publish-test / publish
arm64  /  ...  /  outcome
arm64 / test-upstream-pax / outcome
arm64  /  ...  /  outcome
arm64 / test-rosetta-t5x / outcome
arm64  /  ...  /  outcome
arm64 / test-upstream-t5x / outcome
amd64  /  ...  /  outcome
0s
amd64 / test-rosetta-pax / outcome
arm64  /  ...  /  outcome
arm64 / test-rosetta-pax / outcome
finalize  /  upload-badge
33s
finalize / upload-badge
finalize  /  report
9s
finalize / report
finalize  /  publish-badge
0s
finalize / publish-badge
Fit to window
Zoom out
Zoom in

Annotations

67 errors
amd64 / test-upstream-pax / outcome
Process completed with exit code 1.
amd64 / test-upstream-t5x / t5x-multi-node (1, 1)
The job running on runner GitHub Actions 134 has exceeded the maximum execution time of 360 minutes.
amd64 / test-upstream-t5x / t5x-multi-node (1, 1)
The operation was canceled.
amd64 / test-upstream-t5x / t5x-multi-gpu (1)
The job running on runner GitHub Actions 9 has exceeded the maximum execution time of 360 minutes.
amd64 / test-upstream-t5x / t5x-multi-gpu (1)
The operation was canceled.
amd64 / test-upstream-t5x / t5x-multi-node (8, 1)
The job running on runner GitHub Actions 163 has exceeded the maximum execution time of 360 minutes.
amd64 / test-upstream-t5x / t5x-multi-node (8, 1)
The operation was canceled.
amd64 / test-upstream-t5x / t5x-multi-node (2, 2)
The job running on runner GitHub Actions 138 has exceeded the maximum execution time of 360 minutes.
amd64 / test-upstream-t5x / t5x-multi-node (2, 2)
The operation was canceled.
amd64 / test-upstream-t5x / t5x-multi-node (1, 2)
The job running on runner GitHub Actions 136 has exceeded the maximum execution time of 360 minutes.
amd64 / test-upstream-t5x / t5x-multi-node (4, 2)
The job running on runner GitHub Actions 162 has exceeded the maximum execution time of 360 minutes.
amd64 / test-upstream-t5x / t5x-multi-node (4, 2)
The operation was canceled.
amd64 / test-upstream-t5x / t5x-multi-node (1, 2)
The operation was canceled.
amd64 / test-upstream-t5x / t5x-multi-node (2, 1)
The job running on runner GitHub Actions 137 has exceeded the maximum execution time of 360 minutes.
amd64 / test-upstream-t5x / t5x-multi-node (2, 1)
The operation was canceled.
amd64 / test-upstream-t5x / t5x-multi-node (4, 1)
The job running on runner GitHub Actions 161 has exceeded the maximum execution time of 360 minutes.
amd64 / test-upstream-t5x / t5x-multi-node (4, 1)
The operation was canceled.
amd64 / test-upstream-t5x / outcome
Process completed with exit code 1.
amd64 / test-rosetta-pax / rosetta-pax-multi-node (4, 2, 1, 1)
The job running on runner GitHub Actions 165 has exceeded the maximum execution time of 360 minutes.
amd64 / test-rosetta-pax / rosetta-pax-multi-node (4, 2, 1, 1)
The operation was canceled.
amd64 / test-rosetta-pax / rosetta-pax-multi-node-te (1, 1, 1, 1)
The job running on runner GitHub Actions 170 has exceeded the maximum execution time of 360 minutes.
amd64 / test-rosetta-pax / rosetta-pax-multi-node-te (1, 1, 1, 1)
The operation was canceled.
amd64 / test-rosetta-pax / rosetta-pax-multi-node (4, 2, 1, 2)
The job running on runner GitHub Actions 166 has exceeded the maximum execution time of 360 minutes.
amd64 / test-rosetta-pax / rosetta-pax-multi-node (4, 2, 1, 2)
The operation was canceled.
amd64 / test-rosetta-pax / rosetta-pax-multi-node (1, 8, 1, 1)
The job running on runner GitHub Actions 135 has exceeded the maximum execution time of 360 minutes.
amd64 / test-rosetta-pax / rosetta-pax-multi-node (1, 8, 1, 1)
The operation was canceled.
amd64 / test-rosetta-pax / rosetta-pax-multi-node-te (1, 4, 1, 2)
The job running on runner GitHub Actions 172 has exceeded the maximum execution time of 360 minutes.
amd64 / test-rosetta-pax / rosetta-pax-multi-node-te (1, 4, 1, 2)
The operation was canceled.
amd64 / test-rosetta-pax / single-process-multi-device-te (1, 8, 1, 1)
The job running on runner GitHub Actions 168 has exceeded the maximum execution time of 360 minutes.
amd64 / test-rosetta-pax / single-process-multi-device-te (1, 8, 1, 1)
The operation was canceled.
amd64 / test-rosetta-pax / rosetta-pax-multi-node-te (1, 16, 1, 1)
The job running on runner GitHub Actions 171 has exceeded the maximum execution time of 360 minutes.
amd64 / test-rosetta-pax / rosetta-pax-multi-node-te (1, 16, 1, 1)
The operation was canceled.
amd64 / test-rosetta-pax / rosetta-pax-single-node-dropout-te (1, 8, 1, 1)
The job running on runner GitHub Actions 5 has exceeded the maximum execution time of 360 minutes.
amd64 / test-rosetta-pax / rosetta-pax-multi-node-te (1, 1, 8, 1)
The job running on runner GitHub Actions 169 has exceeded the maximum execution time of 360 minutes.
amd64 / test-rosetta-pax / rosetta-pax-multi-node-te (1, 1, 8, 1)
The operation was canceled.
amd64 / test-rosetta-pax / rosetta-pax-multi-node-te (1, 8, 1, 1)
The job running on runner GitHub Actions 173 has exceeded the maximum execution time of 360 minutes.
amd64 / test-rosetta-pax / rosetta-pax-multi-node-te (1, 8, 1, 1)
The operation was canceled.
amd64 / test-rosetta-pax / single-process-evaluation-te (1, 8, 1, 1)
The job running on runner GitHub Actions 174 has exceeded the maximum execution time of 360 minutes.
amd64 / test-rosetta-pax / single-process-evaluation-te (1, 8, 1, 1)
The operation was canceled.
amd64 / test-rosetta-pax / publish-test / publish
Process completed with exit code 2.
amd64 / test-rosetta-pax / outcome
Process completed with exit code 2.
amd64 / test-rosetta-t5x / vit-single-process-multi-device (8)
The job running on runner GitHub Actions 42 has exceeded the maximum execution time of 360 minutes.
amd64 / test-rosetta-t5x / vit-single-process-multi-device (8)
The operation was canceled.
amd64 / test-rosetta-t5x / vit-multi-gpu-multi-node (8, 2)
The job running on runner GitHub Actions 177 has exceeded the maximum execution time of 360 minutes.
amd64 / test-rosetta-t5x / vit-multi-gpu-multi-node (8, 2)
The operation was canceled.
amd64 / test-rosetta-t5x / vit-multi-gpu-multi-node (1, 1)
The job running on runner GitHub Actions 132 has exceeded the maximum execution time of 360 minutes.
amd64 / test-rosetta-t5x / vit-multi-gpu-multi-node (1, 1)
The operation was canceled.
amd64 / test-rosetta-t5x / single-process-multi-device (1P1G_te-0, 1, --enable-te 0)
The job running on runner GitHub Actions 8 has exceeded the maximum execution time of 360 minutes.
amd64 / test-rosetta-t5x / vit-multi-gpu-multi-node (1, 2)
The job running on runner GitHub Actions 175 has exceeded the maximum execution time of 360 minutes.
amd64 / test-rosetta-t5x / vit-multi-gpu-multi-node (1, 2)
The operation was canceled.
amd64 / test-rosetta-t5x / multi-gpu-multi-node (1N1G-te-1, 1, 1, --gin.train/utils.DatasetConfig.pack=False --gin.train_eva...
The job running on runner GitHub Actions 45 has exceeded the maximum execution time of 360 minutes.
amd64 / test-rosetta-t5x / vit-multi-gpu-multi-node (8, 1)
The job running on runner GitHub Actions 176 has exceeded the maximum execution time of 360 minutes.
amd64 / test-rosetta-t5x / vit-multi-gpu-multi-node (8, 1)
The operation was canceled.
amd64 / test-rosetta-t5x / multi-gpu-multi-node (2N2G_te-0, 2, 2, --enable-te 0)
The job running on runner GitHub Actions 126 has exceeded the maximum execution time of 360 minutes.
amd64 / test-rosetta-t5x / multi-gpu-multi-node (1N8G-te-1, 8, 1, --gin.train/utils.DatasetConfig.pack=False --gin.train_eva...
The job running on runner GitHub Actions 46 has exceeded the maximum execution time of 360 minutes.
amd64 / test-rosetta-t5x / multi-gpu-multi-node (2N8G-te-1, 8, 2, --gin.train/utils.DatasetConfig.pack=False --gin.train_eva...
The job running on runner GitHub Actions 127 has exceeded the maximum execution time of 360 minutes.
amd64 / test-rosetta-t5x / publish-test / publish
Process completed with exit code 2.
amd64 / test-rosetta-t5x / outcome
Process completed with exit code 2.
amd64 / test-pallas / runner / launch-slurm-runner
The job running on runner GitHub Actions 186 has exceeded the maximum execution time of 360 minutes.
amd64 / test-pallas / runner / launch-slurm-runner
The operation was canceled.
amd64 / test-pallas / pallas-unit-test (A100)
This request was automatically failed because there were no enabled runners online to process the request for more than 1 days.

Artifacts

Produced during runtime
Name Size
PAXML-7618372438-16DP1FSDP1TP1PP Expired
160 KB
PAXML-7618372438-1DP1FSDP1TP1PP Expired
9.81 KB
PAXML-7618372438-1DP2FSDP4TP1PP_single_process Expired
9.92 KB
PAXML-7618372438-1DP8FSDP1TP1PP Expired
79.1 KB
PAXML-7618372438-2DP1FSDP1TP4PP Expired
79.1 KB
PAXML-7618372438-2DP1FSDP2TP4PP Expired
160 KB
PAXML-7618372438-4DP1FSDP2TP1PP Expired
79.1 KB
PAXML-7618372438-8DP1FSDP1TP1PP Expired
79.1 KB
PAXML-7618372438-8DP1FSDP1TP1PP_eval Expired
9.89 KB
PAXML-7618372438-8DP1FSDP1TP1PP_single_process Expired
9.92 KB
T5X-7618372438-1P2G Expired
601 KB
T5X-7618372438-1P4G Expired
601 KB
T5X-7618372438-1P8G Expired
601 KB
T5X-7618372438-8G2N Expired
3.3 MB
artifact-base-build-amd64 Expired
385 Bytes
artifact-base-build-arm64 Expired
385 Bytes
artifact-final-report Expired
2.05 KB
artifact-jax-build-amd64 Expired
363 Bytes
artifact-jax-build-arm64 Expired
363 Bytes
artifact-jax-unit-test-A100 Expired
80.3 KB
artifact-jax-unit-test-V100 Expired
75.8 KB
artifact-maxtext-build-amd64 Expired
379 Bytes
artifact-pallas-build-amd64 Expired
375 Bytes
artifact-pallas-unit-test-V100 Expired
861 Bytes
artifact-pax-build-amd64 Expired
399 Bytes
artifact-pax-build-arm64 Expired
399 Bytes
artifact-pax-mgmn-test Expired
358 Bytes
artifact-pax-rosetta-mgmn-testrosetta-pax-7618372438-1DP2FSDP4TP1PP_single_process_TE Expired
10.4 KB
artifact-pax-rosetta-mgmn-testrosetta-pax-7618372438-4DP1FSDP2TP1PP Expired
79.6 KB
artifact-rosetta-build-pax-amd64 Expired
387 Bytes
artifact-rosetta-build-pax-arm64 Expired
387 Bytes
artifact-rosetta-build-t5x-amd64 Expired
387 Bytes
artifact-t5x-build-amd64 Expired
399 Bytes
artifact-t5x-mgmn-test Expired
347 Bytes
artifact-t5x-rosetta-mgmn-testrosetta-T5X-7618372438-1P1G_te-1 Expired
341 KB
artifact-t5x-rosetta-mgmn-testrosetta-T5X-7618372438-1P8G_te-1 Expired
342 KB
artifact-te-mg-integration-test-1P1G Expired
54.7 KB
artifact-te-mg-integration-test-1P2G Expired
55.1 KB
artifact-te-unit-test Expired
6.26 MB
metrics-test-log Expired
60.7 KB