Add documentation and scripts for Pax benchmarking using synthetic dataset #1673

Triggered via pull request: January 22, 2024 22:15
Status: Failure
Total duration: 7h 4m 47s
Artifacts: 67
This run and associated checks have been archived and are scheduled for deletion.

ci.yaml

on: pull_request
metadata: 0s
Matrix: amd64 / test-distribution / test-create-distribution
amd64 / build-base / build-base: 4m 50s
Matrix: arm64 / test-distribution / test-create-distribution
arm64 / build-base / build-base: 4m 44s
amd64 / build-jax / build-jax: 13m 59s
arm64 / build-jax / build-jax: 22m 39s
amd64 / build-pax / build-upstream-pax: 6m 29s
amd64 / build-t5x / build-upstream-t5x: 8m 6s
Matrix: amd64 / test-jax / jax-unit-test
amd64 / test-jax / runner / launch-slurm-runner: 1h 2m
amd64 / build-pallas / build-pallas: 20m 33s
amd64 / build-maxtext / build-maxtext: 4m 47s
arm64 / build-pax / build-upstream-pax: 25m 36s
arm64 / build-t5x / build
Matrix: arm64 / test-jax / jax-unit-test (waiting for pending jobs)
arm64 / test-jax / runner / launch-slurm-runner
arm64 / build-pallas / build
arm64 / build-maxtext / build
amd64 / build-rosetta-pax / build-rosetta: 6m 35s
Matrix: amd64 / test-upstream-pax / pax-multi-node
Matrix: amd64 / test-upstream-pax / single-process-evaluation
Matrix: amd64 / test-upstream-pax / single-process-multi-device
Matrix: amd64 / test-te / te-multi-gpu
amd64 / test-te / te-unit-tests: 21m 52s
amd64 / build-rosetta-t5x / build-rosetta: 10m 36s
Matrix: amd64 / test-upstream-t5x / t5x-multi-gpu
Matrix: amd64 / test-upstream-t5x / t5x-multi-node
Matrix: amd64 / test-pallas / pallas-unit-test
amd64 / test-pallas / runner / launch-slurm-runner: 6h 0m
arm64 / build-rosetta-pax / build-rosetta: 16m 37s
Matrix: arm64 / test-upstream-pax / pax-multi-node (waiting for pending jobs)
Matrix: arm64 / test-upstream-pax / single-process-evaluation (waiting for pending jobs)
Matrix: arm64 / test-upstream-pax / single-process-multi-device (waiting for pending jobs)
Matrix: arm64 / test-te / te-multi-gpu (waiting for pending jobs)
arm64 / test-te / te-unit-tests
arm64 / build-rosetta-t5x / build-rosetta
Matrix: arm64 / test-upstream-t5x / t5x-multi-gpu (waiting for pending jobs)
Matrix: arm64 / test-upstream-t5x / t5x-multi-node (waiting for pending jobs)
Matrix: arm64 / test-pallas / pallas-unit-test (waiting for pending jobs)
arm64 / test-pallas / runner / launch-slurm-runner
Matrix: amd64 / test-rosetta-pax / rosetta-pax-multi-node-te
Matrix: amd64 / test-rosetta-pax / rosetta-pax-multi-node
Matrix: amd64 / test-rosetta-pax / rosetta-pax-single-node-dropout-te
Matrix: amd64 / test-rosetta-pax / single-process-evaluation-te
Matrix: amd64 / test-rosetta-pax / single-process-multi-device-te
amd64 / test-upstream-pax / summary: 0s
amd64 / test-upstream-pax / metrics: 45s
Matrix: amd64 / test-rosetta-t5x / multi-gpu-multi-node
Matrix: amd64 / test-rosetta-t5x / single-process-multi-device
Matrix: amd64 / test-rosetta-t5x / vit-multi-gpu-multi-node
Matrix: amd64 / test-rosetta-t5x / vit-single-process-multi-device
amd64 / test-upstream-t5x / summary: 0s
amd64 / test-upstream-t5x / metrics: 1m 2s
Matrix: arm64 / test-rosetta-pax / rosetta-pax-multi-node-te (waiting for pending jobs)
Matrix: arm64 / test-rosetta-pax / rosetta-pax-multi-node (waiting for pending jobs)
Matrix: arm64 / test-rosetta-pax / rosetta-pax-single-node-dropout-te (waiting for pending jobs)
Matrix: arm64 / test-rosetta-pax / single-process-evaluation-te (waiting for pending jobs)
Matrix: arm64 / test-rosetta-pax / single-process-multi-device-te (waiting for pending jobs)
arm64 / test-upstream-pax / summary
arm64 / test-upstream-pax / metrics
Matrix: arm64 / test-rosetta-t5x / multi-gpu-multi-node (waiting for pending jobs)
Matrix: arm64 / test-rosetta-t5x / single-process-multi-device (waiting for pending jobs)
Matrix: arm64 / test-rosetta-t5x / vit-multi-gpu-multi-node (waiting for pending jobs)
Matrix: arm64 / test-rosetta-t5x / vit-single-process-multi-device (waiting for pending jobs)
arm64 / test-upstream-t5x / summary
arm64 / test-upstream-t5x / metrics
amd64 / test-rosetta-pax / summary: 0s
amd64 / test-rosetta-pax / metrics: 45s
amd64 / test-upstream-pax / sitrep / sitrep: 18s
amd64 / test-rosetta-t5x / summary: 0s
amd64 / test-rosetta-t5x / publish-test / publish: 17s
amd64 / test-upstream-t5x / sitrep / sitrep: 24s
arm64 / test-rosetta-pax / summary
arm64 / test-rosetta-pax / metrics
arm64 / test-upstream-pax / sitrep / sitrep
arm64 / test-rosetta-t5x / summary
arm64 / test-rosetta-t5x / publish-test / publish
arm64 / test-upstream-t5x / sitrep / sitrep
amd64 / test-rosetta-pax / publish-test / publish: 32s
amd64 / test-upstream-pax / outcome: 0s
amd64 / test-rosetta-t5x / outcome: 0s
amd64 / test-upstream-t5x / outcome: 0s
arm64 / test-rosetta-pax / publish-test / publish
arm64 / test-upstream-pax / outcome
arm64 / test-rosetta-t5x / outcome
arm64 / test-upstream-t5x / outcome
amd64 / test-rosetta-pax / outcome: 0s
arm64 / test-rosetta-pax / outcome
finalize / publish-badge: 0s

Annotations

4 errors
amd64 / test-rosetta-pax / outcome
Process completed with exit code 1.
amd64 / test-pallas / runner / launch-slurm-runner
The job running on runner GitHub Actions 7 has exceeded the maximum execution time of 360 minutes.
amd64 / test-pallas / runner / launch-slurm-runner
The operation was canceled.
amd64 / test-pallas / pallas-unit-test (A100)
The self-hosted runner: A100-7618045604 lost communication with the server. Verify the machine is running and has a healthy network connection. Anything in your workflow that terminates the runner process, starves it for CPU/Memory, or blocks its network access can cause this error.

Artifacts

Produced during runtime
Name  Status  Size
PAXML-7618045604-16DP1FSDP1TP1PP  Expired  16 MB
PAXML-7618045604-1DP1FSDP1TP1PP  Expired  1.93 MB
PAXML-7618045604-1DP2FSDP4TP1PP_single_process  Expired  1.95 MB
PAXML-7618045604-1DP8FSDP1TP1PP  Expired  8.48 MB
PAXML-7618045604-2DP1FSDP1TP4PP  Expired  5.6 MB
PAXML-7618045604-2DP1FSDP2TP4PP  Expired  10.8 MB
PAXML-7618045604-4DP1FSDP2TP1PP  Expired  8.58 MB
PAXML-7618045604-8DP1FSDP1TP1PP  Expired  8.47 MB
PAXML-7618045604-8DP1FSDP1TP1PP_eval  Expired  1.88 MB
PAXML-7618045604-8DP1FSDP1TP1PP_single_process  Expired  1.94 MB
T5X-7618045604-1G1N  Expired  601 KB
T5X-7618045604-1G2N  Expired  786 KB
T5X-7618045604-1P1G  Expired  601 KB
T5X-7618045604-1P2G  Expired  601 KB
T5X-7618045604-1P4G  Expired  601 KB
T5X-7618045604-1P8G  Expired  601 KB
T5X-7618045604-2G1N  Expired  786 KB
T5X-7618045604-2G2N  Expired  1.13 MB
T5X-7618045604-4G1N  Expired  1.13 MB
T5X-7618045604-4G2N  Expired  1.85 MB
T5X-7618045604-8G1N  Expired  1.85 MB
T5X-7618045604-8G2N  Expired  3.3 MB
artifact-base-build-amd64  Expired  385 Bytes
artifact-base-build-arm64  Expired  385 Bytes
artifact-jax-build-amd64  Expired  363 Bytes
artifact-jax-build-arm64  Expired  363 Bytes
artifact-jax-unit-test-A100  Expired  80.3 KB
artifact-jax-unit-test-V100  Expired  75.8 KB
artifact-maxtext-build-amd64  Expired  379 Bytes
artifact-pallas-build-amd64  Expired  375 Bytes
artifact-pallas-unit-test-V100  Expired  861 Bytes
artifact-pax-build-amd64  Expired  399 Bytes
artifact-pax-build-arm64  Expired  399 Bytes
artifact-pax-mgmn-test  Expired  5.27 KB
artifact-rosetta-build-pax-amd64  Expired  363 Bytes
artifact-rosetta-build-pax-arm64  Expired  363 Bytes
artifact-rosetta-build-t5x-amd64  Expired  363 Bytes
artifact-t5x-build-amd64  Expired  399 Bytes
artifact-t5x-mgmn-test  Expired  11.8 KB
integration-test-logs  Expired  221 KB
metrics-test-log  Expired  117 KB
rosetta-T5X-7618045604-1N1G-te-1  Expired  341 KB
rosetta-T5X-7618045604-1N8G-te-1  Expired  930 KB
rosetta-T5X-7618045604-1P1G_te-0  Expired  305 KB
rosetta-T5X-7618045604-1P1G_te-1  Expired  341 KB
rosetta-T5X-7618045604-1P8G_te-1  Expired  341 KB
rosetta-T5X-7618045604-2N2G_te-0  Expired  395 KB
rosetta-T5X-7618045604-2N8G-te-1  Expired  1.61 MB
rosetta-VIT-7618045604-VIT1G1N  Expired  320 KB
rosetta-VIT-7618045604-VIT1G2N  Expired  342 KB
rosetta-VIT-7618045604-VIT1P8G  Expired  322 KB
rosetta-VIT-7618045604-VIT8G1N  Expired  475 KB
rosetta-VIT-7618045604-VIT8G2N  Expired  655 KB
rosetta-pax-7618045604-16DP1FSDP1TP1PP_TE  Expired  8.07 MB
rosetta-pax-7618045604-1DP1FSDP1TP1PP_TE  Expired  1.19 MB
rosetta-pax-7618045604-1DP2FSDP4TP1PP_single_process_TE  Expired  900 KB
rosetta-pax-7618045604-1DP8FSDP1TP1PP_TE  Expired  4.24 MB
rosetta-pax-7618045604-2DP1FSDP1TP4PP  Expired  4.87 MB
rosetta-pax-7618045604-2DP1FSDP2TP4PP  Expired  9.36 MB
rosetta-pax-7618045604-4DP1FSDP2TP1PP  Expired  7.41 MB
rosetta-pax-7618045604-4DP1FSDP2TP1PP_TE  Expired  4.31 MB
rosetta-pax-7618045604-8DP1FSDP1TP1PP  Expired  7.3 MB
rosetta-pax-7618045604-8DP1FSDP1TP1PP_TE  Expired  4.23 MB
rosetta-pax-7618045604-8DP1FSDP1TP1PP_eval_TE  Expired  1.37 MB
rosetta-pax-7618045604-8DP1FSDP1TP1PP_single_process_TE  Expired  894 KB
rosetta-pax-7618045604-8DP_TE_dropout  Expired  4.26 MB
unit-test-logs  Expired  6.26 MB