Add llama3.3-70b-instruct model configs, tests and HF golden logits #1282

hengtaoguo · 2025-02-18T23:59:06Z

Description

Add support for llama3.3-70b-instruct for maxtext. This model is of the same architecture of llama3.1-70b. Upload golden logits from HuggingFace. Attach converted scanned&unscanned checkpoints paths in 2_test_llama3.3_70b_instruct.sh.

FIXES: b/395908473

Tests

Golden logits match with KL divergence below threshold: screenshot.
Train pipeline works for this model: screenshot.

Checklist

Before submitting this PR, please make sure (put X in square brackets):

I have performed a self-review of my code.
I have necessary comments in my code, particularly in hard-to-understand areas.
I have run end-to-end tests tests and provided workload links above if applicable.
I have made or will make corresponding changes to the doc if needed.

hengtaoguo marked this pull request as ready for review February 19, 2025 00:38

hengtaoguo requested review from gobbleturk, khatwanimohit, bvandermoon, vipannalla and RissyRan as code owners February 19, 2025 00:38

hengtaoguo force-pushed the hengtaoguo-llama33 branch from e7ddadf to 8d75bc8 Compare February 20, 2025 18:06

hengtaoguo changed the title ~~Add llama3.3-70b-instruct model configs and HF golden logits~~ Add llama3.3-70b-instruct model configs, tests and HF golden logits Feb 20, 2025

merge conflicts

6f20794

hengtaoguo force-pushed the hengtaoguo-llama33 branch from aaeceec to 6f20794 Compare February 20, 2025 22:41

Merge branch 'main' into hengtaoguo-llama33

aab0e9e

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add llama3.3-70b-instruct model configs, tests and HF golden logits #1282

Add llama3.3-70b-instruct model configs, tests and HF golden logits #1282

hengtaoguo commented Feb 18, 2025 •

edited

Loading

Add llama3.3-70b-instruct model configs, tests and HF golden logits #1282

Are you sure you want to change the base?

Add llama3.3-70b-instruct model configs, tests and HF golden logits #1282

Conversation

hengtaoguo commented Feb 18, 2025 • edited Loading

Description

Tests

Checklist

hengtaoguo commented Feb 18, 2025 •

edited

Loading