Bump Transformer v4.49.0 #1735
Conversation
Force-pushed from 2d8bcb8 to a0a1eaa
Basically LGTM! Please update the PR description to note that after this PR, the MPT models on the Hugging Face Hub will no longer be usable with trust_remote_code=True, and older foundry versions will need to be used for that. Please also adjust the MPT error message to say that MPT is no longer supported by foundry.
Change the models in the YAMLs to 8b; otherwise LGTM!
🚨🚨🚨 MPT models on the Hugging Face Hub will no longer be usable with trust_remote_code=True, and older foundry versions will need to be used for that. 🚨🚨🚨
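For anyone who still needs those checkpoints with remote code, a minimal sketch, assuming an environment pinned to a pre-4.49 transformers and a correspondingly older foundry (the version pin below is illustrative, not from this PR):

```python
# Sketch only: run in an environment pinned before this bump, e.g.:
#   pip install "transformers<4.49"  # illustrative pin
from transformers import AutoModelForCausalLM

# mosaicml/mpt-7b is one of the hub MPT checkpoints affected by this change.
model = AutoModelForCausalLM.from_pretrained(
    'mosaicml/mpt-7b',
    trust_remote_code=True,  # executes the modeling code stored with the checkpoint
)
```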
A couple of interesting notes:
Transformers tokenizers have added an extra_special_tokens attribute and a new AddedToken object for special tokens.

Seems we have to set torch_dtype explicitly, because transformers converts torch_dtype to a string for JSON serialization.
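A minimal sketch of both behaviors, assuming transformers >= 4.49 (the gpt2 checkpoint and the sentinel token string are just examples):

```python
import torch
from transformers import AddedToken, AutoConfig, AutoTokenizer

# AddedToken controls how a special token is matched and normalized.
tokenizer = AutoTokenizer.from_pretrained('gpt2')
tokenizer.add_special_tokens(
    {'additional_special_tokens': [AddedToken('<my_sentinel>', special=True)]}
)

# PretrainedConfig serializes torch_dtype as a plain string for JSON,
# so code that round-trips configs must set the dtype back explicitly.
config = AutoConfig.from_pretrained('gpt2')
config.torch_dtype = torch.bfloat16
print(config.to_dict()['torch_dtype'])  # 'bfloat16' (a str, not torch.bfloat16)
```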
Looks like they've gotten rid of LlamaFlashAttention2 and refactored attention: huggingface/transformers#35235
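A hedged sketch of what this looks like after the refactor: backend-specific classes like LlamaFlashAttention2 are gone, and the attention backend is selected via attn_implementation (the checkpoint name is illustrative, not tied to this PR):

```python
import torch
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(
    'meta-llama/Llama-2-7b-hf',  # example checkpoint
    torch_dtype=torch.bfloat16,
    attn_implementation='sdpa',  # or 'eager' / 'flash_attention_2'
)
print(model.config._attn_implementation)  # 'sdpa'
```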
Seems like our test was actually hitting this strange interaction: huggingface/transformers#30305