[LoRA] CogView4 #10981
Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
class TokenizerWrapper:
    @staticmethod
    def from_pretrained(*args, **kwargs):
        return AutoTokenizer.from_pretrained("THUDM/glm-4-9b-chat", trust_remote_code=True)


class TextEncoderWrapper:
    @staticmethod
    def from_pretrained(*args, **kwargs):
        config = GlmConfig(hidden_size=32, intermediate_size=8, num_hidden_layers=2, num_attention_heads=4, head_dim=8)
        return GlmModel(config)
I had to do this because I'm not quite sure how to make the trust_remote_code=True
part compatible with our test structure. But it seems like a security hole...
@sayakpaul What are your suggestions for handling this? Is this okay or do we create the dummy checkpoints? Creating a dummy checkpoint for the text encoder is no problem, but I'm not sure how to handle the tokenizer here -- do I just copy the entire repo for tokenization to our hf-internal-testing org?
Thanks for investigating. How about:
- Create a dummy checkpoint for the text encoder and host it under the internal-testing org.
- Copy over the tokenizer files (they are small enough) to that same repo while crediting the source in the model card (a rough sketch follows below).
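A minimal sketch of how that could be done; the repo id is a placeholder, not an existing checkpoint, and the snippet assumes write access to the hf-internal-testing org:

# Hypothetical sketch: push a tiny randomly initialized GLM text encoder and the
# original tokenizer files to a single placeholder testing repo.
from transformers import AutoTokenizer, GlmConfig, GlmModel

repo_id = "hf-internal-testing/tiny-random-glm-cogview4"  # placeholder id

# Tiny, randomly initialized text encoder matching the config used in the tests.
config = GlmConfig(hidden_size=32, intermediate_size=8, num_hidden_layers=2, num_attention_heads=4, head_dim=8)
GlmModel(config).push_to_hub(repo_id)

# Reuse the real tokenizer files in the same repo; credit THUDM/glm-4-9b-chat
# in the model card.
tokenizer = AutoTokenizer.from_pretrained("THUDM/glm-4-9b-chat", trust_remote_code=True)
tokenizer.push_to_hub(repo_id)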
LGTM after the test-related comments have been addressed.
It's also great that finetrainers now supports image models like CogView4. I think that deserves some communication to the community. Let's tackle that later.
@@ -627,6 +635,7 @@ def __call__(
                original_size=original_size,
                target_size=target_size,
                crop_coords=crops_coords_top_left,
                attention_kwargs=attention_kwargs,
Let's add this to docstrings.
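For reference, a possible docstring entry for the new argument, following the wording used for similar kwargs in other diffusers pipelines (the exact phrasing in the final PR may differ):

            attention_kwargs (`dict`, *optional*):
                A kwargs dictionary that if specified is passed along to the `AttentionProcessor` as defined under
                `self.processor` in
                [diffusers.models.attention_processor](https://github.com/huggingface/diffusers/blob/main/src/diffusers/models/attention_processor.py).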
    @unittest.skip(
        "Needs additional debugging. OSError: Incorrect path_or_model_id: ''. Please provide either the path to a local folder or the repo_id of a model on the Hub."
    )
    def test_simple_inference_save_pretrained(self):
        pass
This seems like an important test -- should we debug further? I can help look into it after the dummy checkpoints for the text encoder and tokenizer are created. I won't mind passing additional pretrained_kwargs to the tokenizer and text encoder if needed.
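For illustration only, a sketch of one way the tokenizer wrapper could forward pretrained kwargs once a dummy repo exists; the repo id is a placeholder and this is not part of the PR:

# Hypothetical sketch: fall back to a placeholder dummy repo when the test passes
# an empty path, while still forwarding any extra pretrained kwargs.
from transformers import AutoTokenizer

DUMMY_REPO = "hf-internal-testing/tiny-random-glm-cogview4"  # placeholder id


class TokenizerWrapper:
    @staticmethod
    def from_pretrained(pretrained_model_name_or_path=None, *args, **kwargs):
        # Fall back to the dummy repo when the caller passes an empty path.
        path = pretrained_model_name_or_path or DUMMY_REPO
        return AutoTokenizer.from_pretrained(path, *args, **kwargs)

TextEncoderWrapper could forward its kwargs the same way.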
Wandb logs: https://wandb.ai/aryanvs/finetrainers-cogview4
Checkpoints:
Training: