We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
1 parent 4d9840c commit 1d2e4e5Copy full SHA for 1d2e4e5
README.md
@@ -64,7 +64,7 @@ You can also run e.g. ``tune ls knowledge_distillation_distributed`` for a full
64
| | LoRA/QLoRA | ✅ | ✅ | ❌ |
65
| PPO | Full | ✅ | ❌ | ❌ |
66
| | LoRA/QLoRA | ❌ | ❌ | ❌ |
67
-| GRPO | Full | 🚧 | 🚧 | 🚧 |
+| GRPO | Full | 🚧 | ✅ | ✅ |
68
69
70
Example: ``tune run lora_dpo_single_device --config llama3_1/8B_dpo_single_device`` <br />
0 commit comments