[Feature] Make HeavyBall compatible with FSDP2 (DTensor) #15
Comments
Interesting, thank you for raising an issue.
I have it working with FSDP now, though there are a few sharp edges that need adjusting, especially around gradient clipping and checkpoint saving.
That is fair enough. I wish there was compatibility, but it seems it can be quite some work to get there. I will have a look at https://github.com/facebookresearch/optimizers/, which seems to support a distributed version of Shampoo.
@casper-hansen It somewhat comes down to whether memory management or speed is the higher priority.
Here is an example traceback of me trying to use `heavyball` in TorchTitan. It seems that `heavyball` is not yet compatible with FSDP2 because some of the utilities operate on tensors instead of DTensors.
Code:
Traceback: