[Feature] Can mask_strategy be made a learnable parameter? #219
Comments
That is a great idea! But we haven't tried it.
Yes, a gated sparse pattern would be cool and easy to adapt.
Hi, have you implemented this? How well does it work?
Motivation
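Could mask_strategy be a layer of learnable parameters? Similar to the gate in MoBA, it could be learned directly during fine-tuning or distillation, which would greatly reduce the engineering adaptation work.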
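To make the idea concrete, here is a minimal sketch of a MoBA-style gate that produces a block-sparse mask from the queries and keys themselves. It is only an illustration under stated assumptions: the function name `gated_block_mask`, the block-level mean pooling of both queries and keys, and the boolean mask layout are hypothetical and are not this project's `mask_strategy` format. The gate has no weights of its own; it would be learned indirectly, because the query/key projections that feed it are updated during fine-tuning or distillation.

```python
import numpy as np

def gated_block_mask(q, k, block_size, top_k):
    """Hypothetical MoBA-style gate: pick the top_k key blocks per query block.

    q, k: arrays of shape (seq_len, dim) for a single attention head.
    Returns a boolean mask of shape (num_blocks, num_blocks), where
    mask[i, j] is True if query block i attends to key block j.
    """
    seq_len, dim = q.shape
    assert seq_len % block_size == 0, "seq_len must be a multiple of block_size"
    num_blocks = seq_len // block_size

    # Mean-pool tokens within each block to get one summary vector per block.
    # (Simplification: queries are pooled per block as well as keys.)
    q_blocks = q.reshape(num_blocks, block_size, dim).mean(axis=1)
    k_blocks = k.reshape(num_blocks, block_size, dim).mean(axis=1)

    # Gate scores: affinity between every query block and every key block.
    scores = q_blocks @ k_blocks.T

    # Keep the top_k highest-scoring key blocks for each query block.
    top_idx = np.argsort(-scores, axis=1)[:, :top_k]
    mask = np.zeros((num_blocks, num_blocks), dtype=bool)
    np.put_along_axis(mask, top_idx, True, axis=1)
    return mask

# Usage: 8 tokens, 4-dim head, blocks of 2 tokens, 2 key blocks per query block.
rng = np.random.default_rng(0)
q = rng.standard_normal((8, 4))
k = rng.standard_normal((8, 4))
print(gated_block_mask(q, k, block_size=2, top_k=2).astype(int))
```

This sketch only shows the forward selection. Training through the hard top-k choice would need an autograd framework and a decision about how gradients reach the gate, which is left open here.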
Related resources
No response