Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

从DeepSeekV3开始,训练DeepSeek-R1的过程,需要多少的GPU资源,花费多少时间? #485

Open
tomchaozhou75 opened this issue Feb 23, 2025 · 2 comments

Comments

@tomchaozhou75
Copy link

从DeepSeekV3开始,训练DeepSeek-R1的过程,需要多少的GPU资源,花费多少时间?
是否可以披露类似V3论文的数据,训练V3是需要2048 H800 GPUs。

@YILIYAF
Copy link

YILIYAF commented Feb 23, 2025

直接从r1微调可以吧

@YILIYAF
Copy link

YILIYAF commented Feb 23, 2025

抱歉,原来没开源

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants