[llm bench] Move calculation of memory consumption to memory_monitor tool #1937
to discuss. Is it okay that:
04f7441 to e338fb6
991a7f1 to f37e4ec
@eaidova could you please take a look?
In response to the questions about increased execution time when memory data collection is enabled, an approach based on multiprocessing.Process was also checked within this PR. That approach increases total RSS memory (more than 2x in the compilation phase and about 1.5x in the generation phase for the tiny llama LLM task). This can affect the compilation/generation phase and lead to crashes when out-of-memory limits are hit. Run statistics are below:
Process (forkserver/spawn):
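For readers comparing this with in-process monitoring, here is a minimal sketch of sampling RSS from a background thread inside the measured process, so no extra Python process is started. It is not the code from memory_monitor.py: the names `RssSampler` and `read_rss_bytes` are hypothetical, and the default reader assumes Linux (`/proc/self/statm`).

```python
import os
import threading
import time
from typing import Callable, List


def read_rss_bytes() -> int:
    """Current RSS of this process in bytes (assumption: Linux with /proc)."""
    with open("/proc/self/statm") as f:
        resident_pages = int(f.read().split()[1])  # second field: resident pages
    return resident_pages * os.sysconf("SC_PAGE_SIZE")


class RssSampler:
    """Hypothetical helper: polls an RSS reader from a daemon thread."""

    def __init__(self, read_rss: Callable[[], int] = read_rss_bytes,
                 interval_s: float = 0.01) -> None:
        self._read_rss = read_rss
        self._interval_s = interval_s
        self._stop = threading.Event()
        self._thread = threading.Thread(target=self._run, daemon=True)
        self.samples: List[int] = []

    def _run(self) -> None:
        # Sample until stop() is requested; wait() doubles as the sleep.
        while not self._stop.is_set():
            self.samples.append(self._read_rss())
            self._stop.wait(self._interval_s)

    def start(self) -> None:
        self._thread.start()

    def stop(self) -> None:
        self._stop.set()
        self._thread.join()
        # Final sample so that at least one value is always recorded.
        self.samples.append(self._read_rss())

    @property
    def peak(self) -> int:
        return max(self.samples)


if __name__ == "__main__":
    sampler = RssSampler()
    sampler.start()
    buffer = bytearray(50 * 1024 * 1024)  # stand-in for a compile/generate phase
    time.sleep(0.1)
    sampler.stop()
    print(f"peak RSS: {sampler.peak / 2**20:.1f} MiB over {len(sampler.samples)} samples")
```

Because the sampler shares the process with the workload, its own footprint is included in the reported RSS, but it avoids the duplicated interpreter and imports that come with a spawned or forkserver child.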
memory_monitor.py is taken from https://github.com/openvinotoolkit/nncf/blob/develop/tools/memory_monitor.py
with two custom lines added because of an issue with tkinter found on the text2image pipeline and stable-diffusion-v2-1 with the PyTorch framework:
Task: CVS-162830 CVS-157590