Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Metrics for memory zones #1198

Open
sharnoff opened this issue Jan 8, 2025 · 1 comment
Open

Metrics for memory zones #1198

sharnoff opened this issue Jan 8, 2025 · 1 comment
Labels
c/autoscaling/neonvm Component: autoscaling: NeonVM

Comments

@sharnoff
Copy link
Member

sharnoff commented Jan 8, 2025

Follow-up to INC-361

Problem description / Motivation

Sometimes we get OOM-kills on our VMs, even though they have vm.overcommit_memory=2, because we're out of ZONE_NORMAL and kernel allocations fail.

When this kind of thing happens, it's often hard to validate, and we don't have any way to check how close we are to running out.

Feature idea(s) / DoD

We should expose memory usage per ZONE_NORMAL / ZONE_MOVABLE / etc, from each VM.

Implementation ideas

IMO this is probably unlikely to be included by any standard metrics exporters, so maybe we collect this from neonvm-daemon?

@sharnoff sharnoff added the c/autoscaling/neonvm Component: autoscaling: NeonVM label Jan 8, 2025
@mickael-carl
Copy link

Just noting here that node_exporter supports that metric since 2021. I'd like to suggest switching away from vector for host metrics for this reason also 🙂

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
c/autoscaling/neonvm Component: autoscaling: NeonVM
Projects
None yet
Development

No branches or pull requests

2 participants