Allow snapshot tap changes #4731

andrewla · 2024-08-15T21:22:50Z

Changes

Allow renaming of tap devices on snapshot restore

Reason

In some scenarios it is not possible to use the jailer, especially in limited privilege environments where the security is external to firecracker itself. But in these cases a snapshot may have to use a different tap device than the one that it was using when it was snapshotted.

License Acceptance

By submitting this pull request, I confirm that my contribution is made under
the terms of the Apache 2.0 license. For more information on following Developer
Certificate of Origin and signing off your commits, please check
CONTRIBUTING.md.

PR Checklist

If a specific issue led to this PR, this PR closes the issue.
The description of changes is clear and encompassing.
Any required documentation changes (code and docs) are included in this
PR.
API changes follow the Runbook for Firecracker API changes.
User-facing changes are mentioned in CHANGELOG.md.
All added/changed functionality is tested.
New TODOs link to an issue.
Commits meet
contribution quality standards.

This functionality cannot be added in rust-vmm.

pb8o · 2024-08-19T10:27:22Z

Hi @andrewla thank you for your contribution! We would like to understand the use case better in case it can be resolved through other means first. We recommend using a network namespace where you can create TAP devices with the same name, but that probably requires CAP_SYS_ADMIN, which I understand is what you mean with "limited privilege environments".

Could you elaborate on your use case? Is there a way you could create the namespace in a privileged setting and then use something like nsenter firecracker ...?

andrewla · 2024-08-19T20:41:12Z

That assessment is correct -- basically to run the jailer in a network namespace you need the setns syscall which requires CAP_SYS_ADMIN. So nsenter is not an option.

Our particular case is running in a containerized environment where our privileges are limited by the nature of the general environment. Once we're in our particular container we have lost all relevant privileges.

pb8o · 2024-08-29T14:07:38Z

Hi again @andrewla, we have been talking internally about this PR and we may need to spend some time to decide on the API aspects of it to make sure it doesn't conflict with other efforts.

In the meantime, we thought of another workaround. The snapshot-editor could be enhanced to rename the tap devices in an snapshot file. That would be an easier decision for us, but we want to make sure it would handle your use case.

For example we imagine the tool would work like this:

snapshot-editor edit-vmstate rename-network eth0 tap1

Would this work within your environment?

andrewla · 2024-09-03T17:26:23Z

This was our initial approach as it required minimal changes. But we found that the performance cost of making the copy (as opposed to hardlinking) during the operation (plus serde costs) were more expensive than we were willing to tolerate in our environment.

andrewla · 2024-10-10T19:12:27Z

Hi @pb8o -- is there anything we can do to help move this forward?

pb8o · 2024-10-16T09:46:36Z

Hi @andrewla I haven't had time to look at this, but this is next on my list now. Thanks for your patience!

kanpov · 2024-11-01T19:21:16Z

On a related note, another reason why renaming the tap device is a better approach than namespaced NAT from the "Network for Clones" guide is that the namespaced NAT imposes measurable overhead onto the host kernel due to the addition of about 5 more iptables/nft rules, plus an RTNETLINK route for forwarding the guest IP out of the netns.

Even though I made an effort to support namespaced NAT in fcnet, it increased complexity by a factor of 4-5x in comparison to regular NAT only to support one usecase: two simultaneous microVM clones. So I'd be in favor of this change, or a snapshot-editor equivalent.

pb8o · 2025-01-08T17:46:34Z

Hello @andrewla ! I apologize for the long time between updates, but some other stuff came up. So we have decided to go ahead with this. I gave a first initial review and I only have some minor comments, but mostly looks good to me. I just have a question if the network_overrides field also works when starting from a JSON config file.

tests/integration_tests/functional/test_snapshot_basic.py

tests/framework/microvm.py

CHANGELOG.md

docs/snapshotting/network-for-clones.md

andrewla · 2025-01-09T17:38:50Z

Re: config -- currently there is no config support for snapshots (https://github.com/firecracker-microvm/firecracker/blob/main/src/vmm/src/resources.rs) -- the snapshot configuration and restore has to be done with a running firecracker instance

bchalios

It generally looks good and thanks for the contribution Andrew.

A few comments/questions from me.

Also, I've commented this for the documentation changes, but could you please squash as well the commits for the test changes into a single commit?

src/firecracker/swagger/firecracker.yaml

src/vmm/src/builder.rs

tests/framework/microvm.py

tests/integration_tests/functional/test_snapshot_basic.py

CHANGELOG.md

bchalios · 2025-01-10T10:56:24Z

docs/snapshotting/network-for-clones.md

+
+This may require reconfiguration of the networking inside the VM so that it is
+still routable externally.
+


Could you please add an example here of how this can be used.

Added some sample code and more detail.

It would be great if the example include the steps to start a microVM with a certain network config (it could be the one from "Getting started") then take a snapshot and, finally, load this snapshot in a different host with a different TAP device (show the snapshot load command with the override).

The problem with this in principle is that it does require that the VM reconfigure its networking to match the expectations of the external routing. This complexity is easily managed in the real world where you will have some channel of communication from host to guest (vsocks or swapping block devices or mmds or vmgenid) but hard to capture in a small example.

That is fine. I think we can still add something useful here. The guest configuration you have already is fine. I would add something like that:

For example, if we have a network interface named `eth0` in the snapshotted microVM. We can override it during snapshot resume, like this: curl --unix-socket /tmp/firecracker.socket -i \ -X PUT 'http://localhost/snapshot/load' \ -H 'Accept: application/json' \ -H 'Content-Type: application/json' \ -d '{ "snapshot_path": "./snapshot_file", "mem_backend": { "backend_path": "./mem_file", "backend_type": "File" }, "enable_diff_snapshots": true, "resume_vm": false, "network_overrides": [ { iface_id: "eth0", host_dev_name": "vmtap01" } ] }'

after this bit:

In this case you can use the network_overrides parameter to snapshot restore
to specify which network device (based on the name inside the VM, such as
"eth0") maps to which host tap device (e.g. "vmtap01").

CHANGELOG.md

pb8o · 2025-01-13T13:47:26Z

src/vmm/src/vmm_config/snapshot.rs

+    /// The network devices to override on load.
+    pub network_overrides: Vec<NetworkOverride>,


According to the integration tests failures, LoadSnapshotConfig just below also needs to be updated with this.

Any ideas on what the problem is here? The network_overrides field is present in both structs, and for the life of me I can't see where this error message is originating -- the field is in the yaml, it's in both the config and params struct. I don't have an easy callstack for the firecracker side of the failure, but that's where I'll start looking next, but any insights here would be appreciated

Hi @andrewla I think what is happening is that the test that fails: test_check_vulnerability_files_ab is an A/B test, and is running the new tests with the Firecracker from current main. A workaround would be to not pass network_overrides if it's empty. I will write a comment where I think it should be done.

This does not match what I'm seeing in CI, I see a number of tests failing with the messages

E RuntimeError: ('An error occurred when deserializing the json body of a request: unknown field `network_overrides`, expected one of `snapshot_path`, `mem_file_path`, `mem_backend`, `enable_diff_snapshots`, `resume_vm` at line 1 column 173.', {'fault_message': 'An error occurred when deserializing the json body of a request: unknown field `network_overrides`, expected one of `snapshot_path`, `mem_file_path`, `mem_backend`, `enable_diff_snapshots`, `resume_vm` at line 1 column 173.'}, <Response [400]>)

Mostly these are in integration_tests/security/test_vulnerabilities.py, but not all tests in that file fail consistently. The _ab tests fail but not consistently.

I am unable to reproduce any of this locally -- the tests pass when I run them specifically.

I still think it's that. Try it with the microvm.py patch I mentioned in #4731 (comment)

To reproduce locally try this:

BUILDKITE_PULL_REQUEST=true BUILDKITE_PULL_REQUEST_BASE_BRANCH=main ./tools/devtool test -- -n8 --dist worksteal integration_tests/security/test_vulnerabilities.py::test_check_vulnerability_files_ab

I'll try applying the patch to verify. Trying the command you give has the tests fail with a different error:

framework/microvm.py:206: in __init__ assert fc_binary_path.exists() E AssertionError

I don't know exactly where I've gone wrong that the firecracker binary is not in the right place in the test framework. I'll try from scratch, and in the meantime I'll submit the suggested patch to see if it works in CI

A clean build does not fix this; I get this error when running from scratch. All of the old firecracker binaries are present under build/img/x86_64/firecracker; I have no idea why this doesn't work, or why even in CI some of the ab tests work.

The other failing test is the spectre/meltdown test which also appears to have an ab mode based on whether it is a PR or not, so likely failing for the same reason.

Is there anything I have to do to get buildkite to run the CI? After pushing changes it goes into a "blocked" mode; a 24 hr turnaround on CI is useless when I can't repro the failure locally.

Hi, this is because devtool test apparently only tried to build the "B" binary, but never the "A" binary. In CI, it works because there's a separate step that builds the binary so that they can be shared across all runners. Can you try running ./tools/devtool build --rev main and then rerun the test command pablo posted? That should create the firecracker binaries compiled from main in the locations where the test script looks for them.

Now it can find the binaries (under build/main/..., I see now) but it fails elsewhere still. Error looks like

framework/microvm.py:678: in _wait_create os.stat(self.jailer.api_socket_path()) E FileNotFoundError: [Errno 2] No such file or directory: '/srv/jailer/firecracker/594eec23-7064-4306-9acd-6a6e137c3d30/root/run/firecracker.socket'

Ah, it needs to be ./tools/devtool build --rev main --release (although it should also work without --release, wonder why that breaks 🤔), sorry!

I've also opened a PR to make devtool do that automatically + add some documentation: #4998

docs/snapshotting/network-for-clones.md

tests/framework/microvm.py

In some scenarios it is not possible to use the jailer, especially in limited privilege environments where the security is external to firecracker itself. But in these cases a snapshot may have to use a different tap device than the one that it was using when it was snapshotted. Signed-off-by: Andrew Laucius <[email protected]>

Test that we can correctly parse configuration and API calls in a backwards compatible way. Signed-off-by: Andrew Laucius <[email protected]>

Documenting the ability to rename network interfaces on snapshot restore. Signed-off-by: Andrew Laucius <[email protected]>

codecov · 2025-01-15T09:56:13Z

Codecov Report

Attention: Patch coverage is 21.42857% with 11 lines in your changes missing coverage. Please review.

Project coverage is 83.03%. Comparing base (3fb06e9) to head (c979583).

Files with missing lines	Patch %	Lines
src/vmm/src/persist.rs	15.38%	11 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #4731      +/-   ##
==========================================
- Coverage   83.06%   83.03%   -0.04%     
==========================================
  Files         244      244              
  Lines       26658    26671      +13     
==========================================
+ Hits        22144    22146       +2     
- Misses       4514     4525      +11

Flag	Coverage Δ
5.10-c5n.metal	`83.54% <21.42%> (-0.04%)`	⬇️
5.10-m5n.metal	`83.53% <21.42%> (-0.03%)`	⬇️
5.10-m6a.metal	`82.73% <21.42%> (-0.05%)`	⬇️
5.10-m6g.metal	`79.40% <21.42%> (-0.04%)`	⬇️
5.10-m6i.metal	`83.51% <21.42%> (-0.04%)`	⬇️
5.10-m7g.metal	`79.40% <21.42%> (-0.04%)`	⬇️
6.1-c5n.metal	`83.54% <21.42%> (-0.03%)`	⬇️
6.1-m5n.metal	`83.52% <21.42%> (-0.05%)`	⬇️
6.1-m6a.metal	`82.73% <21.42%> (-0.04%)`	⬇️
6.1-m6g.metal	`79.39% <21.42%> (-0.05%)`	⬇️
6.1-m6i.metal	`83.51% <21.42%> (-0.05%)`	⬇️
6.1-m7g.metal	`79.40% <21.42%> (-0.04%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

changes done

bchalios · 2025-01-16T09:01:59Z

src/vmm/src/persist.rs

+        let net_devices = &mut microvm_state.device_states.net_devices;
+        if let Some(device) = net_devices
+            .iter_mut()
+            .find(|x| x.device_state.id == entry.iface_id)
+        {
+            device
+                .device_state
+                .tap_if_name
+                .clone_from(&entry.host_dev_name);
+        } else {
+            return Err(SnapshotStateFromFileError::UnknownNetworkDevice.into());


These lines are uncovered from unit tests? Can we add one (or extend an existing) to try to override a non-existing interface?

I'm investigating this (once I get the tests to pass) because this is really the core of the change, and if the tests are actually running then this code should be exercised. I'm concerned that my test is accidentally a noop but I need to validate this.

Passing in the new flag breaks tests that compare behavior to main. Signed-off-by: Andrew Laucius <[email protected]>

andrewla force-pushed the allow-snapshot-tap-changes branch from 7991d9f to 8d1a0a9 Compare August 15, 2024 21:24

andrewla force-pushed the allow-snapshot-tap-changes branch 5 times, most recently from 3415816 to 03c3be9 Compare August 26, 2024 20:43

andrewla force-pushed the allow-snapshot-tap-changes branch from 03c3be9 to 265ea94 Compare August 27, 2024 13:24

andrewla marked this pull request as ready for review August 27, 2024 13:25

andrewla requested review from xmarcalx, kalyazin and pb8o as code owners August 27, 2024 13:25

pb8o reviewed Jan 8, 2025

View reviewed changes

andrewla requested a review from Manciukic as a code owner January 8, 2025 19:28

andrewla force-pushed the allow-snapshot-tap-changes branch 2 times, most recently from d8c5a44 to ea62e9a Compare January 9, 2025 19:00

bchalios requested changes Jan 10, 2025

View reviewed changes

andrewla force-pushed the allow-snapshot-tap-changes branch 4 times, most recently from 0be3ff7 to bf47436 Compare January 10, 2025 15:02

pb8o mentioned this pull request Jan 10, 2025

[Snaps] Allow resource renaming during snapshot restore #4992

Open

3 tasks

pb8o reviewed Jan 13, 2025

View reviewed changes

pb8o previously requested changes Jan 13, 2025

View reviewed changes

docs/snapshotting/network-for-clones.md Show resolved Hide resolved

andrewla force-pushed the allow-snapshot-tap-changes branch from eb815a2 to 97838cb Compare January 13, 2025 21:37

pb8o reviewed Jan 14, 2025

View reviewed changes

tests/framework/microvm.py Outdated Show resolved Hide resolved

andrewla force-pushed the allow-snapshot-tap-changes branch from 97838cb to 07a8237 Compare January 14, 2025 18:13

andrewla added 3 commits January 14, 2025 16:18

Tests for snapshot network renames

0cfc2ed

Test that we can correctly parse configuration and API calls in a backwards compatible way. Signed-off-by: Andrew Laucius <[email protected]>

Adding documentation and changelog

837e744

Documenting the ability to rename network interfaces on snapshot restore. Signed-off-by: Andrew Laucius <[email protected]>

andrewla force-pushed the allow-snapshot-tap-changes branch from 07a8237 to 837e744 Compare January 14, 2025 21:18

andrewla force-pushed the allow-snapshot-tap-changes branch from 7e2c75f to 5bcdf11 Compare January 15, 2025 17:58

bchalios reviewed Jan 16, 2025

View reviewed changes

Remove network_overrides if empty for backcompat

ff7fabd

Passing in the new flag breaks tests that compare behavior to main. Signed-off-by: Andrew Laucius <[email protected]>

andrewla force-pushed the allow-snapshot-tap-changes branch from 5bcdf11 to ff7fabd Compare January 16, 2025 21:59

Merge branch 'main' into allow-snapshot-tap-changes

c979583

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow snapshot tap changes #4731

Allow snapshot tap changes #4731

andrewla commented Aug 15, 2024 •

edited

Loading

pb8o commented Aug 19, 2024

andrewla commented Aug 19, 2024

pb8o commented Aug 29, 2024

andrewla commented Sep 3, 2024

andrewla commented Oct 10, 2024

pb8o commented Oct 16, 2024

kanpov commented Nov 1, 2024

pb8o commented Jan 8, 2025 •

edited

Loading

andrewla commented Jan 9, 2025

bchalios left a comment

bchalios Jan 10, 2025

andrewla Jan 10, 2025

bchalios Jan 13, 2025

andrewla Jan 13, 2025

bchalios Jan 16, 2025

pb8o Jan 13, 2025

andrewla Jan 13, 2025

pb8o Jan 14, 2025

andrewla Jan 14, 2025 •

edited

Loading

pb8o Jan 15, 2025 •

edited

Loading

andrewla Jan 15, 2025

andrewla Jan 15, 2025 •

edited

Loading

roypat Jan 15, 2025

andrewla Jan 15, 2025

roypat Jan 16, 2025

codecov bot commented Jan 15, 2025 •

edited

Loading

bchalios Jan 16, 2025

andrewla Jan 17, 2025


		This may require reconfiguration of the networking inside the VM so that it is
		still routable externally.

		/// The network devices to override on load.
		pub network_overrides: Vec<NetworkOverride>,

Allow snapshot tap changes #4731

Are you sure you want to change the base?

Allow snapshot tap changes #4731

Conversation

andrewla commented Aug 15, 2024 • edited Loading

Changes

Reason

License Acceptance

PR Checklist

pb8o commented Aug 19, 2024

andrewla commented Aug 19, 2024

pb8o commented Aug 29, 2024

andrewla commented Sep 3, 2024

andrewla commented Oct 10, 2024

pb8o commented Oct 16, 2024

kanpov commented Nov 1, 2024

pb8o commented Jan 8, 2025 • edited Loading

andrewla commented Jan 9, 2025

bchalios left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

andrewla Jan 14, 2025 • edited Loading

Choose a reason for hiding this comment

pb8o Jan 15, 2025 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

andrewla Jan 15, 2025 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

codecov bot commented Jan 15, 2025 • edited Loading

Codecov Report

Choose a reason for hiding this comment

Choose a reason for hiding this comment

andrewla commented Aug 15, 2024 •

edited

Loading

pb8o commented Jan 8, 2025 •

edited

Loading

andrewla Jan 14, 2025 •

edited

Loading

pb8o Jan 15, 2025 •

edited

Loading

andrewla Jan 15, 2025 •

edited

Loading

codecov bot commented Jan 15, 2025 •

edited

Loading