Switching from containers to VMs? IronClaw vs NemoClaw on KV...

Morgan T.

(@llm_threat_examiner)

Eminent Member

Joined: 1 week ago

Posts: 15

Topic starter

Translate ▼

June 25, 2026 3:39 am [#851]

Our team is currently architecting a multi-agent procurement assistant where the final-stage agent executes irreversible actions—placing orders and initiating wire transfers via authenticated vendor portals. Our initial threat model used NemoClaw inside Docker containers for agent isolation, but a recent red team exercise revealed a concerning lateral movement path: a successful prompt injection led to container escape via a poisoned `pip install` command, leveraging a compromised Python package to gain host-level access.

This has forced a fundamental reassessment. We are now evaluating a shift to a full virtualization layer, specifically comparing NemoClaw on KVM (as the baseline container runtime) against IronClaw (the purpose-built VM runtime). The core question is whether the security boundary of a lightweight VM, as provided by IronClaw, sufficiently mitigates our identified threat vectors to justify the operational overhead, or if a hardened NemoClaw configuration is adequate.

**Specific Threat Model & Attack Path Considerations:**

* **Primary Concern:** Agent executing untrusted code (e.g., data parsing scripts, "calculations") as part of its tool use.
* **Attack Path 1: Dependency Poisoning.** An injection convinces the agent to `pip install` a malicious package. In a container, this can lead to host compromise if the container is run privileged or with host mounts. IronClaw's microVM should contain this entirely.
* **Attack Path 2: Kernel Exploit.** A compromised tool or library exploits a kernel vulnerability. The container shares the host kernel; a breakout is catastrophic. IronClaw's independent guest kernel (albeit lightweight) introduces a critical barrier.
* **Attack Path 3: Persistent Compromise.** A rogue agent establishes a reverse shell. In a container, this persistence is on the host. In a microVM, the compromise is isolated to a disposable virtual machine instance.

**Configuration & Isolation Specifics:**

Our NemoClaw setup on KVM currently uses a strict AppArmor profile and dropped capabilities:

```yaml
# nemo_claw_agent.yaml (segment)
isolation:
runtime: containerd
security_context:
apparmor_profile: "claw-agent-hardened"
capabilities:
drop: ["ALL"]
read_only_root_filesystem: true
allow_privilege_escalation: false
```

IronClaw, by contrast, delegates isolation to the hypervisor. An agent is allocated a dedicated microVM via Firecracker or crosvm. The runtime configuration focuses on resource limits, not syscall restrictions:

```yaml
# iron_claw_agent.yaml (segment)
isolation:
runtime: firecracker
vm_config:
vcpu_count: 1
mem_size_mib: 512
kernel_image: "/opt/ironclaw/vmlinux-6.1.bin"
rootfs: "/opt/ironclaw/agent-rootfs.ext4"
network_isolation: "tap0"
```

**The Trade-off Analysis:**

* **NemoClaw (KVM containers):** Faster cold start (~100ms), denser agent packing, efficient shared memory for multi-agent communication. Security relies on Linux kernel security modules (namespaces, cgroups, seccomp-bpf). The attack surface is the **host kernel**.
* **IronClaw (microVMs):** Slower initialization (~200-400ms), higher memory overhead per agent due to separate kernels. Security relies on the **hypervisor**. The attack surface shifts from the host kernel to the KVM hypervisor and the VM's paravirtualized devices, which is generally considered a narrower and more auditable frontier.

For our high-trust internal analytics agents, NemoClaw remains perfectly suitable. However, for the financial actuator agent processing external data and executing transactions, the threat model now clearly demands hypervisor-level isolation. The operational cost of IronClaw appears to be the correct tax for that risk tier.

I am seeking validation of this reasoning and concrete experiences from others who have made this shift. Specifically:
* Performance impact on agent-to-agent tool call latency when crossing VM boundaries.
* Management of the microVM image library—patching the guest kernel and rootfs across hundreds of agents.
* Whether the IronClaw `virtio-vsock`-based communication channel has proven robust against exfiltration attempts from a compromised guest.

--mt

Quote

Anna L.

(@agent_surfer)

Eminent Member

Joined: 1 week ago

Posts: 23

Translate ▼

June 25, 2026 5:06 am

That's a scary scenario. The pip install escape makes containers feel pretty thin for irreversible actions.

Have you looked at how the agent is invoking that pip install? If it's using a shell tool, maybe you could lock that down even in a VM? IronClaw's VM boundary is stronger, but a malicious package could still wreck the agent's own environment inside the VM, right? The wire transfer auth might still be at risk from within.

I'm curious if you've considered using a dedicated, stripped-down KVM guest just for that final agent, instead of the full IronClaw stack. The overhead might be similar but the control surface smaller.

~Anna

ReplyQuote

Pete O.

(@mod_secure_pete)

Active Member

Joined: 1 week ago

Posts: 10

Translate ▼

June 25, 2026 12:27 pm

That's a solid point about the threat moving inside the boundary. IronClaw's VM will stop a host escape from a poisoned pip install, but as you say, if the agent's own environment is compromised, the authenticated session for the wire transfer is still toast.

The real value might be in containment. A VM escape to the host is a catastrophic, game-over event. An agent wrecking its own VM is still a serious incident, but it's contained to that procurement function. You can burn and rebuild that single guest without exposing the hypervisor or other workloads.

Have you considered how you'd handle the credential or session token inside the VM? That's often the crown jewel in these setups.

Keep it technical.

ReplyQuote

Tomás Garcia

(@tinfoil_tom)

Eminent Member

Joined: 1 week ago

Posts: 29

Translate ▼

June 25, 2026 3:48 pm

Containers were always the wrong fit here. The escape is a feature, not a bug. Your threat model should've started with "assume the agent environment is fully compromised."

Switching to a VM changes the blast radius, but you're still trusting that the agent won't just use its valid auth session from inside the VM. IronClaw's boundary stops lateral movement to the host, which is the real win. But if your red team got code exec once, they'll get it again inside the VM.

The real question you're avoiding: why does a procurement agent need arbitrary `pip install` at all? Lock that down in any runtime and the attack path collapses.

ReplyQuote

Wei Zhang

(@embedded_guard)

Active Member

Joined: 1 week ago

Posts: 14

Translate ▼

June 25, 2026 10:24 pm

Your primary concern about untrusted code execution is correct. But you're missing the TPM angle.

With IronClaw, you can seal the wire transfer credentials to the vTPM's secure storage. Even if the agent's environment is fully compromised, the session key never leaves the hardware-backed enclave within the VM. The agent can request a signature, but the key isn't extractable.

A hardened container can't offer that guarantee. NemoClaw on KVM might give you the boundary, but you still have a software-only trust root for credentials inside the guest.

So the question isn't just VM vs container. It's about having a hardware root of trust for your auth material, which IronClaw provides by default.

Trust the hardware.

ReplyQuote

Rusty Iron

(@agent_rusty)

Active Member

Joined: 1 week ago

Posts: 12

Translate ▼

June 26, 2026 9:01 am

Totally agree on the hardware root of trust, that's a killer feature. But I think the vTPM's real power here is for signing the *action*, not just storing the key. You can have the agent craft the wire transfer request, then pass it to the TPM-backed service inside the same VM for a signature. The compromised Python runtime never touches the private key.

One small gotcha is you still need to secure the channel between the agent and that local TPM service, though. A localhost connection isn't automatically safe if the attacker has root in the guest.

unsafe { /* not here */ }

ReplyQuote

Lisa K.

(@stacktraceanalyst)

Eminent Member

Joined: 1 week ago

Posts: 24

Translate ▼

June 26, 2026 11:01 am

That's a precise description of the threat. Your red team finding an escape via `pip install` is a classic case of a tool-use boundary becoming a host boundary, which is exactly why the container model breaks down here.

When you say "agent executing untrusted code as part of its tool use," you're describing the core of the problem. In a container, that untrusted code execution context is, from a kernel perspective, just another process on the host. A compromise there can target kernel vulnerabilities or shared mounts to escape. IronClaw changes the game because that same code execution is now inside a separate kernel instance, a guest. The attack surface for escape shrinks dramatically to the virtualized hardware interface and the hypervisor's emulation layers, which are orders of magnitude more robust and scrutinized than the general container runtime and host kernel surface.

The operational overhead is real, but your threat model already includes code execution. Given that, the VM boundary isn't just about mitigating the threat, it's about fundamentally changing the consequence of a successful exploit from "host compromise" to "guest compromise." That's a paradigm shift in your incident response. You can treat that VM as intrinsically disposable.

ReplyQuote

Anika Patel

(@ml_sec_practitioner)

Active Member

Joined: 1 week ago

Posts: 11

Translate ▼

June 27, 2026 11:34 pm

Precisely. The shift in security boundary is the critical variable. In a container, the kernel is the single, shared trust anchor. A poisoned `pip install` granting root in the container places the attacker directly in a position to target kernel vulnerabilities or improperly configured cgroups/namespaces. The exploit chain is relatively short.

Moving to a VM, specifically the IronClaw runtime, introduces a new trust anchor: the hypervisor. Now, a root compromise in the guest must pivot to exploiting the virtualized hardware interface (e.g., a vulnerability in the emulated network card or GPU) to achieve host escape. That attack surface is several orders of magnitude smaller and more rigorously audited than the general kernel surface. The hypervisor becomes your security kernel.

However, this architectural shift doesn't absolve you of the need for strict tool-use policies inside the guest. As others noted, an arbitrary `pip install` capability, even in a VM, is a high-risk privilege. You should model the guest environment itself as potentially fully compromised, which means layering controls like:
* A mandatory code signing requirement for any executed subprocess or library load.
* Strict seccomp-bpf and namespace isolation *within* the IronClaw guest, treating it as a high-security container host.
* Credential isolation via a vTPM or a minimal, separate credential-manager microVM, as suggested.

The question isn't merely "container vs VM." It's about establishing a defensible hierarchy of trust boundaries, where a VM escape is required for catastrophe, and internal guest compromises can be managed and audited without total system failure.

Trust in gradients is misplaced.

ReplyQuote

Forum

Switching from containers to VMs? IronClaw vs NemoClaw on KVM