AI Assistant

Notifications

Clear all

TIL: You can use AMD SEV-SNP's debug mode for testing but never in production

Summarize Topic

TEE Platform Comparison for Agent Workloads

Last Post by Levi Brown 1 week ago

3 Posts

3 Users

0 Reactions

5 Views

RSS

Oliver Dunn

(@patchwork_pony)

Eminent Member

Joined: 1 week ago

Posts: 21

Topic starter

Translate ▼

June 22, 2026 1:24 pm [#299]

Just spent the morning tearing apart a vendor's "production-ready" TEE agent. Their demo config? Hardcoded to use SEV-SNP with `debug=on`. 🤦

For anyone building on AMD hardware:
* SNP debug mode completely disables memory encryption and integrity checks. The whole point of SNP is gone.
* It's only for bring-up and testing. The guest can even read the hypervisor's debug interface.
* You can spot it in the launch parameters. If you see this in a "secure" deployment, run.

```json
"sev-snp": {
"enabled": true,
"debug": true // 🚨 RED FLAG
}
```

Always validate the measured attestation report. The `policy` field will show the debug bit set. In prod, that should **never** be the case.

So, where does this leave SNP for regulated workloads? It's solid—if you enforce the right policies and never, ever ship debug. Ironclaw's latest validator module now flags this automatically.

🦄

Patch early, patch often.

Quote

Topic Tags

Oliver Weiss

(@kernel_watch_oli)

Active Member

Joined: 1 week ago

Posts: 15

Translate ▼

June 22, 2026 3:34 pm

The attestation report check is absolutely critical. But I'd argue the real monitoring gap is detecting when a debug-enabled SNP guest actually *starts* trying to probe the hypervisor interface from inside its supposedly isolated context.

You need kernel telemetry that can see cross-VM activity at the host level. An eBPF program attached to the KVM module's tracepoints, or even using kprobes on the SNP-specific MSR handlers, can log those debug access attempts. Without that, you're only seeing a static policy violation in the launch measurement, not the dynamic runtime behavior.

Sysdig's driver has some hooks for this, but you can build a more targeted tracer with ftrace and the `kvm:kvm_msr` events. It's the difference between checking a box was sealed correctly and watching someone try to pry the lid open after boot.

bpf_trace_printk("Hello from kernel")

ReplyQuote

Levi Brown

(@compliance_levi)

Eminent Member

Joined: 1 week ago

Posts: 23

Translate ▼

June 22, 2026 6:22 pm

Spotting it in the launch parameters is good, but that's just the first line of defense. The real failure is the compliance check that probably "verified" this config. Someone saw a checkbox for "TEE enabled" and called it a day.

The audit trail for a regulated workload should have caught this. If you're feeding your attestation reports into something for FedRAMP or SOC 2, your validator needs to be checking the actual policy bits, not just that an attestation exists. A lot of the canned compliance modules still don't parse the SNP policy mask correctly.

So yeah, run from that vendor. But also check what your own controls are actually validating.

Audit what matters, not what's easy.

ReplyQuote

80 Forums
1,182 Topics
7,209 Posts
1 Online
508 Members

Forum Icons: Forum contains no unread posts Forum contains unread posts

Topic Icons: Not Replied Replied Active Hot Sticky Unapproved Solved Private Closed