Step by step: setting up a canary token inside an agent's sandbox.

Summarize Topic

Detecting Agent Exfiltration Attempts

Last Post by Olivia Park 5 days ago

2 Posts

2 Users

0 Reactions

5 Views

RSS

Rachel Green

(@container_sec_guy)

Eminent Member

Joined: 1 week ago

Posts: 16

Topic starter

Translate ▼

June 25, 2026 7:57 pm [#948]

A common misconception is that container isolation alone can prevent data exfiltration. While namespaces and cgroups provide boundaries, a compromised agent with outbound network access can still beacon or leak data. A supplementary detection layer involves deploying canary tokens within the agent's runtime environment to generate high-fidelity alerts on unauthorized access.

The principle is straightforward: place a fake resource (token) that has no legitimate reason to be accessed. When the agent's sandbox is probed or breached, any attempt to read this token triggers a monitoring alert. For an OpenClaw agent, we integrate this into its rootless container sandbox.

Here is a practical implementation using a file-based token and a simple eBPF monitor. First, we create the token within the container's filesystem at runtime:

```dockerfile
# In your Dockerfile for the agent
RUN mkdir -p /opt/canary
RUN echo "canary-$(uuidgen)" > /opt/canary/.token.db
RUN chmod 600 /opt/canary/.token.db
```

The critical step is to deploy a minimal eBPF program (using `bpftrace`) on the host to watch for read operations on this specific path from the agent's container. This must be scoped to the agent's specific process namespace.

```bash
# host-side monitor script (canary_watch.bt)
tracepoint:syscalls:sys_enter_openat,
tracepoint:syscalls:sys_enter_open
/str(args->filename) == "/opt/canary/.token.db"/
{
printf("CANARY_TOKEN_ACCESS: pid=%d, comm=%s, container_id=%sn",
pid, comm, str($1));
}
```

This script triggers on the `open` syscall for the token file. The key is filtering by the container's PID namespace. You would deploy this monitor when the agent's container is launched, providing the container's init PID as a filter.

* **Integration:** The token creation is part of the container build.
* **Deployment:** The eBPF monitor is attached from the host, scoped to the container's PID namespace.
* **Alerting:** The `printf` output is sent to your SIEM or log aggregator (e.g., Falco, Elastic) to create an immediate incident.

This method provides a clear, low-noise signal. It operates below the application layer, making it effective even if the agent's own logging is subverted. The token's presence is inert unless directly accessed, creating a high-confidence indicator of compromise when tripped.

Quote

Topic Tags

Olivia Park

(@appsec_reviewer)

Eminent Member

Joined: 1 week ago

Posts: 19

Translate ▼

June 25, 2026 10:42 pm

Your approach is fundamentally sound, but there's a significant deployment nuance with bpftrace you've left unfinished. Scoping the probe to the agent's container is the crux, and just using a path filter is insufficient if multiple instances are running.

You'd need to combine the inode from `stat()` on that container-specific `/opt/canary/.token.db` with a cgroup filter. A more reliable method is to attach the eBPF probe to a tracepoint like `sys_enter_openat` and filter by both the inode and the cgroup ID of the agent's container runtime. You can derive the cgroup ID from `/proc//cgroup` for a representative process inside the container.

Also, the token generation in the Dockerfile is static. Any access at runtime is already a breach, but for rotation across deployments, you'd need a runtime init script to regenerate it, ensuring each container instance has a unique token value not present in the base image layer.

ReplyQuote

80 Forums
1,190 Topics
7,241 Posts
0 Online
508 Members

Forum Icons: Forum contains no unread posts Forum contains unread posts

Topic Icons: Not Replied Replied Active Hot Sticky Unapproved Solved Private Closed