[01]
Shipped
Malware Detonation
Drop suspicious binaries or scripts into a bounded container. Every action is caught or allowed in real-time by the permission engine. Offline by default — no C2 callbacks during analysis.
$ agenticbox run security-analyst --network offline
[sandbox] pulling ubuntu:24.04... done
[sandbox] container f3a2c1 created
[agent] executing sample.sh...
ALLOWED exec /workspace/sample.sh
ALLOWED fs.write /workspace/output/
BLOCKED net.outbound — network is offline
BLOCKED fs.read /etc/shadow — protected path
[02]
Shipped
Credential Exfiltration Detection
FsGuard canonicalizes all paths and blocks access to SSH keys, AWS credentials, environment files. Symlink-based escapes and ../ traversal are prevented. Every blocked attempt is logged.
> resolve "/data/../etc/passwd"
canonicalize → /etc/passwd
DENIED root not in allowed
> resolve "~/.ssh/id_rsa"
DENIED protected: SSH credentials
roots: /workspace /tmp/agent
[03]
Shipped
C2 Observation Without Reach
Switch to allowlist mode to let the sample reach specific domains — C2 endpoints, malware databases — while blocking everything else. Observe what it tries to contact without giving it real network access.
$ agenticbox run security-analyst \
--network allowlist \
--domains "malware-bazaar.test"
[net] policy: allowlist
ALLOWED api.malware-bazaar.test
BLOCKED evil-c2.example.com
logged for IOC extraction
[04]
Shipped
AI Agent Red-Teaming
Run any AI agent inside a bounded container with scoped permissions. Test whether it can escape its boundaries — while the blast radius stays contained. Every tool call, file access, and network request is auditable.
$ agenticbox run hermes --fs readonly
[agent] model=claude-sonnet-4
ALLOWED read_file /workspace/target.py
ALLOWED exec nmap --version
BLOCKED write_file /etc/cron.d/persist
BLOCKED net → raw TCP socket
blast radius: contained
[05]
Shipped
Full Audit Trail
Every action — allowed or blocked — is logged with timestamp, action type, target, and policy decision. Exportable for incident response, compliance documentation, and forensic analysis.
audit.log — session 0x7e2a
14:31:02 ALLOW exec sample.sh
14:31:03 ALLOW fs.write /workspace/out
14:31:04 BLOCK net.outbound (offline)
14:31:05 BLOCK fs.read ~/.ssh/id_rsa
exportable · tamper-proof
[06]
Shipped
RE Toolchain Pre-Installed
The security-analyst profile ships with radare2, binwalk, YARA, capstone, pefile, ltrace, strace, nmap, and tcpdump. Fully customizable via TOML — add any tool.
$ radare2 /workspace/sample.bin
r2 5.9.0 — ready
$ yara scan /workspace/
yara 4.5.0 — rules loaded
$ binwalk /workspace/firmware.img
binwalk 2.4.0 — extracting...