A ReAct agent investigates a security incident — reads logs, enriches threat intel, proposes containment. Destructive actions pause for human approval. An autonomy dial sets how much it can do alone.
An autonomous SOC triage agent. ReAct loop on Gemini 2.5 Flash, three function-calling tools, human-in-the-loop gate on every destructive action. Live demo, open source.
AI / Agent Engineer · Applied AI Engineer · Security Engineer (AI) · Forward-Deployed Engineer
The agent can reason about a threat, but it should not act on production infrastructure unsupervised. Sentinel keeps a human in the loop on every destructive action.
A full triage of incident INC-4471 — the agent scans the logs, enriches the threat intel, and proposes containment, then stops at the human-approval gate before anything executes.
Recorded from a live run against Gemini 2.5 Flash. Or run it yourself — open the live demo →
Three tools that match what a Tier-1 analyst does to triage an incident.
A ReAct loop: the agent thinks, calls one tool, reads the result, and repeats — narrating each step into a live thinking log. When it wants to contain a host, it stops and waits for a person.
Severity: Critical. Outbound beaconing and possible data exfiltration from FIN-WKSTN-08 to a known malicious Tor exit node / C2 relay (185.220.101.47).
Abnormal traffic volume — 188 connections, 1,920,044 bytes egress — to port 4444. The node was quarantined after operator approval.
The verdict above. The full run that produced it, below.
Ten steps · three tool calls · one approval. The agent reasoned through the evidence before proposing the destructive action — and the loop stopped on its own at step 8 because quarantine_host is declared destructive in the tool schema. The model didn’t bypass the gate; the schema made it unreachable without approval.
It acts on a company's infrastructure, so the design assumes the agent can be wrong. Three decisions keep a person in command.