Your SIEM Can't See the Agent
OpenClaw exposed a new threat class that WAFs, IDS, and SIEMs were never built to detect. Here's why autonomous AI agents are invisible to your current stack — and what visibility actually requires.

March 1, 2026 — Cygient Research
There is a threat class operating against enterprise infrastructure right now that your existing tooling was never built to detect. It does not show up in firewall logs the way you expect. It does not trip rate limits the way a script would. It does not behave like a human, and it does not behave like a bot. It behaves like nothing your detection rules were written for — because when those rules were written, autonomous AI agents did not exist as a production-grade attack vector.
OpenClaw changed that.
The OpenClaw Moment
In January 2026, OpenClaw — an open-source personal AI assistant that runs locally and connects to messaging platforms like Slack, WhatsApp, and Telegram — went from roughly 1,000 publicly exposed instances to more than 21,000 in under a week. Most of those users had exposed their Gateway service directly to the public internet, without authentication, on a default WebSocket port.
Attackers were exploiting it before most users had finished setting it up.
Researchers at Pillar Security deployed a honeypot mimicking an OpenClaw Gateway and observed protocol-aware exploitation within minutes of the instance appearing online. The attack pattern was deliberate and methodical: enumerate the Gateway → downgrade the connection → impersonate a legitimate client → execute commands via JSON-RPC → steal tokens → establish persistence. No brute-force noise. No anomalous volume. No behavioral signature that a SIEM tuned to human adversarial behavior would catch.
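The stealth of that sequence is easier to see in payload form. In the sketch below, the method names are hypothetical, not the actual OpenClaw protocol: the point is that a hostile command-execution call and a routine client call share the same well-formed JSON-RPC 2.0 envelope, so payload inspection alone has nothing to flag.

```python
import json

def jsonrpc_call(method: str, params: dict, call_id: int) -> str:
    """Build a JSON-RPC 2.0 request: the same envelope legitimate clients use."""
    return json.dumps({"jsonrpc": "2.0", "id": call_id,
                       "method": method, "params": params})

# A routine client call and a hostile one are structurally identical;
# only the semantics of method/params differ (names here are illustrative).
legit = jsonrpc_call("session.list", {}, 1)
hostile = jsonrpc_call("system.exec", {"cmd": "cat ~/.config/tokens"}, 2)

# Signature-based inspection sees two valid envelopes with identical keys.
assert set(json.loads(legit)) == set(json.loads(hostile))
```

Nothing in the hostile request is malformed, oversized, or anomalous in volume, which is why signature and threshold rules stay silent.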
Then Oasis Security published ClawJacked — a zero-click vulnerability chain that allows any malicious website to silently take full control of a developer's OpenClaw agent with no user interaction required. Three weaknesses in sequence: browsers do not restrict WebSocket connections to localhost; the OpenClaw Gateway's rate limiter explicitly exempted localhost connections, allowing hundreds of password guesses per second from browser JavaScript; and once authenticated, the system auto-approved new device registrations from localhost without prompting the user. Full agent compromise. API keys, credentials, integrated service access, arbitrary command execution — all from a single page visit.
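The rate-limit flaw at the center of that chain is a one-line policy error: an exemption for 127.0.0.1. A minimal sketch of the corrected behavior, using a generic fixed-window limiter rather than OpenClaw's actual implementation, treats localhost like any other client:

```python
import time
from collections import defaultdict
from typing import Optional

class AuthRateLimiter:
    """Fixed-window limiter applied to every source, localhost included.
    The exemption ClawJacked abused is exactly what this sketch refuses to add."""

    def __init__(self, max_attempts: int = 5, window_s: float = 60.0):
        self.max_attempts = max_attempts
        self.window_s = window_s
        self._attempts = defaultdict(list)  # client address -> attempt timestamps

    def allow(self, client: str, now: Optional[float] = None) -> bool:
        now = time.monotonic() if now is None else now
        # Drop attempts that have aged out of the window, then check the budget.
        recent = [t for t in self._attempts[client] if now - t < self.window_s]
        self._attempts[client] = recent
        if len(recent) >= self.max_attempts:
            return False
        self._attempts[client].append(now)
        return True

limiter = AuthRateLimiter(max_attempts=5, window_s=60.0)
# 127.0.0.1 gets no special treatment: the sixth attempt in a window is refused,
# which is what stops browser JavaScript from guessing hundreds of passwords per second.
results = [limiter.allow("127.0.0.1", now=float(i)) for i in range(6)]
```

The fix is not clever; the vulnerability was the assumption that localhost traffic is trusted traffic.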
The Oasis team classified this as high severity. OpenClaw shipped a patch within 24 hours. But by then, Shodan had indexed more than 312,000 instances running on the default port with little or no protection, and intelligence reporting confirmed active credential theft operations, multi-platform message interception, and information-stealing malware being distributed through Telegram channels.
This is not a vulnerability story. This is a category story.
What OpenClaw Actually Revealed
The specific vulnerabilities in OpenClaw — the unprotected Gateway, the localhost rate-limit bypass, the auto-approval flow — are fixable. They will be fixed. They are already being fixed.
What is not fixable is the structural reality that OpenClaw exposed: when AI agents become infrastructure, they become attack surface. And that attack surface does not look like anything your security team has a playbook for.
The Acronis threat research team's analysis of the OpenClaw ecosystem surfaced three categories of adversarial behavior that are worth sitting with:
Supply chain compromise at the skills layer. A malicious VS Code extension impersonating the legitimate OpenClaw agent installed ConnectWise ScreenConnect as a remote-access implant. Fourteen separate malicious "skills" on ClawHub — OpenClaw's extension marketplace — harvested browser data and cryptocurrency wallet credentials through obfuscated terminal commands. The attack surface was not the model. It was the ecosystem around it.
Ecosystem impersonation. Typosquat domains and cloned GitHub repositories impersonating the project established trust before deploying future payloads. Standard social engineering, applied to a new supply chain.
Control plane exploitation. Attackers largely skipped prompt injection — the attack vector that gets the most attention in AI security discussions — and went straight for the control plane. JSON-RPC payloads to an unauthenticated administrative API are faster, more reliable, and require no model-specific knowledge. When the control plane is the vulnerability, model-level defenses are irrelevant.
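The corresponding defense is equally unglamorous: authenticate every control-plane call before dispatch, regardless of origin. A hedged sketch, using a hypothetical token scheme rather than OpenClaw's real one:

```python
import hmac

# Hypothetical registered device tokens; a real system would store hashed secrets.
VALID_TOKENS = {"example-device-token"}

def dispatch(request: dict) -> dict:
    """Authenticate before dispatch: every control-plane method, every origin."""
    token = request.get("params", {}).get("auth_token", "")
    # Constant-time comparison avoids leaking token prefixes via timing.
    authorized = any(hmac.compare_digest(token, t) for t in VALID_TOKENS)
    if not authorized:
        return {"jsonrpc": "2.0", "id": request.get("id"),
                "error": {"code": -32001, "message": "unauthorized"}}
    return {"jsonrpc": "2.0", "id": request.get("id"), "result": "ok"}

# An unauthenticated admin call is refused before any method logic runs.
denied = dispatch({"jsonrpc": "2.0", "id": 1, "method": "system.exec", "params": {}})
```

When the gate sits in front of the dispatcher itself, there is no unauthenticated administrative surface left for a JSON-RPC payload to reach.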
The threat actors involved, per reporting from Flare's research team, were not lone operators. Multiple threat groups demonstrated sophisticated understanding of the OpenClaw architecture within weeks of widespread adoption.
The Visibility Problem
Here is the specific operational problem for security teams: OpenClaw-class threats are largely invisible to existing tooling.
Your WAF is looking for malicious HTTP requests. OpenClaw's attack surface is a WebSocket endpoint — a different protocol, a different behavioral pattern, a different set of signatures that most WAF rules do not cover.
Your IDS is looking for known-bad traffic. The JSON-RPC payloads used in Gateway exploitation are structurally identical to legitimate API calls. Without behavioral context — what is the agent doing, in what sequence, across what session — there is no signal to catch.
Your SIEM is looking for anomalous volumes and human behavioral patterns. Autonomous agents do not deviate from their playbook under pressure. They do not get nervous. They do not make the timing mistakes that humans make. The behavioral baseline your detection engineering was built on does not apply.
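What the missing behavioral context would look like in practice: treat each session as a sequence of method calls and flag the attack playbook as an ordered subsequence, even though every individual call is legitimate on its own. A simplified sketch with hypothetical method names:

```python
# Hypothetical method names mirroring the honeypot-observed sequence.
ATTACK_SEQUENCE = ["gateway.info", "protocol.downgrade", "device.register", "system.exec"]

def matches_playbook(session_methods: list, playbook: list = ATTACK_SEQUENCE) -> bool:
    """True if the playbook appears in order (as a subsequence) within one session.
    Each call alone is a valid API request; only the sequence is the signal."""
    it = iter(session_methods)
    # Membership tests on the iterator consume it, enforcing in-order matching.
    return all(step in it for step in playbook)

benign = ["gateway.info", "session.list", "message.send"]
hostile = ["gateway.info", "protocol.downgrade", "session.list",
           "device.register", "system.exec"]
```

A production detector would add timing, session identity, and tolerance for variant orderings, but the unit of analysis is the point: the sequence, not the packet.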
The Bitsight analysis of exposed OpenClaw instances found that attack attempts appeared within minutes of a honeypot instance going online. The attackers knew exactly what port to look for, exactly what the Gateway protocol looked like, and exactly how to exploit a misconfigured instance. That is not opportunistic scanning. That is a playbook, running on infrastructure that is faster and more consistent than any human red team.
Your detection stack was not designed for an adversary that is faster and more consistent than any human.
The Broader Trend OpenClaw Is Pointing To
OpenClaw is not an outlier. It is a preview.
The same forces that drove 21,000 exposed OpenClaw instances in a week are driving AI agent deployment across the enterprise broadly. OpenAI Operator, Claude Computer Use, LangChain, AutoGPT, and dozens of enterprise-grade frameworks are putting autonomous agents into production at a pace that security reviews cannot keep up with. By current projections, AI agent-generated traffic on enterprise APIs will surpass human-generated traffic before the end of the decade.
When that happens, every assumption baked into your current detection architecture — that traffic is generated by humans, that behavioral anomalies reflect human decision-making under pressure, that the entity on the other end of a session is accountable to human-scale constraints — will be wrong more often than it is right.
OpenClaw is the first clear signal. The security incident that makes this a boardroom conversation is coming. The question is whether you have visibility before or after it hits your infrastructure.
What Visibility Actually Requires
The OpenClaw story illustrates what AI agent security requires and traditional tooling does not provide: behavioral intelligence, not just signature matching.
Detecting an adversarial agent is not the same as detecting a known-bad IP or a malicious payload. It requires understanding the behavioral signature of the agent framework it is running on — the specific way LangChain agents construct tool calls, the characteristic timing patterns of AutoGPT sessions, the protocol fingerprint that OpenAI Operator leaves on every interaction. These are identifiable signatures, if you know what you are looking for.
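One such signature is timing regularity: autonomous agents tend to pace requests far more evenly than humans do. A toy classifier, with an illustrative (uncalibrated) threshold on the coefficient of variation of inter-request gaps:

```python
import statistics

def looks_machine_paced(timestamps: list, cv_threshold: float = 0.3) -> bool:
    """Classify a session as machine-paced when inter-request gaps are too regular.
    The 0.3 threshold is illustrative, not a calibrated value."""
    gaps = [b - a for a, b in zip(timestamps, timestamps[1:])]
    if len(gaps) < 2:
        return False  # not enough requests to measure pacing
    mean = statistics.mean(gaps)
    if mean == 0:
        return True
    cv = statistics.stdev(gaps) / mean  # coefficient of variation
    return cv < cv_threshold

agent_like = [0.0, 1.01, 2.0, 3.02, 4.0, 5.01]  # near-constant cadence
human_like = [0.0, 2.3, 2.9, 7.4, 8.1, 15.0]    # bursty, irregular
```

Real framework fingerprinting layers many such features (tool-call structure, header ordering, retry behavior), but even this single dimension separates the two sessions above.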
More importantly, adversarial agents have a structural property that makes them uniquely vulnerable to deception: they follow instructions literally. A well-designed deception environment looks indistinguishable from a legitimate target to an agent operating on instructions. The agent will interact with the decoy fully and faithfully — and in doing so, will disclose everything: its model, its mission, its tools, its operator's intent.
That is the intelligence advantage that no amount of signature-matching can replicate.
What We Built
Cygient's deception intelligence platform was designed from first principles around this threat class.
We operate a global sensor network — deception servers presenting different personas: financial APIs, identity providers, credential stores, cloud infrastructure endpoints. Adversarial agents find them organically, through the same discovery mechanisms they use to find real targets. Every session generates structured intelligence: the model and framework behind the session, the system prompt artifacts that reveal the agent's mission, the specific TTPs in use, and whether the session is part of a coordinated campaign.
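As a rough illustration of what per-session structured intelligence could look like (field names here are assumptions, not Cygient's actual schema):

```python
from dataclasses import dataclass, field
from typing import Optional

@dataclass
class SessionIntelligence:
    """Illustrative record shape for one decoy session; hypothetical fields."""
    model: str                     # inferred base model family behind the session
    framework: str                 # inferred agent framework (e.g. LangChain-style)
    prompt_artifacts: list = field(default_factory=list)  # leaked system-prompt fragments
    ttps: list = field(default_factory=list)              # observed techniques
    campaign_id: Optional[str] = None  # set when correlated with other sessions

record = SessionIntelligence(model="unknown-llm", framework="langchain-style",
                             ttps=["gateway-enumeration", "token-theft"])
```

The value of a record like this is relational: a shared campaign identifier across sessions on different sensors is what turns isolated probes into a mapped operation.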
This is not a research exercise. The platform is running. The intelligence is being generated. And the behavioral taxonomy we have built from direct analysis of every major AI agent framework in production today has no equivalent in any existing threat feed.
OpenClaw was a preview of the threat landscape that is forming. We built the infrastructure to see it clearly — and to give security teams the visibility they need before the next one.
If you are a CISO or security engineering leader who wants to understand what adversarial AI agents are currently doing against your infrastructure, we want to talk to you.
[Request Early Access] → cygient.com
References
OpenClaw Security: Risks of Exposed AI Agents Explained — Bitsight
OpenClaw: Agentic AI in the Wild — Architecture, Adoption and Emerging Security Risks — Acronis TRU
ClawJacked: OpenClaw Vulnerability Enables Full Agent Takeover — Oasis Security
What Security Teams Need to Know About OpenClaw — CrowdStrike
Widespread OpenClaw Exploitation by Multiple Threat Groups — Flare
OpenClaw proves agentic AI works. It also proves your security model doesn't. — VentureBeat