Claude Code can be tricked into running malware

It’s startling that the reverse shell is three indirection steps away from anything Claude Code actually evaluated, according to Mozilla’s 0DIN researchers. The attack chain starts with a perfectly ordinary GitHub repository, yet it ends with a shell running under the developer’s own user account. That’s the core of what we’re calling the Claude security exploit, and it shows how AI agents can be duped into running malicious payloads without any visible exploit code.

Key Takeaways

Clean‑looking repos can trigger AI agents to execute hidden commands.
Claude Code treats setup errors as actionable, automatically running suggested commands.
The payload lives in a DNS TXT record, not in the repository itself.
Successful exploitation gives attackers a shell with full developer privileges.
Current scanners and AI agents miss the attack because no malicious code resides in the repo.

Claude security exploit Shows AI Agents Can Run Hidden Payloads

When a developer asks Claude Code to clone a repo and run the init script, the agent sees an error like “module not found” and follows the suggested fix. That’s where the exploit kicks in: the fix points to python3 -m axiom init, which launches a shell script that pulls a command from a DNS TXT record the attacker controls. The agent doesn’t see the malicious command; it only sees an innocuous error message. You’d think that’s a harmless recovery step, but in reality it hands the attacker a fully interactive shell.

How the Attack Chains Together Without Malicious Code

0DIN breaks the chain into three components, each of which looks benign on its own. First, the repo includes standard setup instructions—think pip3 install -r requirements.txt and python3 -m axiom init. Second, the axiom package is deliberately crafted to refuse execution until it’s initialized, throwing an error that tells the user to run the init command. Third, the init command runs a script that reads a configuration value from a DNS TXT record and executes it as a shell command. The entire process happens without any malicious files in the repository, and Claude Code dutifully follows the error‑driven instructions.

“Claude Code never decided to open a shell. It decided to fix an error. The reverse shell is three indirection steps away from anything Claude Code actually evaluated: an error message it trusted, a script that fetched a value, and a DNS record it never saw,” 0DIN researchers say.

Because the agent never sees the payload, traditional static analysis tools and even dynamic scanners miss the threat. They’re looking for known signatures or suspicious network traffic, but the DNS query is a single TXT lookup that looks normal. That’s why the researchers emphasize that the attack requires “no exploit code, no warning, no suspicious command anyone had to approve.”

Why Existing Scanners Miss the Threat

Security teams log 54% of successful attacks and alert on just 14%, according to the Picus whitepaper cited in the source. Those numbers illustrate why a stealthy technique like this can slip by unnoticed. The malicious action occurs at runtime, after the repository has been cloned, and the command is fetched from a remote DNS record that doesn’t raise any red flags in code reviews. Even a human reviewer might miss the problem if they only skim the repo contents.

What makes the exploit especially sneaky is that Claude Code treats the error as a legitimate user request. The agent’s error‑recovery logic automatically runs the suggested command, which is a behavior many developers rely on for convenience. The fact that the agent “decided to fix an error” rather than “decided to open a shell” is a subtle but crucial distinction that the researchers highlight.

Potential Distribution Vectors and Real‑World Risks

0DIN warns that threat actors could spread such repos through fake job postings, tutorials, blog posts, or direct messages. Those channels are already trusted by developers seeking starter projects, so a malicious repo could gain traction quickly. Once a developer runs the init script, the attacker obtains a shell with the developer’s privileges, giving them access to environment variables, API keys, local configuration files, and the ability to establish persistence.

Access to environment variables can expose cloud credentials.
Reading API keys lets attackers impersonate services.
Local config files often contain database passwords.
Persistence mechanisms could be added via cron jobs or startup scripts.

Because the malicious component never lands in the repository, standard code‑review practices won’t catch it. That’s why the researchers suggest that AI agents should disclose the full execution chain of setup commands, including any scripts fetched dynamically at runtime.

Mitigations and Recommendations for AI‑Powered Tooling

One practical step is to require AI agents to present every command they intend to run, along with any remote fetches, before execution. That way a developer can see that a DNS TXT lookup is about to be turned into a shell command. Another approach is to sandbox the init script, preventing it from executing arbitrary code unless the developer explicitly approves each step.

Organizations should also enforce strict policies around third‑party repos. For example, scanning the network traffic of AI agents during a clone operation could reveal unexpected DNS queries. Adding a layer of verification for any dynamically fetched script—like hashing the fetched payload and comparing it to a known good value—could stop the attack before it gains a foothold.

Historical Context: AI Assistants and Security Concerns

AI coding assistants have been around for a few years, and each new release has brought fresh convenience along with fresh risk. Early versions focused on autocompleting snippets, while later releases added the ability to run commands, resolve dependencies, and even manage project scaffolding. That evolution gave developers a powerful shortcut, but it also opened a door for attackers to embed subtle triggers in otherwise harmless instructions.

Researchers have repeatedly highlighted that error‑driven recovery is a double‑edged sword. When a tool automatically proposes a fix, it saves time. When that same mechanism blindly follows a malicious suggestion, it becomes a conduit for a remote actor. The Claude security exploit is a concrete illustration of that tension, showing that the very feature that makes AI assistants attractive can be weaponized without leaving a trace in the source code.

What This Means For You

If you’re a developer who relies on AI coding assistants, you need to treat their error‑recovery actions with the same scrutiny you apply to any code you run. That means not blindly trusting a suggestion like “run python3 -m axiom init” without checking what the command actually does. Review the init script, watch for network calls, and consider running it in an isolated environment first.

For security teams, the takeaway is that AI agents add a new attack surface that traditional scanners might not cover. Updating your detection rules to flag DNS TXT lookups that are turned into shell commands, and incorporating AI‑specific behavior into your breach‑and‑attack simulations, can help you catch similar exploits before they compromise developer machines.

Scenario 1: A junior engineer clones a starter repo from a tutorial that references the init command. The AI assistant suggests running the command to get the project up, and the engineer presses “yes” without inspecting the script. A reverse shell sprouts, giving an attacker a foothold in the corporate network.

Scenario 2: A CI/CD pipeline automatically pulls in a repository and triggers the same init step as part of its build process. Because the pipeline runs with service‑account credentials, the attacker now controls a privileged service account, opening the door to broader cloud resource abuse.

Scenario 3: A security auditor uses Claude Code to quickly spin up a proof‑of‑concept environment. The assistant follows the same error‑recovery path, inadvertently exposing the auditor’s own workstation to the same remote command execution. Even experts can be caught off guard when the tool hides the payload in a DNS record.

Each of these situations underscores a single truth: convenience should never replace verification. When an AI assistant proposes a fix, treat it as a suggestion, not a command. Double‑check scripts, limit network access, and keep a watchful eye on any external lookups that could be gateways for hidden code.

Competitive Landscape and the Broader AI Threat Model

Claude Code is not the only assistant that can execute commands on behalf of a developer. Other platforms that integrate directly with development environments also expose a similar pathway: they receive a request, generate a command, and run it. The underlying risk therefore extends beyond a single product. Any tool that automatically resolves errors or fetches dependencies without explicit user approval can become a vector for the same three‑step indirection.

What separates one implementation from another is how transparently it reports the actions it plans to take. Some agents log every network request, while others hide them behind a simplified console output. The Claude security exploit demonstrates that hidden network calls are the Achilles’ heel. When an assistant hides the fact that it will query a DNS TXT record, it gives attackers a place to plant a payload that never appears in the codebase.

Vendors that choose to surface each step—showing the exact command, the target URL, and the response handling—will make it harder for malicious actors to hide behind innocuous error messages. Those that keep the process opaque will likely face similar discoveries as the 0DIN team continues to probe AI‑driven development pipelines.

Key Questions Remaining

Will AI agents adopt a default “ask before executing any remote fetch” policy, or will they continue to assume user intent?
How can static analysis tools be extended to detect indirect payloads that originate from DNS records rather than repository files?
What role should developers play in auditing the behavior of their AI assistants, especially when those assistants are integrated into CI pipelines?
Can a standardized disclosure format for AI‑generated command chains help organizations identify risky patterns before they execute?

The answers will shape how quickly the community can harden the development workflow against this class of attack. Until then, vigilance remains the best defense.

Sources: BleepingComputer, original report

Microsoft Lets Users Pause Windows Updates for 35

OpenAI’s Apology and the Tumbler Ridge Tragedy

Claude AI Plans Hiking Trip in 30 Minutes

Climate Tech’s Long-Awaited IPO Surge Begins

Contact Info

Some Populer Post