The Claude Code Leak
What Happened
On March 31, 2026, Anthropic shipped Claude Code version 2.1.88 to npm. Bundled inside was a 59.8MB .map source map file — a debugging artifact that maps minified production builds back to their original source and, when sourcesContent is embedded, carries that source verbatim. This single file exposed 512,000 lines of unobfuscated TypeScript across roughly 1,900 files. The entire agent harness architecture of what is arguably the most sophisticated AI coding tool on the market was now public.
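To see why a shipped .map file is equivalent to shipping the source itself, here is a minimal sketch of how the Source Map v3 format works. The map below is a toy stand-in, not Anthropic's file; the point is that bundlers commonly embed every original file, verbatim, in the sourcesContent array:

```python
import json

# Toy stand-in for a leaked .map file. Bundlers that embed sourcesContent
# ship the original, unminified files inside the map itself.
LEAKED_MAP = json.dumps({
    "version": 3,
    "file": "cli.min.js",
    "sources": ["src/agent/harness.ts", "src/tools/permissions.ts"],
    "sourcesContent": [
        "export const TOOL_COUNT = 19; // original TypeScript, unminified",
        "export function gate(tool: string) { /* permission check */ }",
    ],
    "mappings": "AAAA",
})

def recover_sources(raw_map: str) -> dict[str, str]:
    """Pair each original file path with its embedded source text."""
    m = json.loads(raw_map)
    return dict(zip(m["sources"], m.get("sourcesContent") or []))

for path, source in recover_sources(LEAKED_MAP).items():
    print(f"--- {path} ---")
    print(source)
```

No decompilation, no reverse engineering — recovering the TypeScript is a JSON parse away, which is why one stray file in a tarball exposes an entire codebase.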
This was not a sophisticated attack. No zero-day. No insider threat. A missing .npmignore entry, a known Bun bug (#28001 filed on March 11 and still open at the time of the leak), and nobody on the release team catching it. Bun generates source maps by default and serves them in production mode even when documentation says it shouldn't. Anthropic acquired Bun in late 2025. The irony writes itself.
A nearly identical source map leak occurred with an earlier Claude Code version in February 2025. Same mechanism, same packaging oversight. The same class of vulnerability, unpatched, for over a year.
Within minutes, researcher Chaofan Shou posted the download link. Sixteen million views. Anthropic yanked the npm package, but the internet had already archived everything. Decentralized mirrors appeared on Gitlawb. Over 8,100 repositories were hit with DMCA takedowns within hours — but the code was permanently in the wild.
The Timeline
Chaofan Shou posts the source map download link on X. Instant virality.
Anthropic pulls npm package, begins DMCA takedowns. 8,100+ repos disabled.
Korean developer Sigrid Jin wakes up, ports the core architecture to Python using OpenAI's Codex, and pushes claw-code before sunrise.
claw-code hits 50,000 GitHub stars. Fastest repo in GitHub history to reach that milestone.
100,000+ stars. Rust rewrite branch started. Multiple "unlocked" forks appear stripping telemetry and guardrails.
What Was Exposed
This leak did not expose model weights. It exposed the orchestration layer — the harness that makes Claude's models useful for real work. And that is arguably more valuable from a competitive intelligence standpoint.
Architecture Highlights
19 permission-gated tools, each independently sandboxed. A three-layer memory system with persistent files, self-verification against actual code, and idle-time consolidation (internally called autoDream). 44 unreleased feature flags covering functionality nobody outside Anthropic knew existed. Six MCP transport types. A 46,000-line query engine. React + Ink terminal rendering using game-engine techniques.
The Easter Eggs
KAIROS — an unreleased autonomous agent mode. A persistent, always-running background daemon that stores memory logs and performs nightly "dreaming" to consolidate knowledge. Buddy — a Tamagotchi-style companion with 18 species, rarity tiers, RPG stats including debugging, patience, chaos, and wisdom. 187 hardcoded spinner verbs including "hullaballooing" and "razzmatazzing." A frustration detection regex matching swear words. And a swear word filter for randomly generated 4-character IDs.
Undercover Mode
This is the one that made Hacker News collectively lose it. Buried in the code was an entire subsystem called Undercover Mode, designed to prevent Claude from revealing Anthropic's involvement when contributing to open-source repositories. No AI Co-Authored-By lines. No mentions of Claude or Anthropic in commit messages. The system prompt literally instructs the agent to write commit messages "as a human developer would." The question this raises for the open source community is significant: if a tool is willing to conceal its own identity in commits, what else is it willing to conceal?
Internal model codenames were exposed: Capybara maps to Claude 4.6, Fennec to Opus 4.6, and Numbat to an unreleased model. Internal benchmarks revealed Capybara v8 has a 29-30% false claims rate — a regression from 16.7% in v4. A bug fix comment revealed 250,000 wasted API calls per day from autocompact failures. This is the kind of competitive intelligence leak that no amount of DMCA notices can undo.
The Clean-Room Rewrite: One Dev, One Night, AI Tools
This is where it gets legally and philosophically interesting.
Sigrid Jin — a developer previously profiled by the Wall Street Journal for single-handedly consuming 25 billion Claude Code tokens — did not just mirror the leaked code. He used OpenAI's Codex (a competitor's AI) to rewrite the entire core architecture from TypeScript to Python. No copied code. A clean-room implementation inspired by the leaked architectural patterns.
The result, claw-code, crossed 100K GitHub stars in 24 hours. It now has more stars than Anthropic's own Claude Code repository. A Rust rewrite is underway.
The legal theory: a clean-room AI rewrite constitutes a new creative work. Because no proprietary code was copied, it sits beyond the reach of the DMCA. The architecture was understood, then reimplemented independently. Traditionally, clean-room reverse engineering requires two separate teams — one to analyze and create specifications, one to implement from those specifications alone. It takes months and costs real money.
Now one developer with an AI agent did it overnight.
The Copyright Paradox
Here is where things collapse into a legal black hole.
1. AI-Generated Code May Not Be Copyrightable
On March 2, 2026, the U.S. Supreme Court denied certiorari in Thaler v. Perlmutter, letting stand the DC Circuit's ruling that AI-generated works without human authorship cannot receive copyright protection. The Copyright Office's position is clear: copyright attaches only where a human has determined sufficient expressive elements. Mere prompting is not enough.
Anthropic's own CEO has implied significant portions of Claude Code were written by Claude itself. If that is true, then portions of the leaked codebase may not even be copyrightable under current U.S. law. The DMCA takedowns are asserting copyright over code that the law might say nobody owns.
2. The Clean-Room Rewrite Is Legally Novel
Clean-room reverse engineering has been upheld by courts for decades — Sega v. Accolade, Sony v. Connectix. The principle is well-established. But those cases involved human engineers spending weeks or months creating independent implementations from functional specifications. What happens when an AI agent does this in hours? The legal precedent was built on the assumption that clean-room reimplementation is expensive and slow. That assumption is now dead.
3. Anthropic's Double Bind
This is the paradox that should keep every AI company's legal team awake. If Anthropic argues that the Python clean-room rewrite infringes their copyright, they are implicitly arguing that AI-generated code can be substantially similar enough to constitute infringement — which would undermine AI companies' own defenses in training data copyright cases. The entire AI industry's legal strategy depends on outputs being "transformative" rather than derivative. You cannot simultaneously claim your AI-generated code is protected by copyright and that your AI's training on copyrighted code is fair use because the outputs are transformative.
As one commentator put it: you cannot protect what the law says does not exist.
If AI-generated code cannot be copyrighted, and if AI can rewrite any proprietary codebase overnight into a different language while preserving the architecture — what exactly is left of software IP protection? Trade secrets only work if you keep the secret. Source maps in npm packages don't qualify.
Security Implications: The Real Damage
From an AppSec perspective, the copyright drama is secondary. The security implications are what matter.
Attack surface exposure. 512K lines of code means 512K lines of code to audit for vulnerabilities. Every permission boundary, every OAuth flow, every tool-gating mechanism is now available for adversarial analysis. Threat actors do not need to black-box fuzz Claude Code anymore. They have the blueprint.
Trojanized forks. Within hours of the leak, threat actors were seeding trojanized repositories on GitHub — clones of the leaked code with embedded backdoors, targeting developers eager to run their own Claude Code instances. This is a supply chain attack vector that will persist for months.
Anti-distillation mechanisms exposed. The code revealed that Claude Code injects decoy tool definitions into system prompts to pollute any training data captured from API traffic. A separate cryptographic client attestation system, built in Zig below the JavaScript layer, verifies that requests come from genuine Claude Code binaries. Now that these mechanisms are public, adversaries can specifically engineer around them.
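The decoy idea is easy to picture. Here is a hedged sketch of the general technique — every name in it is invented for illustration, and none of the leaked implementation's details are reproduced: fake tool definitions are mixed into the real tool list before it is serialized into the system prompt, so any traffic scraped for distillation inherits tools that do not exist.

```python
import random

# Invented examples -- the real and decoy tool names are deliberately
# not taken from the leaked code.
REAL_TOOLS = [
    {"name": "read_file", "description": "Read a file from disk."},
    {"name": "run_shell", "description": "Execute a shell command."},
]
DECOY_TOOLS = [
    {"name": "quantum_lint", "description": "Nonexistent; poisons scraped data."},
    {"name": "hyper_cache", "description": "Nonexistent; poisons scraped data."},
]

def build_prompt_tools(rng: random.Random, n_decoys: int = 1) -> list[dict]:
    """Interleave decoys with the real tools. Only the client knows
    which names the model is actually permitted to call."""
    tools = REAL_TOOLS + rng.sample(DECOY_TOOLS, n_decoys)
    rng.shuffle(tools)
    return tools

rng = random.Random(42)
tools = build_prompt_tools(rng)
print([t["name"] for t in tools])
```

The defense only works while the decoy set is secret — which is precisely why publishing the mechanism, as the leak did, lets adversaries filter the decoys back out.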
The "unlocked" forks. Multiple repositories appeared within 24 hours claiming to have stripped all telemetry, removed guardrails, unlocked all experimental features, and enabled use with competitor models. These are effectively jailbroken versions of a powerful coding agent. The risk of these being weaponized is non-trivial.
The root cause is embarrassing. This was a CI/CD pipeline failure. A .npmignore entry. A known bug that sat unpatched for 20 days in a runtime Anthropic itself owns. This is the kind of basic operational security failure that would get flagged in any competent SDL review. And it happened to the company building one of the most advanced AI systems on the planet.
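The fix is equally basic. A minimal sketch of the missing CI gate, assuming a pipeline that can inspect the package's file list before publishing — in practice you would feed it the output of `npm pack --dry-run --json`; a hardcoded list stands in here:

```python
import fnmatch

# Patterns that should never reach a public registry. The *.map entry is
# the one that bit Anthropic; the others are common companions.
FORBIDDEN = ["*.map", ".env*", "*.pem"]

def audit_package_files(paths: list[str]) -> list[str]:
    """Return the files that match a forbidden pattern.
    In CI, fail the release whenever this list is non-empty."""
    return [p for p in paths
            if any(fnmatch.fnmatch(p, pat) for pat in FORBIDDEN)]

# Stand-in for the file list reported by `npm pack --dry-run --json`.
packed = ["cli.js", "cli.js.map", "package.json", "README.md"]
leaks = audit_package_files(packed)
print("blocked:" if leaks else "clean:", leaks)
```

A dozen lines in a release pipeline — or an allowlisting "files" field in package.json instead of a denylisting .npmignore — and the tarball never leaves the building with a source map inside.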
What This Means Going Forward
Anthropic's response was telling. Within hours of the leak, they emailed all subscribers announcing that third-party harnesses now require pay-as-you-go billing instead of subscription access. When technical enforcement fails, you shift to billing enforcement. The moat moved from harness to model.
But the broader implications extend well beyond one company's bad day:
Source maps are an underestimated attack surface. Every engineering team shipping JavaScript or TypeScript to public registries needs to audit their build pipeline for source map leakage. If Anthropic — with their resources and security-conscious culture — can ship a 60MB source map to npm, anyone can.
AI-powered reverse engineering changes the economics of IP protection. Clean-room reimplementation used to be a meaningful barrier precisely because it was expensive and slow. When an AI agent can port 500K lines of TypeScript to Python overnight, the cost of reverse engineering drops to approximately the price of a Claude Max subscription. Every proprietary codebase is now one leak away from an open-source equivalent.
Copyright law is not ready for this. The legal framework was built for a world where code is written by humans, copying is binary (you either copied or you didn't), and clean-room reimplementation takes months. None of those assumptions hold anymore. We are in uncharted legal territory, and the courts are years behind the technology.
Final Thoughts
The Claude Code leak is not, in isolation, the most technically dangerous security incident of 2026. It landed in a month that also saw the Axios npm supply chain compromise, the Mercor AI breach, OpenAI Codex command injection via branch names, and GitHub Copilot injecting promotional ads into pull requests as hidden HTML comments.
But it might be the most strategically significant. Not because of what was exposed, but because of what happened next: one developer, one night, one AI tool, and the complete reimplementation of a proprietary codebase that a company valued enough to issue 8,100 DMCA takedowns to protect.
The question is no longer whether your source code can be leaked. It is whether it matters if it is — because the next version of your competitor might already be writing itself.