Claude Code 소스 코드 유출 사건 — npm 패키징 실수로 공개된 AI 에이전트 내부 아키텍처

images.ctfassets.net

Claude Code Source Leak — What 512K Lines Reveal About AI Agent Architecture

Anthropic accidentally published Claude Code's entire source code (512,000 linesDev

Inside the leaked Claude Code files — Ben's Bites

Claude Code's source code appears to have leaked — VentureBeat

Everyone Analyzed Claude Code's Features. Nobody Analyzed Its Architecture — Medium

On March 31, 2026, something unprecedented happened in the developer community. Anthropic accidentally shipped the entire source code of Claude Code — their highest-revenue product — to the public npm registry. A single 59.8MB source map file slipped through, exposing 512,000 lines of TypeScript to every developer on earth.

Within hours, over 8,100 mirror repositories appeared on GitHub. Anthropic issued DMCA takedowns, but it was too late. The blueprint of a $2.5 billion ARR product was public knowledge.

TL;DR

npm packaging error leaks source map → 512K lines of TypeScript exposed → 3-layer memory + 7-stage agent loop confirmed → Permission system, cost optimization, hidden features analyzed → Becomes a reference architecture for AI agents

What is this?

Claude Code is Anthropic's AI coding agent that reads, edits, and executes code directly from your terminal. Most people assumed it was a thin CLI wrapper around the Claude API. The leaked source told a completely different story.

It's a sophisticated software system spanning 512,000 lines across 1,900 files. A custom React-based terminal renderer, 60+ permission-gated tools, multi-agent orchestration, and a background memory consolidation engine.

How did the leak happen?

When deploying Claude Code version 2.1.88 to npm, an internal debugging source map file (.map) was accidentally included in the package. Likely caused by a missing .npmignore rule or misconfigured files field in package.json. Boris, the Claude Code lead, confirmed it was "a developer error, not a Bun bug or a hack."

The timing made it even more surreal — happening one day before April Fools', many initially dismissed it as a prank. To make things worse, the axios npm package was independently compromised by a North Korea-linked threat actor on the very same day.

What's different?

The gap between the "just an API wrapper" assumption and the actual architecture is the story. The leaked code proved that the real competitive advantage in AI coding agents isn't the model — it's the harness.

Area	Typical AI CLI Tool	Claude Code's Actual Implementation
Core Loop	while + await model.chat()	Async generator state machine with 7 branch points
Memory	Store everything or simple truncation	3 layers: Index (MEMORY.md) + Topic files + Raw search
Context Mgmt	Delete old messages	5 compaction strategies (Snip, Microcompact, Autocompact, etc.)
Tool Loading	Send all tool schemas every time	ToolSearch for on-demand dynamic loading
Permissions	Allow all or block all	Default-deny + denial tracking + graceful fallback
UI Rendering	console.log based	React + Ink + Yoga layout (React for terminals)
Cost Optimization	No specific strategy	Prompt cache tracking across 14 state fields, schema stabilization

Let's break it down.

1. Agent Loop: 7-Stage State Machine

The core logic lives in query.ts. Exactly 1,729 lines, with an async generator function called queryLoop wrapping a while(true) loop.

Most agent frameworks follow a simple "send prompt → get response → run tool → repeat" cycle. Claude Code's generator pattern maintains state across 7 explicit branch points (continue sites), enabling session pause/resume, serialization, and mid-turn error recovery.

2. Self-Healing Memory: "Skeptical Recall"

VentureBeat called this the "context entropy solution."

MEMORY.md — A lightweight index always loaded into context. ~150 characters per line, storing pointers only
Topic files — Actual project knowledge, fetched on demand
Raw search — Transcripts are never fully reloaded; specific identifiers are grep'd

"Strict Write Discipline" is the standout: the agent only updates its index after a successful file write, preventing failed attempts from polluting memory. The agent is instructed to treat its own memory as a "hint" — always verifying against the actual codebase before acting.

3. Five Context Compaction Strategies

When long sessions fill the context window, Claude Code combines five strategies:

Snip — Quick pruning of older messages. Speed-first.

Microcompact — Targets tool outputs only. A 5,000-line file read gets saved to disk; the model sees a summary.

Context Collapse — Progressive compression of older conversation segments. (Still behind a feature flag)

Autocompact — Full-conversation summarization at configurable token thresholds.

Reactive Compact — Emergency brake when the API returns a 413 (payload too large).

4. The "46-Line Philosophy" of Permission Design

Every tool must declare isReadOnly and isDestructive. Both default to false — default-deny.

The most fascinating piece is denialTracking.ts, a 46-line file:

Consecutive denials → exits auto-mode

Total denials → asks before every action

Lines of code implementing this principle

When users keep saying "no," the system disables auto-mode and checks before each step. Most agents either keep retrying or hard-stop. Claude Code gracefully degrades: "If you're uncomfortable, I'll ask before every step."

5. Hidden Features

The leaked code also revealed unreleased features:

KAIROS (Autonomous Daemon) — Named after the Greek word for "the right time." An autoDream process consolidates memory during idle time, removing contradictions and converting vague observations into confirmed facts.
/buddy (Tamagotchi Pet) — An ASCII companion living in your terminal. 18 species, gacha rates (1% Legendary), RPG stats (DEBUGGING, PATIENCE, CHAOS, WISDOM, SNARK). Was meant to debut on April Fools' Day but leaked one day early.
Undercover Mode — For stealth contributions to public open-source repos. Strips all AI traces from commits.
Proactive Mode — AI operates autonomously without prompts.
Frustration Detection — Keyword patterns detect user's negative emotions for internal analytics.

6. Anti-Competitive Countermeasures: Anti-Distillation & DRM

The leaked code revealed more than features. Anthropic had built technical countermeasures specifically designed to block competitors from copying Claude Code.

Anti-Distillation — API requests inject fake tool definitions so competitors recording Claude Code traffic get poisoned training data. Reasoning chains between tool calls are reduced to cryptographically signed summaries, never exposed in full.
DRM-level Client Attestation — Below the JavaScript layer, Bun's native Zig HTTP stack injects computed hashes into every API request, cryptographically proving requests originate from a genuine Claude Code binary. Deliberately placed below the JS layer to prevent monkey-patching.

7. claw-code: When AI Clones AI

The biggest community response was claw-code, a project by Korean developer Sigrid Jin who used OpenAI's Codex to clean-room rewrite Claude Code's architecture in Python. No direct code copying — only patterns and architecture were referenced.

75,000+

GitHub stars (50K in just 2 hours)

75,000+

GitHub forks

The legal question is fascinating: Does an AI-generated clean-room rewrite in a different language constitute copyright infringement? Traditionally, clean-room reimplementation required two separate teams and months of work. Now AI does it overnight, and there's no legal precedent. Gergely Orosz (The Pragmatic Engineer) noted that even if Anthropic sues, winning an IP battle over AI-generated transformative works remains unclear.

Key Takeaways: What to Learn From This

Here's what builders, developers, and product managers can take away from this leak.

The moat in AI products is the harness, not the model
Claude Code proved it. Even with the same Claude model, how you design the agent loop, memory, permissions, and cost optimization determines product quality. If you're building AI products, invest more in "how to wrap the model" than "which model to use."
Don't underestimate context management
Even Anthropic has 5 strategies with 2 still experimental. Context management in long-running agent sessions remains an unsolved problem. Especially tool outputs (file reads, etc.) eating half your context window — study the Microcompact pattern.
Deferred tool loading is table stakes for 20+ tools
Claude Code, OpenAI Agents SDK, and CrewAI independently arrived at the same pattern. Sending all tool schemas every call wastes tokens. Use a meta-tool like ToolSearch for on-demand injection.
Design "graceful degradation" into your permission system
The 46-line principle: when users are uncomfortable, become more cautious. Trust matters more than features in AI products. Track repeated denials and smoothly transition from automatic to manual mode.
Consider why the leak isn't fatal
Star History put it well: "A codebase snapshot is a photograph, not the photographer." The real moat is the team building it and the pace of continuous delivery. The fact that Anthropic's stock and user base barely wavered after the leak proves the point.

🔍

Go deeper

Inside the leaked Claude Code files — Ben's Bites

First coverage of the leak. Hidden features list, community reaction, and the irony of April Fools' timing.

Claude Code's source code leaked — VentureBeat

Deep technical analysis of the 3-layer memory, KAIROS daemon, and Undercover Mode. Includes Anthropic's official statement.

What Claude Code's Leaked Source Teaches — Kubesimplify

Code-level verification of the agent loop, 5 compaction strategies, and permission system.

65 Open Source Packages Behind Claude Code — Star History

Full inventory of 65 open-source dependencies. Query lifecycle diagram and /buddy Easter egg deep dive.

The Great Claude Code Leak of 2026 — DEV.to

Accident or PR stunt? Dissects the npm packaging failure, DMCA response, and the claw-code clean-room reimplementation.

Claude Code Leaked via npm Packaging Error — The Hacker News

Security-focused analysis. Supply chain risks, typosquatting threats, and the concurrent axios compromise.

FAQ

Was user data exposed in the Claude Code leak?

No. Anthropic confirmed no customer data or credentials were involved. The leak exposed the product's source code (architecture), not user conversations, API keys, or sensitive information.

Should I stop using Claude Code because of this leak?

No need to stop, but if you installed via npm between March 31 00:21-03:29 UTC, check your lockfiles — the axios package was independently compromised during that window. Switch to the official native installer and update to version 2.1.89+.

Can competitors clone Claude Code from the leaked source?

Not easily. As Star History noted, 'a codebase snapshot is a photograph, not the photographer.' Clean-room reimplementations like claw-code have appeared, but the design patterns (memory, loops, permissions) have become shared industry knowledge rather than giving any single competitor a decisive edge.

Written by Rush

Tracking where business meets AI.

Did you find this reference helpful?

Get curated references delivered to your inbox weekly

Share this reference

Antioch — Meet the Cursor for Robot AI

Physical AI startups no longer need to rent warehouses or build million-dollar test facilities. Antioch brings software-speed development to robotics through cloud simulation — and just raised $8.5M seed to prove it.

Explore more AI workflow guides on similar topics

$20K and 12 AI Tools Built a $1.8B Telehealth Company — And Then the Red Flags Arrived

morningbrew.com

Medvi telehealth, AI startup leverage, GLP-1 startup, one-person unicorn, AI operations

$20K and 12 AI Tools Built a $1.8B Telehealth Company — And Then the Red Flags Arrived

Matthew Gallagher built Medvi, a GLP-1 telehealth startup, in 14 months with $20,000 and AI tools. 2 employees. 16.2% net margin. $401M in year one. Here's how the model works — and where it's breaking.

AI That Works While You Sleep — Automating Recurring Tasks with Claude Code Scheduled Task

substackcdn.com

What if your code review was already done when you woke up, and your newsletter

AI That Works While You Sleep — Automating Recurring Tasks with Claude Code Scheduled Task

What if your code review was already done when you woke up, and your newsletter sources were already organized? Here's how to automate recurring tasks with Claude Code Scheduled Task.

Next →Antioch — Meet the Cursor for Robot AI