Code w/ Claude 2026 컨퍼런스 발표 현장 — Anthropic Managed Agents 공개

res.infoq.com

Half the Devs Didn't Read the Code — What Code w/ Claude 2026 Means for Software Development

Claude Code 2026, Managed Agents, Outcomes, AI autonomous coding, multi-agent orchestrationDev

Code w/ Claude 2026 Liveblog

Anthropic's Code with Claude showed off coding's future—whether you like it or not

Inside Anthropic's 2026 Developer Conference

Anthropic's Q1 2026 annualized revenue grew 80x against their original plan. SWE-bench Verified scores jumped from 62% to 87% in a single year, and API traffic is up 17x year-over-year. But the most striking data point from the May London conference? Nearly half of the developers in the room admitted they'd shipped PRs written entirely by Claude — without reading a single line of code.

Code w/ Claude 2026 wasn't a product launch event. It was a status report on something already happening at scale.

TL;DR

Conference → Managed Agents launch → Auto quality grading (Outcomes) → Async automation (Routines) → Developer = Orchestrator

What was this conference actually about?

Code w/ Claude is Anthropic's annual developer conference. In 2026 it expanded to three cities for the first time: San Francisco (May 6), London (May 19), and Tokyo (June 10). The absence of a model announcement was intentional — Anthropic's message was "the models are already capable enough. The remaining challenge is building the right structures to use them well."

Anthropic engineer Ravi Trivedi said it plainly at the event: "The key principle is getting out of Claude's way. We like to say: 'Let it cook.'" Shipping AI-written PRs without review has quietly become the default at many teams. Here's what that actually means.

80×

Q1 revenue vs. plan

87%

SWE-bench Verified (from 62%, one year)

17×

API volume YoY growth

2×

Rate limits for Pro & Max

Infrastructure got a major upgrade too. Anthropic announced a partnership to allocate all capacity from SpaceX's Colossus supercluster to Claude, and removed peak-hour throttling for Pro/Max subscribers. CEO Dario Amodei told the audience he expects "a one-person billion-dollar company to emerge in 2026".

What actually changed?

The centerpiece of Code w/ Claude 2026 was the announcement that Claude Code has become a multi-agent platform, not just a coding assistant. Three new primitives shipped.

Outcomes — AI grades its own output
You define a rubric for what success looks like. A separate grader agent scores the output and loops until it passes. It's a shift from "generate and ship" to "generate, verify, retry." Anthropic's internal benchmarks showed an 8.4% quality improvement for Word documents and 10.1% for PowerPoint slides — with no model changes at all.

Multi-agent Orchestration — a lead coordinates a team
A lead agent decomposes complex tasks and delegates pieces to specialized sub-agents running in parallel. Sub-agents share a file system, and the lead monitors mid-workflow progress. Addy Osmani maps this as a three-level pattern: Subagents → Agent Teams → Orchestration at Scale.

Dreaming — agents learn from past sessions
Between sessions, an automated process reviews previous work logs, distills patterns and mistakes into persistent memory, and preloads that memory before the next session starts. It's the feature that increased Harvey's task completion rate by 6x.

	Old Claude Code	Claude Managed Agents
Quality check	Manual human review	Outcomes auto-grades and retries
Agent count	1 (sequential)	Lead + N specialists (parallel)
Automation trigger	Manual prompt	Routines: cron, GitHub webhooks, API
Cross-session learning	Starts fresh each time	Dreaming accumulates patterns
Infrastructure	Local CLI tool	Server-managed (sandbox, checkpoints)

Claude Code itself expanded significantly across surfaces: CLI, IDE (with visual diff tracking), Desktop app (full-screen GUI, image support), and the Claude Agent SDK for external developers. CI auto-fix, Code Review, and Security Review also landed.

Routines is the real unlock

Routines auto-triggers Claude Code tasks via cron schedules, GitHub webhooks, or API endpoints. PRs get reviewed automatically. Security scans run overnight. Failed tests generate fix PRs without anyone touching a keyboard. The shift is from "developer opens Claude" to "Claude works while developer sleeps".

How to get started

Update Claude Code
Run npm install -g @anthropic-ai/claude-code. Full Managed Agents features are primarily Enterprise, but some Routines and basic orchestration are available from Pro/Max.
Define an Outcomes rubric
Pick something repetitive — code review or document generation. Write a rubric: "This PR must have no security vulnerabilities and 80%+ test coverage." Let Outcomes loop until it hits that bar.
Write an AGENTS.md file
Give agents project-specific context: conventions, forbidden patterns, common commands. Dreaming uses this file to accumulate learning across sessions.
Wire up Routines
Connect a GitHub repo webhook so Claude auto-responds to PR and commit events. Start with read-only tasks like code review or security scanning before enabling write access.
Adopt multi-agent patterns in stages
Follow Addy Osmani's three-level approach: Subagents (available now, no setup), Agent Teams (experimental, env var), Orchestration (Managed Agents scale). Start with subagents — decompose one task at a time.

🔗

더 깊이 파고 싶다면

Simon Willison — Code w/ Claude 2026 Liveblog

The primary source — a real-time writeup of the full conference.

Every.to — Inside Anthropic's 2026 Developer Conference

The sharpest analysis of what Managed Agents means for "what an AI platform is." Includes the Spiral same-day deployment case study.

MIT Technology Review — Anthropic's Code with Claude showed off coding's future

A critical take on the "skip the code review" culture — the risks and the safeguards.

Addy Osmani — The Code Agent Orchestra

The definitive guide to multi-agent coding patterns — when to use subagents, teams, and full orchestration.

InfoQ — Anthropic's Code with Claude Announces Managed Agents

Technical details and business metrics in one place.

Claude Code Agent Teams Docs

Official setup guide for multi-agent configuration and experimental features.

FAQ

How is Claude Code Agent Teams different from Managed Agents?

They operate at different layers. Claude Code Agent Teams lets you run multiple Claude Code sessions experimentally on your local machine, while Managed Agents is a server-side execution environment managed by Anthropic. Managed Agents includes production infrastructure like sandboxing, checkpointing, and credential scoping — making it better suited for teams and enterprise workflows.

What kinds of tasks work best with Outcomes?

Outcomes shines when you can clearly define success criteria upfront. Code review (no security vulnerabilities, 80%+ test coverage) and document generation (specific format compliance, required sections) are ideal. Creative tasks or anything requiring subjective judgment are harder to rubric-ize, so Outcomes has less impact there.

How is Routines different from existing CI tools like GitHub Actions?

GitHub Actions runs predefined scripts. Routines runs Claude, which means it understands context before acting. When a PR comes in, instead of just running a linter, Claude reads the code change, assesses whether it introduces security risks, and checks alignment with the existing architecture — in plain language. It's not replacing your CI; it's adding a reviewer who actually understands the code.

Can solo developers or small teams use this right now?

Some Routines and basic Outcomes are available from Pro/Max plans. The full server-managed Managed Agents platform is primarily an Enterprise offering. Solo developers can get a similar effect by combining Claude Code's Agent Teams (experimental, enable via env var) with a well-structured AGENTS.md file.

Is it actually safe to ship PRs without reading the code?

Honestly, not yet. MIT Technology Review covered this phenomenon critically — security vulnerabilities can slip through and bad patterns can accumulate. That's exactly why Anthropic launched Outcomes and Security Review alongside each other. The safe path: connect Outcomes rubrics and security scanning to your CI pipeline first, then 'let it cook'.

Written by Rush

Tracking where business meets AI.

Did you find this reference helpful?

Get curated references delivered to your inbox weekly

Share this reference

Antioch — Meet the Cursor for Robot AI

Physical AI startups no longer need to rent warehouses or build million-dollar test facilities. Antioch brings software-speed development to robotics through cloud simulation — and just raised $8.5M seed to prove it.

Explore more AI workflow guides on similar topics

$20K and 12 AI Tools Built a $1.8B Telehealth Company — And Then the Red Flags Arrived

morningbrew.com

Medvi telehealth, AI startup leverage, GLP-1 startup, one-person unicorn, AI operations

$20K and 12 AI Tools Built a $1.8B Telehealth Company — And Then the Red Flags Arrived

Matthew Gallagher built Medvi, a GLP-1 telehealth startup, in 14 months with $20,000 and AI tools. 2 employees. 16.2% net margin. $401M in year one. Here's how the model works — and where it's breaking.

AI That Works While You Sleep — Automating Recurring Tasks with Claude Code Scheduled Task

substackcdn.com

What if your code review was already done when you woke up, and your newsletter

AI That Works While You Sleep — Automating Recurring Tasks with Claude Code Scheduled Task

What if your code review was already done when you woke up, and your newsletter sources were already organized? Here's how to automate recurring tasks with Claude Code Scheduled Task.

Next →Antioch — Meet the Cursor for Robot AI