miro.medium.com

Claude Plans, Kimi Builds — What Happens When AIs Split the Work

Claude Code writes the plan, Kimi CLI handles implementation — and the refactoriDev

Threads @focusrefresh — Kimi CLI refactoring 극찬

MoonshotAI/kimi-cli: Kimi Code CLI is your next CLI agent.

Getting Started — Kimi Code CLI Docs

Someone had Claude Code write the plan, then handed implementation to Kimi CLI — and the refactoring came back "art-level." They even built a script that auto-delegates implementation commands to Kimi and gets results back via status.md.

TL;DR

Claude Code creates the Plan → Kimi CLI handles Implementation → Results via status.md → delegate-opus script for reverse delegation → Multi-AI orchestration complete

What Is It?

The era of using just one AI coding tool is ending. Now it's about splitting roles between AIs and letting them collaborate.

Kimi Code CLI is an open-source terminal-based coding agent built by Moonshot AI, a Chinese AI company. It has 7,000+ GitHub stars, 1,100+ commits, and 48 contributors actively developing it. It can read and modify code, execute shell commands, search the web, and autonomously plan and coordinate tasks.

Here's the thing — Kimi CLI occupies a similar niche as Claude Code, but with different strengths. The latest Kimi K2.5 model uses a 1-trillion-parameter Mixture-of-Experts architecture (32B active) and scores 76.8% on SWE-bench Verified — close to GPT-5.2's 80%. It particularly excels at generating clean architectures for code refactoring and implementation tasks.

But what @focusrefresh discovered is even more interesting. When you combine Claude Code's reasoning ability with Kimi CLI's implementation capability, the synergy is explosive. Claude designs the architecture, and Kimi delivers implementation that "truly embodies the spirit of agentic engineering." This is multi-AI agentic engineering in action.

Under the hood, Kimi CLI is built around KimiSoul — an execution engine with fully separated agent system, tool system, and UI layer. It even has a checkpoint-based "time travel" mechanism, letting you roll back to a previous state if something goes wrong mid-task.

Key Analogy

Claude Code = Project Manager (planning, architecture decisions)
Kimi CLI = Senior Developer (clean implementation, refactoring)
delegate-opus script = Slack bot (auto-routes tasks between them)

What Changes?

No single AI can do everything well.

This follows a fundamental principle of software engineering. IBM calls it "AI agent orchestration" — to handle complex workflows, each agent should take on a specialized role. In practice, Mae Capozzi built "hub-team," a Claude Code-based multi-agent orchestrator that automates a 6-step workflow: Planning → Git Setup → Implementation → Testing → Review → PR Creation.

	Single AI	Multi-AI Combo (Claude + Kimi)
Design Quality	Limited by one model's reasoning	Claude's superior reasoning for architecture
Implementation Quality	Design and implementation mixed, context pollution	Kimi implements in a clean context
Cost	Expensive model handles everything	Expensive model for planning only, cheaper model for implementation
Speed	Sequential processing, long wait times	Parallel processing via delegation scripts
Context Management	All info accumulates in one long session	Context separated by role, quality maintained
Extensibility	Locked into one tool	Pluggable design, swap tools freely

@focusrefresh summed up this philosophy in one line: "Make every tool pluggable." From a Kimi session, you can use the delegate-opus script to reverse-delegate to Claude — "Hey, review this plan for me." It's bidirectional delegation.

Cline CLI 2.0 also ships with free Kimi K2.5 access, declaring that "coding agents are no longer assistants — they're collaborators." You can run multiple agent instances in parallel from the terminal, piping them into automation pipelines via stdin/stdout. The infrastructure for multi-AI workflows is already here.

$15/mo~

Kimi Code starting price

7,000+

GitHub Stars

76.8%

K2.5 SWE-bench score

Getting Started

Install Kimi CLI
One line in your terminal. You'll need Python 3.12–3.14 (3.13 recommended).
```
curl -LsSf https://code.kimi.com/install.sh | bash
kimi --version # verify installation
```
On first run, use /login to authenticate. Kimi Code platform OAuth is the easiest option.

Set Up Claude + Kimi Role Splitting
Start with Claude Code for planning. Use Plan mode to design the architecture and generate an implementation_plan.md. Then open Kimi CLI and hand it that plan.

# Claude Code generates the plan
claude "Create a refactoring plan for this project. Save as implementation_plan.md"

# Kimi CLI implements it
kimi "Read implementation_plan.md and implement it exactly"

Build an Auto-Delegation Script
Like @focusrefresh, you can create a script that auto-delegates implementation commands to Kimi and receives results via status.md. The core idea is simple — save Claude Code's output to a file, then batch-process it with Kimi CLI's --print mode.
```
# delegate-to-kimi.sh (example)
claude --print -c "Write the refactoring plan" > plan.md
kimi --print -c "Read plan.md and implement. Write results to status.md"
cat status.md
```
Set Up Reverse Delegation (delegate-opus)
From a Kimi session, you can also delegate back to Claude — "Please review this plan." Build a script that calls Claude from within Kimi, and you have bidirectional collaboration. Kimi CLI supports MCP (Model Context Protocol), so you can also connect other AI tools via MCP servers.
Share Project Context via AGENTS.md
Run /init in Kimi CLI and it analyzes your project structure to generate AGENTS.md. This file helps Kimi understand your project better — it's the equivalent of Claude Code's CLAUDE.md. You can enforce the same project rules across both AIs.

Heads Up

Kimi CLI requires user approval before executing shell commands by default. For automation pipelines, you'll need -y or -yolo mode — but use these carefully on production codebases.

🔗

Deep Dive Resources

Kimi CLI GitHub Repository

Installation, features, MCP setup — open source, check it yourself

Kimi Code CLI Official Docs

From installation to CLI/Web/ACP modes — the official guide

Kimi CLI Technical Deep Dive

KimiSoul engine, checkpoints, ACP protocol — architecture deep dive

Kimi K2.5 Model Repository

1T-parameter MoE model — benchmarks, deployment guide, Agent Swarm

Building a Multi-Agent AI Orchestrator

A 6-step multi-agent orchestrator built on Claude Code

What is AI Agent Orchestration? — IBM

Concepts, types, and challenges of multi-agent orchestration

FAQ

How much does combining Claude Code and Kimi CLI cost?

Claude Code runs on an Anthropic subscription (Max at $100/mo) or API pay-per-use. Kimi CLI starts at $15/mo. By splitting design work to Claude and implementation to Kimi, you can actually reduce overall costs by cutting expensive model usage. Kimi K2.5 is even available for free through Cline.

Are there other AI coding tools that pair well with Claude Code besides Kimi CLI?

Aider, Codex CLI, Cline — all of these can be combined in similar ways. The key insight isn't any specific tool but the role-splitting pattern: reasoning-heavy work goes to the stronger model, code generation goes to the specialist. That principle works with any combination.

Is it safe to run the delegate-opus script on a production codebase?

Proceed with caution. Auto-delegation scripts let AI modify files directly, so always test on a separate branch first and have a human review before merging. Kimi CLI's -yolo mode auto-approves all shell commands, which is especially risky in production environments.

How do you share context in a multi-AI setup? Do you have to explain everything from scratch each time?

File-based context sharing is the key. Claude Code reads CLAUDE.md, Kimi CLI reads AGENTS.md. Put the same project rules in both files, then use plan files (plan.md) and status files (status.md) to pass work context between tools — no need to re-explain every session.

Written by Kevin

Dissecting AI tools and workflows from a developer's lens.

Did you find this reference helpful?

Get curated references delivered to your inbox weekly

Share this reference

Antioch — Meet the Cursor for Robot AI

Physical AI startups no longer need to rent warehouses or build million-dollar test facilities. Antioch brings software-speed development to robotics through cloud simulation — and just raised $8.5M seed to prove it.

Explore more AI workflow guides on similar topics

$20K and 12 AI Tools Built a $1.8B Telehealth Company — And Then the Red Flags Arrived

morningbrew.com

Medvi telehealth, AI startup leverage, GLP-1 startup, one-person unicorn, AI operations

$20K and 12 AI Tools Built a $1.8B Telehealth Company — And Then the Red Flags Arrived

Matthew Gallagher built Medvi, a GLP-1 telehealth startup, in 14 months with $20,000 and AI tools. 2 employees. 16.2% net margin. $401M in year one. Here's how the model works — and where it's breaking.

AI That Works While You Sleep — Automating Recurring Tasks with Claude Code Scheduled Task

substackcdn.com

What if your code review was already done when you woke up, and your newsletter

AI That Works While You Sleep — Automating Recurring Tasks with Claude Code Scheduled Task

What if your code review was already done when you woke up, and your newsletter sources were already organized? Here's how to automate recurring tasks with Claude Code Scheduled Task.

Next →Antioch — Meet the Cursor for Robot AI