adaptionlabs.ai

AI Is Now Training AI — How AutoScientist Beat Human Researchers by 35%

AI auto fine-tuning, AutoScientist, AI model training automation, Adaption, Sara HookerDev

Adaption aims big with AutoScientist, an AI tool that helps models train themselves

AutoScientist: Automate the Science of Model Training

Adaption's AutoScientist Automates Model Fine-Tuning With Closed-Loop Training Outperforming Human-Designed Configurations

It took AI researchers weeks to tune what AutoScientist handles in hours — and it outperformed their handcrafted configurations by 35%.

3-sec summary

Define goal → Run AutoScientist → Data + recipe co-optimization → Iterative convergence → Custom AI model ready

Wait, how does this even work?

Fine-tuning is the process of taking a general-purpose model like GPT and retraining it for a specific task — "legal document analysis" or "customer support." The idea is simple. The execution has always been brutal.

You need to decide which data to use, which data to discard, what learning rate to set, how many epochs to run, which loss functions to apply. The combinatorial search space is enormous. That's why real fine-tuning has historically required research-lab-level expertise.

AutoScientist automates the entire loop. It co-optimizes data selection and the training recipe simultaneously, running closed-loop iterations until it converges on your objective. Every existing tool either optimizes data or training config — not both at once.

35%

better than human researcher configs

48%→64%

win rate vs. expert configurations

weeks → hours

model training cycle

Sara Hooker, the CEO, was previously VP of AI Research at Cohere and spent five years at Google DeepMind. In February 2026, Adaption raised $50M from Emergence Capital, Mozilla Ventures, and Fifty Years. Co-founder Sudip Roy was Cohere's head of inference. This isn't a wrapper startup — it's the team that knows AI training best.

Why fine-tuning was so hard before

Three classic fine-tuning failure modes: (1) Catastrophic forgetting — learning new things erases existing capabilities. (2) Overfitting — perfect on training data, broken in production. (3) Conflicting signals — contradictory training data confuses the model. AutoScientist is designed to automatically detect and route around all three.

What do the numbers actually show?

Adaption's internal benchmarks: AutoScientist vs. configurations designed by their own AI researchers — across 8 verticals, dataset sizes from 5K to 100K examples, and 100B+ parameter model architectures from Together AI. Average performance uplift: 35%. Win rate: 48% to 64%.

	Traditional fine-tuning	AutoScientist
What gets optimized	Data or recipe (separately)	Data + recipe simultaneously
Time to model	Weeks (manual experimentation)	Hours (automated convergence)
Expertise required	Senior ML engineer essential	Accessible without deep ML knowledge
Data handling	Full dataset used (noise included)	High-signal auto-selection, toxic noise filtered
Failure modes	Forgetting, overfitting, signal conflicts	Automatic detection and avoidance of all three

The honest caveat: these are internal benchmarks from Adaption themselves. Standard evaluations like SWE-Bench or ARC-AGI don't apply because AutoScientist is built for task-specific adaptation, not general benchmarks. Independent verification waits for customer results after the free trial period ends in mid-June 2026.

But the core claim is significant: frontier-level model training is now possible outside of Big Labs. Hooker sees AutoScientist the way code generation unlocked new capabilities — a step toward democratizing AI training itself.

The essentials: how to start

Define your objective
What specific task do you need the model for? "Customer support email automation," "legal document summarization," "domain-specific code generation" — the more specific, the better.
Prepare your data
5,000+ examples is enough to get started. The data doesn't need to be perfectly curated — AutoScientist determines what's high-signal on its own.
Launch AutoScientist
Free 30-day trial at adaptionlabs.ai. Input your objective and let it run the co-optimization loop.
Watch convergence
The system iterates across data and recipe combinations automatically — the equivalent of hundreds of manual experiments a researcher would run.
Deploy your model
Built on Together AI infrastructure, supporting 100B+ parameter models. Ready for cloud serving once optimization completes.

No independent verification yet

All performance claims are from Adaption's internal benchmarks. External validation is still pending. The 30-day free trial is the best way to verify it on your own data before committing.

🔗

더 깊이 파고 싶다면

AutoScientist: Automate the Science of Model Training

Adaption's official blog post explaining how AutoScientist works, with performance metrics and technical details.

Adaption aims big with AutoScientist (TechCrunch)

The original interview with Sara Hooker on AutoScientist's launch, vision, and what it means for the AI training landscape.

Sara Hooker Bets $50M That Smarter Training Beats Bigger Models

Deep background on the $50M raise, Hooker's thesis on smarter training vs. bigger models, and Adaption's founding story.

AutoScientist Automates Model Fine-Tuning With Closed-Loop Training

Technical deep-dive into the closed-loop optimization mechanism and detailed benchmark analysis.

AutoScientist product page — 30-day free trial

Start the free trial and explore the platform directly.

Adaption's AutoScientist: Automating the Frontier of Model Training

Business impact analysis alongside the technical architecture of AutoScientist.

FAQ

How is this different from OpenAI's Fine-tuning API?

OpenAI's fine-tuning API takes your data and trains with it. AutoScientist figures out which data to use and how to train — optimizing both data selection and training recipes simultaneously. It's the difference between 'here's my data, do something with it' and 'here's my goal, figure out everything else.'

Can I trust the 35% performance improvement claim?

There's no independent external verification yet — this is based on Adaption's internal benchmarks comparing against their own researchers. The benchmark design may favor the home team. The best way to verify is to test it yourself during the 30-day free trial.

How much data do I need to get started?

Tests ran on datasets from 5,000 to 100,000 examples. The data doesn't need to be perfectly cleaned — AutoScientist automatically identifies high-signal data and filters out toxic noise. Starting with 5,000 domain-specific examples is enough to begin.

What's it actually best suited for?

Task-specific adaptation rather than general benchmarks. Think: medical record processing, legal document classification, domain-specific code generation, customer support automation. If you need a model that does one specific thing really well with your proprietary data, this is the use case.

What happens after the 30-day free trial?

No public pricing yet — it's enterprise sales-driven, negotiated after the trial period. Given the Together AI infrastructure backing 100B+ parameter models, expect usage-based pricing at scale.

Written by Rush

Tracking where business meets AI.

Did you find this reference helpful?

Get curated references delivered to your inbox weekly

Share this reference

Antioch — Meet the Cursor for Robot AI

Physical AI startups no longer need to rent warehouses or build million-dollar test facilities. Antioch brings software-speed development to robotics through cloud simulation — and just raised $8.5M seed to prove it.

Explore more AI workflow guides on similar topics

$20K and 12 AI Tools Built a $1.8B Telehealth Company — And Then the Red Flags Arrived

morningbrew.com

Medvi telehealth, AI startup leverage, GLP-1 startup, one-person unicorn, AI operations

$20K and 12 AI Tools Built a $1.8B Telehealth Company — And Then the Red Flags Arrived

Matthew Gallagher built Medvi, a GLP-1 telehealth startup, in 14 months with $20,000 and AI tools. 2 employees. 16.2% net margin. $401M in year one. Here's how the model works — and where it's breaking.

AI That Works While You Sleep — Automating Recurring Tasks with Claude Code Scheduled Task

substackcdn.com

What if your code review was already done when you woke up, and your newsletter

AI That Works While You Sleep — Automating Recurring Tasks with Claude Code Scheduled Task

What if your code review was already done when you woke up, and your newsletter sources were already organized? Here's how to automate recurring tasks with Claude Code Scheduled Task.

Next →Antioch — Meet the Cursor for Robot AI