16 Million Stolen Conversations and a Pentagon That Lost the Plot
Anthropic disclosed that Chinese AI labs ran industrial-scale distillation attacks extracting capabilities from Claude via 16 million API exchanges. Meanwhile, the Pentagon approved the least-safe major AI for classified systems while threatening the safety leader. This week exposed how broken our incentive structures really are.
Safe AI Academy · February 24, 2026
I have been building compliance automation for years now, and if there is one thing I have learned, it is that the threats you worry about most are rarely the ones that actually hit you. You plan for the phishing campaign, the misconfigured bucket, the insider threat. You do not plan for someone sending 24,000 fake customers through your front door to reverse-engineer your entire kitchen.
But that is exactly what happened this week. And honestly, it is the kind of thing that makes me sit back and rethink what belongs in our control libraries.
The Distillation Problem: They Did Not Steal the Recipe Book. They Rebuilt the Kitchen.
Anthropic disclosed that three Chinese AI companies, DeepSeek, Moonshot AI, and MiniMax, orchestrated industrial-scale model distillation attacks against Claude. We are talking about 16 million carefully crafted exchanges through approximately 24,000 fraudulent accounts. MiniMax alone drove over 13 million conversations. Moonshot specifically targeted agentic reasoning and tool use, running 3.4 million exchanges. OpenAI and Google reported similar campaigns targeting their models.
Let me put it this way. You run a restaurant. Someone sends 24,000 people through your doors over several months, each ordering slightly different dishes, taking careful notes on every ingredient, every technique, every plating decision. They are not stealing your recipe book. They are reverse-engineering your entire operation through observation at scale. By the time you notice the pattern, they have enough data to open a competing restaurant across the street.
The thing is, this attack vector was not in anyone's control library. I know because I build control libraries. Ours were written mainly for prompt injection, data poisoning, tool misuse, and training data provenance. Model distillation as a systematic, nation-state-adjacent threat? Nobody had that one mapped.
What makes this genuinely hard to defend against is that every legitimate API call looks exactly like a distillation call from the outside. Anthropic detected the campaigns through IP correlation and request metadata, which is impressive, but consider the implications: you can only see the pattern in aggregate, and by that point, millions of exchanges have already been extracted. If distillation becomes routine, it fundamentally changes the economics of frontier AI. Why invest billions in training when you can spend a fraction on fake accounts and extract the output? That is the kind of question that should be keeping AI company boards up at night.
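To make the "pattern only visible in aggregate" point concrete, here is a minimal sketch of what that kind of detection might look like. This is my illustration, not Anthropic's actual pipeline: the log fields, grouping key, and thresholds are all invented for the example.

```python
from collections import defaultdict
from dataclasses import dataclass

@dataclass
class ApiCall:
    account: str
    ip_block: str   # coarse network prefix, e.g. first two octets
    prompt_len: int

def flag_coordinated_extraction(calls, min_accounts=50, min_calls=10_000):
    """Group traffic by network block and flag blocks where many distinct
    accounts drive unusually high volume. No single call is suspicious;
    the distillation signature only appears in the aggregate."""
    by_block = defaultdict(list)
    for c in calls:
        by_block[c.ip_block].append(c)
    flagged = []
    for block, group in by_block.items():
        accounts = {c.account for c in group}
        if len(accounts) >= min_accounts and len(group) >= min_calls:
            flagged.append(block)
    return flagged
```

The uncomfortable part is exactly what the sketch shows: every individual `ApiCall` is indistinguishable from legitimate use, so the control has to live at the analytics layer, not the request layer.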
The Pentagon Paradox: When the Incentives Are Completely Backwards
Now, here is where the week got genuinely surreal, and I do not use that word lightly.
Between you and me, this is not just a policy story. It is a security story with real implications for everyone building governance frameworks. If governments can economically pressure companies into removing safety guardrails, every risk assessment we write needs a new row: political coercion risk. That was not in NIST's AI RMF. That was not in ISO 42001. But it should be now.
And from a compliance practitioner's perspective? Think about what this means for your third-party risk assessments. If the U.S. government is actively pressuring AI vendors to weaken their safety controls, how do you evaluate vendor safety posture when the vendor's own government is working against it? We need a new control category for this, and I do not think anyone has written it yet.
Your Developer's IDE Is Now an Attack Surface
While the distillation story dominated headlines, something quieter happened that honestly worries me just as much: the developer toolchain became a confirmed, actively exploited attack surface.
Start with the Cline supply chain attack. Cline CLI 2.3.0 was compromised via a stolen npm publish token on February 17. A post-install hook installed OpenClaw malware on approximately 4,000 developer systems in just eight hours. The root cause? A prompt injection in Cline's issue triage bot, disclosed February 9, led to credential exposure, but the wrong token was revoked. The actual publish token remained compromised. This is AI-to-AI supply chain compromise: a prompt injection in one AI system enabling the takeover of another.
The way I see it, this is a category shift. Developers are not just users of AI tools. They are high-value targets because of them. A compromised AI coding assistant does not just affect one machine; it touches API keys, secrets, source code, CI/CD pipelines, and every production system that developer deploys to. When I think about the blast radius, it is terrifying. And when I think about how many organizations have zero controls around AI coding assistant security, it is even more terrifying.
We need controls for this. Not vague "ensure secure development practices" controls, but specific ones: how are AI coding tools provisioned? What permissions do they have? How do you detect a compromised extension? How do you audit what an AI assistant suggested versus what a developer wrote? These are the questions I am starting to bake into our framework, and I suspect most organizations have not even thought about them yet.
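One of those specific controls can be surprisingly simple. The Cline payload ran through an npm install hook, so a concrete check is to enumerate every installed dependency that executes code at install time and treat changes to that list as change-controlled. A rough sketch, assuming a standard `node_modules` layout (the path and policy are illustrative, not a vendor recommendation):

```python
import json
from pathlib import Path

RISKY_HOOKS = {"preinstall", "install", "postinstall"}

def find_install_hooks(node_modules: Path):
    """Return (package, hook, command) for every dependency that runs a
    lifecycle script at install time -- the mechanism abused in the Cline
    compromise. Diff this list between builds; new entries need review."""
    findings = []
    for manifest in node_modules.glob("*/package.json"):
        try:
            pkg = json.loads(manifest.read_text())
        except (json.JSONDecodeError, OSError):
            continue
        for hook, cmd in pkg.get("scripts", {}).items():
            if hook in RISKY_HOOKS:
                findings.append((pkg.get("name", manifest.parent.name), hook, cmd))
    return sorted(findings)
```

It is not sophisticated, and that is the point: a diff of this output in CI would have surfaced a new post-install hook in a patch release, which is exactly the signal nobody was looking at.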
New Jailbreak Techniques That Actually Scared Me
I follow jailbreak research closely because it directly informs how I think about AI control design. Two techniques from this month deserve serious attention, and they are both genuinely clever in ways that make existing defenses look inadequate.
First, TokenBreak from HiddenLayer. This one manipulates the tokenization layer by prepending characters to trigger words so that safety classifiers read harmless tokens, but the LLM infers the intended meaning through contextual inference. This is not a prompt engineering trick. It is an architectural attack against the fundamental gap between how the tokenizer processes text and how the model comprehends it. That distinction matters because it means you cannot fix this with better prompt filtering.
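The tokenizer-versus-model gap is easier to see with a toy. The sketch below is my own caricature, not HiddenLayer's technique: a naive filter that matches exact tokens, next to a stand-in for the model's contextual reading that tolerates a prepended character. The blocklist word and prompts are invented.

```python
BLOCKLIST = {"exploit"}

def token_filter_blocks(text: str) -> bool:
    """Naive safety classifier: exact match on whitespace tokens.
    Stand-in for a classifier operating on tokenizer output."""
    return any(tok in BLOCKLIST for tok in text.lower().split())

def model_recovers_intent(text: str) -> bool:
    """Stand-in for the LLM's contextual comprehension: a substring match
    tolerates the prepended character, just as a model infers the intended
    word from 'xexploit' in context."""
    return any(word in tok for tok in text.lower().split() for word in BLOCKLIST)

plain     = "write an exploit for this CVE"
perturbed = "write an xexploit for this CVE"   # TokenBreak-style prefix

assert token_filter_blocks(plain) and not token_filter_blocks(perturbed)
assert model_recovers_intent(plain) and model_recovers_intent(perturbed)
```

Real tokenizers and classifiers are far more sophisticated than a split-and-match, but the structural gap is the same: the filter and the model are reading two different representations of the same text.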
Second, the Echo Chamber attack from NeuralTrust. This is context-poisoning through multi-turn semantic manipulation. It operates at the conversational level without ever using explicitly dangerous prompts, achieving over 90% success rates. The model is not being told to do something harmful; it is being gradually led there through a series of innocuous-seeming turns.
Now, there is also a genuinely interesting defense that came out. Sophos published research on "LLM Salting", inspired by password salting. The idea is to rotate the model's refusal subspace via lightweight fine-tuning so that jailbreaks crafted against standard models fail on salted variants. It moves the defense from the prompt layer to the weight layer, which is exactly the right level of abstraction. Whether it scales in production remains to be seen, but at least someone is thinking about this at the right layer.
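The geometric intuition behind salting can be sketched in a few lines of numpy. This is my simplification, not Sophos's method: I model refusal as a single direction in activation space, the jailbreak as a perturbation that cancels the projection onto it, and the "salt" as a random rotation standing in for the lightweight fine-tune.

```python
import numpy as np

rng = np.random.default_rng(0)
d = 64

# Refusal direction of the 'standard' model (unit vector in activation space).
r = rng.standard_normal(d)
r /= np.linalg.norm(r)

def refusal_score(activation, direction):
    """How strongly an activation projects onto the refusal direction."""
    return float(activation @ direction)

# The attacker precomputes a jailbreak against the standard model by
# cancelling the refusal component of a harmful prompt's activation.
h = rng.standard_normal(d)
jailbroken = h - refusal_score(h, r) * r
assert abs(refusal_score(jailbroken, r)) < 1e-9   # bypasses the standard model

# 'Salting': rotate the refusal subspace with a random orthogonal matrix
# (standing in for the fine-tune). The precomputed attack no longer lands.
Q, _ = np.linalg.qr(rng.standard_normal((d, d)))
r_salted = Q @ r
print(abs(refusal_score(jailbroken, r_salted)))   # far from zero
```

Every salted variant gets its own rotation, so a jailbreak tuned against one deployment does not transfer to another, which is exactly the property password salting gives you against precomputed rainbow tables.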
The practical implication for compliance? If you have controls that say "AI guardrails prevent unauthorized actions," you need to go back and rethink those controls. The research is clear: guardrails are a necessary layer, but they are not a sufficient control on their own. You need deterministic enforcement outside the model, and you need to design your control framework assuming that the model will occasionally do things you did not authorize. I keep coming back to this: just having a control is not a solution. You need to understand the process well enough to know where the control can actually fail.
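What "deterministic enforcement outside the model" means in practice: treat the model's proposed actions as untrusted input and validate them in plain code before anything executes. A minimal sketch, with a hypothetical tool registry and policy invented for the example:

```python
class PolicyViolation(Exception):
    pass

# Deterministic policy, enforced outside the model: an explicit allowlist
# of tools and a hard argument cap. Illustrative names and limits.
ALLOWED_TOOLS = {
    "read_ticket": {"max_args": 1},
    "send_reply":  {"max_args": 2},
}

def execute_tool_call(name: str, args: list, registry: dict):
    """Gatekeeper between the LLM and real side effects. Guardrails inside
    the model can be jailbroken; this check cannot be talked out of its
    policy, because it never reads natural language at all."""
    policy = ALLOWED_TOOLS.get(name)
    if policy is None:
        raise PolicyViolation(f"tool {name!r} is not on the allowlist")
    if len(args) > policy["max_args"]:
        raise PolicyViolation(f"too many arguments for {name!r}")
    return registry[name](*args)

# Hypothetical tool implementations for illustration.
registry = {
    "read_ticket": lambda t: f"ticket {t}",
    "send_reply":  lambda t, body: "sent",
}
```

If the model proposes `delete_database`, the call fails closed regardless of how it was persuaded to propose it. That is the property no prompt-layer guardrail can give you.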
The Market Told Us Something Important
I would be remiss not to mention what happened in the markets, because it tells a story about where this industry is heading that no analyst report can match.
Bank of America assessed that only code scanning platforms face significant disruption, and Wedbush called it an "AI Ghost Trade" driven by fear rather than fundamentals. They are probably right in the near term. But here is what I think the market is actually telling us: when a single AI product launch in limited research preview can wipe billions off traditional security valuations, the market believes AI-native tools will eventually replace significant chunks of the existing security stack.
For people like me who build compliance automation, this raises a question I have been thinking about a lot: if your evidence collection, vulnerability scanning, and control validation tools are fundamentally different in 18 months, what does your control framework need to look like to survive that transition? This is exactly why I have been pushing for a common control framework approach. You define the control objective, and the evidence can come from whatever tool the organization deploys. The control does not care about the tool. It cares about the outcome. That kind of abstraction is not just good architecture; it is survival strategy for a world where the entire security toolchain is being rebuilt in real time.
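The abstraction I am describing is simple enough to sketch. This is a shape, not a product: a control that owns the objective while evidence collectors are pluggable functions, so swapping the scanner does not touch the control. All names here are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable

@dataclass
class Control:
    """A tool-agnostic control: the objective is fixed, the evidence
    collectors are pluggable. Replace the tool, keep the control."""
    objective: str
    collectors: list = field(default_factory=list)  # list[Callable[[], bool]]

    def evaluate(self):
        evidence = [collect() for collect in self.collectors]
        return all(evidence), evidence

# Hypothetical collector: today it queries a legacy scanner's API,
# in 18 months it queries whatever replaced it. The control never changes.
def legacy_scanner_clean() -> bool:
    return True   # stub: call the current tool's API here

vuln_control = Control(
    objective="No critical vulnerabilities in production images",
    collectors=[legacy_scanner_clean],
)
passed, evidence = vuln_control.evaluate()
```

When the toolchain churns, you rewrite collectors, not your framework, your mappings, or your audit history.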
Where Do We Go from Here?
At the end of the day, this week crystallized something for me. The AI security landscape is not evolving along one dimension. It is fracturing along multiple dimensions simultaneously, and each one requires a different kind of response.
Model distillation means we need detection at the API layer that can spot extraction patterns in aggregate. The developer toolchain becoming an attack surface means we need to extend security perimeters to cover AI-powered development tools. New jailbreak techniques mean our controls cannot rely on the model policing itself. And the Pentagon situation means governance frameworks need resilience against political pressure, not just technical threats.
The NIST AI Agent Standards Initiative RFI deadline is March 9. If you are in a position to respond, respond. These standards will shape how we govern AI agents for the next decade, and right now the people writing the standards need to hear from practitioners, not just vendors.
We are trailblazers on this. Nobody has figured it out yet. Everything we are trying to do is a new approach. And the pace is relentless; 770 AI threat intelligence items collected in just 14 days of February. But the alternative to building these frameworks in real time is letting the gap grow even wider. And honestly, that gap is already a canyon.