AI digest: Models making models
Chinese models bootstrap themselves, OpenAI buys Python tools, and uncertainty-aware systems get real.
We’re seeing AI systems start to build themselves, plus some clever work on making models admit when they don’t know things.
Chinese model helps develop itself
MiniMax released M2.7, which reportedly played an active role in its own development, using autonomous optimisation loops to improve its own training. That's significant: it's one of the first commercial examples of an AI system bootstrapping its own capabilities rather than just helping humans write code.
OpenAI snaps up Python toolmaker Astral
OpenAI acquired Astral, the company behind some of the most widely used Python development tools. They're clearly serious about dominating AI-powered coding, and the deal gives them influence over key parts of the Python toolchain. A smart move, considering how much AI development happens in Python.
Uncertainty-aware LLM systems get practical
Someone built a working implementation of an uncertainty-aware LLM system that estimates confidence in its answers and automatically does web research when it's unsure. This tackles one of the biggest problems with current AI systems: they sound confident even when they're wrong. The three-stage pipeline with self-evaluation could be genuinely useful in production.
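The digest doesn't include the actual implementation, but the described three-stage shape is easy to picture. Here's a minimal, hypothetical sketch: draft an answer, have the model self-evaluate its confidence, and fall back to web research below a threshold. All function names, the threshold, and the stub logic are illustrative assumptions, with placeholder functions standing in for real LLM and search calls.

```python
# Hypothetical three-stage uncertainty-aware pipeline (not the actual system):
# 1) draft an answer, 2) self-evaluate confidence, 3) research when unsure.

CONFIDENCE_THRESHOLD = 0.7  # assumed cutoff; a real system would tune this

def draft_answer(question: str) -> str:
    # Stand-in for an LLM call that drafts an initial answer.
    return f"Draft answer to: {question}"

def self_evaluate(question: str, answer: str) -> float:
    # Stand-in for asking the model to score its own answer in [0, 1].
    # Here we fake low confidence for questions about recent events.
    return 0.4 if "2025" in question else 0.9

def web_research(question: str) -> str:
    # Stand-in for a search-and-summarise step.
    return f"Researched answer to: {question}"

def answer_with_uncertainty(question: str) -> dict:
    answer = draft_answer(question)
    confidence = self_evaluate(question, answer)
    researched = confidence < CONFIDENCE_THRESHOLD
    if researched:
        # Low confidence: replace the draft with a researched answer.
        answer = web_research(question)
    return {"answer": answer, "confidence": confidence, "researched": researched}
```

The interesting design choice is that research is conditional, so the system only pays the latency cost of a web search when the self-evaluation stage flags a weak answer.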
Safe ML deployment strategies laid out
New guidance lays out four controlled deployment strategies for ML models: A/B testing, canary releases, interleaved testing, and shadow testing. Nothing revolutionary here, but it's good to see proper engineering practices being documented as AI deployment becomes more mainstream.
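To make two of those strategies concrete, here's a hypothetical sketch of canary and shadow routing. The function names and structure are my own illustration, not from the guidance: a canary sends a small fraction of live traffic to the new model, while a shadow runs the new model on every request but only ever returns the old model's response.

```python
import random

def canary_route(request, old_model, new_model, fraction=0.05):
    # Canary release: a small fraction of live traffic hits the new model,
    # limiting the blast radius if it misbehaves.
    model = new_model if random.random() < fraction else old_model
    return model(request)

def shadow_route(request, old_model, new_model, log):
    # Shadow testing: the new model sees real traffic, but its output is
    # only logged for offline comparison; users always get the old model.
    response = old_model(request)
    log.append((request, new_model(request)))
    return response
```

The key operational difference: a canary exposes some users to the new model's output, so it needs fast rollback; a shadow never affects users, at the cost of running both models on every request.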