AI digest: infrastructure wars and safety roadblocks

This week’s mix of practical tools and existential handwringing shows AI companies walking the line between shipping fast and staying safe.

NVIDIA’s AITune solves the deployment headache

NVIDIA released AITune, an open-source toolkit that automatically picks the fastest inference backend for any PyTorch model. No more manual fiddling with TensorRT configurations or wondering which optimisation to use. This is exactly the kind of boring infrastructure work that actually matters when you’re trying to ship models at scale.

Anthropic keeps its best model locked away

Anthropic built Claude Mythos Preview, which found thousands of vulnerabilities across major operating systems and browsers, then decided not to release it publicly. Instead, they’re quietly handing it to tech companies through Project Glasswing. Smart move or overly cautious? Probably smart, given how quickly these capabilities are advancing.

OpenAI pitches infrastructure as its moat

OpenAI is telling investors that its early infrastructure buildout gives it a decisive advantage over Anthropic. Meanwhile, Anthropic just signed a multi-year deal with Coreweave for cloud services. The infrastructure wars are heating up, but betting against Anthropic’s ability to scale seems premature.

Alibaba tackles visual RAG complexity

Alibaba’s Tongyi Lab dropped VimRAG, a multimodal RAG framework that uses memory graphs to handle massive visual contexts. Visual data breaks traditional RAG approaches because it’s token-heavy and semantically sparse. This could be the breakthrough multimodal applications need to actually work reliably.

AI digest: Agents get serious 24 Apr AI digest: Agents go mainstream 23 Apr AI digest: Tools get serious 22 Apr

NVIDIA’s AITune solves the deployment headache

Anthropic keeps its best model locked away

OpenAI pitches infrastructure as its moat

Alibaba tackles visual RAG complexity

Related