AI news, digests, and updates from the SOFT CAT pipeline.
OpenAI ships GPT-5.5 for autonomous work, Google fixes distributed training, and agents start learning from their mistakes.
Big tech companies are finally shipping agent platforms while new open source models challenge the frontier labs.
OpenAI and Hugging Face ship proper developer tools while the industry moves beyond chat demos.
Open-source agent swarms challenge closed models while governments get special AI access for cyber defence.
Cross-datacenter LLM serving, quantum AI models, and the real cost of model upgrades.
Major model releases from Anthropic and xAI, while research shows AI dependency might be happening faster than we thought.
OpenAI sheds side projects whilst Anthropic expands into design, plus major funding moves and enterprise tooling updates.
OpenAI and Anthropic battle for coding supremacy while enterprises throw serious money at AI infrastructure.
Major players ship agent tools while Google doubles down on multimodal AI.
Google turns Chrome into an AI workflow hub whilst Anthropic's Claude Mythos demonstrates concerning autonomous hacking abilities.
Compute shortages squeeze the AI industry while enterprises take careful steps towards agent adoption.
MiniMax opens up its platform with CLI tools and self-evolving models, while Meta explores neural computers that merge computation and memory.
Small models get smarter, researchers tackle memory bottlenecks, and AI security moves from theory to practice.
NVIDIA drops an auto-tuning toolkit, Anthropic sits on a vulnerability-finding model, and OpenAI plays the infrastructure card against competitors.
Meta releases its first closed-source frontier model while OpenAI halves Pro pricing and Anthropic limits cybersecurity capabilities.
Major breakthroughs in AI agent infrastructure, Google's automated research writing, and Z.AI's open-weight coding model.
Anthropic debuts security-focused Mythos model, Google ships offline dictation, and the usual chip wars heat up.
Compact vision encoders rival giants, AI writes GPU kernels, and OpenAI sketches out the post-work economy.
AutoAgent lets AI systems engineer themselves overnight, Netflix releases video object removal tech, and Microsoft admits Copilot is just for entertainment.
Netflix open-sources video editing AI, Anthropic discovers emotions in Claude, and subscription pricing hits reality.
Anthropic makes waves with a $400M biotech acquisition, new PAC, and desktop control features whilst OpenAI shuffles executives.
Google releases Gemma 4 under Apache 2.0, Microsoft launches three new models, and the battle for local AI heats up.
IBM targets document processing, Hugging Face standardises post-training, and Anthropic accidentally leaks source code across GitHub.
OpenAI raises massive funding, Google cuts video costs, small models punch above their weight, and Anthropic has another bad week.
Salesforce cracks voice latency while Americans lose faith in AI outputs despite using the tools more.
Major releases in agent infrastructure this week, plus OpenAI quietly kills Sora after six months.
Google clarifies AI crawling boundaries, lightweight agent frameworks mature, and Mistral enters the voice generation race.
NVIDIA builds proper RL infrastructure for agents while everyone else scrambles to make AI systems that actually work in practice.
Google and others push hard on real-time voice AI while practical deployment stories emerge.
Google's TurboQuant compresses AI memory by 6x whilst new web agents navigate with just screenshots.
Major advances in model efficiency clash with infrastructure growing pains as the AI boom hits practical limits.
LeCun tackles world model collapse, Meta builds self-improving agents, and Luma's new image model thinks before it generates.
New tools tackle AI agent fragmentation whilst big tech consolidates around custom chips.
Chinese models bootstrap themselves, OpenAI buys Python tools, and uncertainty-aware systems get real.
NVIDIA drops a compact MoE model, OpenAI consolidates its scattered product line, and Trump's AI framework hands Big Tech exactly what it wanted.
AI agents are escaping controlled environments with new tools for real-world integration, and a Meta security incident shows the risks.
Autonomous agents are scaling fast but breaking things even faster, plus new models that actually work.
Big moves in enterprise AI tooling as companies shift from experimentation to production deployment.
Mistral launches unified multimodal model, NVIDIA pushes trillion-dollar chip projections, and legal battles mount over AI training data.
Major infrastructure releases from IBM, LangChain, and Moonshot AI alongside enterprise governance tools and ByteDance's copyright troubles.
New coding workflows, open-source speech breakthroughs, and China's push for AI-powered one-person companies.
Big tech models hit roadblocks whilst AI agents push into professional research and real-world applications.
New agent protocols emerge while model makers face performance setbacks and market shifts.
NVIDIA drops a massive open-source model, Google unifies all media types in one embedding space, and AI agents are hacking corporate systems.
Meta buys an AI social network, Amazon blocks Perplexity's shopping bot, and everyone's building meta-agents that build other agents.
ByteDance drops a proper agent orchestrator, Anthropic squares off with the Pentagon, and everyone's suddenly worried about AI security testing.
Karpathy releases autonomous ML research tools whilst AI models tackle real production challenges.
OpenAI faces internal rebellion over Pentagon deals while researchers push forward with new AI architectures and enterprise applications scale rapidly.
OpenAI launches security agents, Anthropic fights Pentagon designation, and Microsoft ships compact multimodal reasoning.
OpenAI drops GPT-5.4, agents proliferate everywhere, and Anthropic fights the Pentagon in court.
Big tech rivalries heat up while new models and tools push capabilities forward.
Major model releases from Google and OpenAI, plus breakthrough memory systems for robots and new AI development tools.
OpenAI's Pentagon deal sparks user exodus while Alibaba ships tiny models and Cursor prints money.
OpenAI's Pentagon deal causes drama, Google speeds up LLM retrieval by 948x, and researchers show AI can bust anonymous accounts in minutes.
OpenAI signs Pentagon deals while Anthropic fights back, plus new research shows even frontier models decay in long conversations.
Microsoft tackles multi-horizon tasks while Nous fixes AI forgetfulness, plus Google speeds up image generation.
New architectures challenge transformer dominance while Chinese labs face theft accusations and AI agents start handling real work.
Chinese labs accused of data theft, OpenAI declares war on benchmarks, and voice bots spread lies with confidence.
Taalas ditches GPUs for hardwired AI chips, Pentagon summons Anthropic CEO, and ByteDance cracks the reasoning code.
The Model Context Protocol lets you plug tools into Claude like USB devices. Tested a few.
Ran both models through the same set of coding and reasoning tasks. Results were closer than expected.
Wired up Claude with tool calling to build a basic autonomous agent. Simpler than expected.
Moonshot AI's Kimi k1.5 quietly dropped, and it's genuinely impressive on long reasoning tasks.