Tools & Experiments

Google Auto-Diagnose

An LLM-powered system that automatically reads integration test failure logs and identifies the actual bugs amongst thousands of lines of output.

Anyone who’s spent hours hunting through integration test logs knows the pain. Sixteen different log files, thousands of lines each, and somewhere in there is the one line that explains why your build failed at 3am.

Google’s Auto-Diagnose tackles this exact problem. It’s an LLM system that reads through all those failure logs automatically and pinpoints the actual issue. No more grep-ing through endless stack traces or trying to remember which service logs matter for which failure mode.

The tool works by analysing the full context of test failures, not just error messages. It understands how different services interact and can trace problems back to their root cause across multiple log sources.

We’d love to see this kind of intelligent log analysis become standard tooling. The cognitive overhead of debugging distributed systems is enormous, and LLMs are particularly good at pattern matching across large amounts of unstructured text. This feels like exactly the sort of practical AI application that could save developers serious time.

No interactive tool for this one yet. Browse all tools for more.