1 post tagged developer-tooling.
Everyone's chasing 50% speedups with sparse kernels and custom CUDA code, but we're building a tower of optimisation hacks that breaks every time someone changes the model.