#model-optimisation
2 posts tagged model-optimisation.
Thoughts
Model compilers just turned optimisation into a black art nobody understands
Everyone's chasing 50% speedups with sparse kernels and custom CUDA code, but we're building a tower of optimisation hacks that breaks every time someone changes the model.
Scale killed the LLM star
The race for bigger models is over, and efficiency just won.