#performance
2 posts tagged performance.
Thoughts
Latency budgets are the new Moore's law
Voice interfaces are forcing us to optimise for milliseconds instead of parameters, and it's changing everything about how we build AI systems.
Reasoning phases are just expensive preprocessing with delusions of intelligence
Adding a 'thinking step' before generation is just prompt engineering disguised as architectural innovation.