In the rapidly evolving world of AI-powered coding, the emergence of exceptionally fast models like Codex Spark presents both opportunities and challenges for developers. Sarah Chieng, Head of Developer Experience at Cerebras, delivered a compelling talk titled "Fast Models Need Slow Developers" at AI Engineer Europe, highlighting how these advancements necessitate a fundamental shift in how developers interact with AI coding assistants.
Related startups
The Speed Revolution in AI Coding
Chieng introduced Codex Spark, a model capable of generating code at an astonishing 1,200 tokens per second, a significant leap from the 40-60 tokens per second seen in models like the Sonnet family or GPT-4. This dramatic increase in speed, she explained, is a result of optimizations across the entire AI inference stack, including hardware advancements and novel model architectures.
However, this speed also exposes the limitations of existing developer habits. "A lot of these bad habits that we had before that we're generating maybe 50 tokens per second of bad code," Chieng stated, "Unless we fix them, they're going to start generating 1,200 tokens per second of bad code." The core message is that simply having faster models isn't enough; developers must also adapt their approach to harness this power effectively.
Rethinking Developer Workflows for Speed
Chieng outlined several key strategies for developers to navigate this new era:
