Positron AI, founded by industry veterans Thomas Sohmers and Mitesh Agrawal, is challenging the conventional wisdom of AI hardware design, asserting that memory bandwidth, not raw compute power, has become the primary bottleneck for large language model inference. This critical insight, detailed by Sohmers and Agrawal on the Latent Space podcast, underscores their unique architectural approach to accelerating AI. Their innovation promises to deliver significantly more efficient and cost-effective solutions for the burgeoning AI landscape.
Thomas Sohmers, Co-founder and CTO, brought a rich background in semiconductor startups, having taped out his first chip at 19 and later serving as Principal Hardware Architect at Lambda Labs. Mitesh Agrawal, CEO, spent nearly a decade at Lambda Labs, driving growth from inception to a half-billion-dollar annual run rate, focusing on cloud operations and data center infrastructure. Their combined experience revealed a fundamental disconnect between prevailing hardware design and the actual demands of modern AI models.
