OpenAI and Broadcom have officially revealed Jalapeño, OpenAI's first custom-designed inference processor. This accelerator is engineered specifically for the demands of large language models (LLMs) and represents a significant step in OpenAI's ambition to build out its entire infrastructure stack.
The collaboration aims to deliver a multi-generation compute platform designed to make advanced AI faster and more accessible. Early tests indicate that the first-generation Jalapeño chip will offer substantially better performance per watt compared to existing state-of-the-art accelerators. This new OpenAI Broadcom LLM chip is built from the ground up, considering the needs of current and future LLMs across the industry.