OpenAI and Broadcom announce chip designed for LLM inference at scale
OpenAI and Broadcom announced Jalapeño, an ASIC designed from scratch for LLM inference at data center scale and developed in nine months using insights from OpenAI's model roadmap. Early testing shows substantially better performance per watt than the current state of the art, and deployment is targeted for the end of 2026. The chip is intended to reduce OpenAI's dependence on Nvidia and to enable vertical integration across its full stack.
Samuel Axon — Samuel Axon is the editorial lead for tech and gaming coverage at Ars Technica. He covers AI, software development, gaming, entertainment, and mixed reality. He has been writing about gaming and technology for nearly two decades at Engadget, PC World...