OpenAI and Broadcom announce chip designed for LLM inference at scale

7.3 relevance

OpenAI and Broadcom co-designing an LLM inference chip is highly novel and strategic for AI/ML infrastructure, directly matching the reader's interests.

AI/ML arstechnica.com

Two men hold a trophy-like chip display.

Summary

OpenAI and Broadcom announced Jalapeño, an ASIC designed from scratch for LLM inference at data center scale, developed in nine months using insights from OpenAI's model roadmap. Early testing shows substantially better performance per watt than current state-of-the-art, with deployment targeted by end of 2026. The chip aims to reduce OpenAI's dependence on Nvidia and enable vertical integration across its full stack.

Author

Samuel Axon — Samuel Axon is the editorial lead for tech and gaming coverage at Ars Technica. He covers AI, software development, gaming, entertainment, and mixed reality. He has been writing about gaming and technology for nearly two decades at Engadget, PC World...