Saturday, June 27, 2026
EN·DarkSubscribe
AI Infrastructure · News & Analysis
Commentary · trigger: OpenAI和Broadcom推出Jalapeño定制推理芯片,旨在将AI推理成本削减一半。

Broadcom and OpenAI's Jalapeño Chip Tests the Limits of Custom Silicon Speed and Scale

A nine-month development sprint to first silicon samples validates Broadcom's ASIC model and OpenAI's push for supply independence, but unproven production performance and mounting margin pressure leave the harder questions open.

The joint announcement of Jalapeño—a custom AI inference ASIC developed by OpenAI and Broadcom—represents one of the more substantive developments in the current AI infrastructure buildout, even accounting for the considerable hype that accompanies any chip launch in this cycle. Unveiled this week, Jalapeño is a full-reticle ASIC designed from the ground up for large language model inference. OpenAI and Broadcom claim it delivers roughly 50% lower inference cost than current GPU-based alternatives and is targeted at gigawatt-scale data center deployments before the end of 2026. Those claims are OpenAI's own characterizations rather than independently verified benchmarks, and production-scale silicon remains prospective; but the architectural intent is unambiguous: purpose-built silicon for inference economics at a scale that commodity GPUs were never designed to optimize.

What has drawn the most attention is the development timeline. Multiple reports citing both companies indicate Jalapeño reached silicon samples within nine months—a pace that compresses what has historically been a two-to-three-year development cycle for a complex ASIC. OpenAI says it used its own AI models to accelerate the design and verification process. If that claim is accurate at production scale rather than just in internal testing, it would represent a meaningful proof point for AI-assisted chip design as a genuine throughput multiplier, not merely an aspirational talking point. Broadcom's role as the design and manufacturing partner is entirely consistent with its established positioning: over the past several years, the company has quietly built what many industry observers consider the most capable custom ASIC delivery operation serving hyperscalers, providing the silicon engineering depth that most frontier AI labs lack internally.

Broadcom's arrival at this point was neither accidental nor swift. The company redomiciled from Singapore to the United States in 2018 after a CFIUS review blocked its proposed takeover of Qualcomm, a move that established its American corporate identity at the precise moment AI infrastructure spending began its long acceleration. The $18.9 billion acquisition of CA Technologies that same year, followed by the landmark $61 billion purchase of VMware completed in 2023, expanded Broadcom into enterprise software while its semiconductor business concentrated on connectivity and bespoke chips for hyperscalers. The results of that focus are now visible in the financials: Broadcom's fiscal year 2025 revenue reached $63.9 billion, and its AI-related semiconductor revenue—primarily custom chips and AI networking ASICs—is reportedly on pace to reach $16 billion in a single quarter, which would represent roughly a tripling from prior run rates. The company has also guided toward more than $100 billion in AI semiconductor revenue by 2028, a figure that implies continued broadening of the custom-silicon customer base well beyond its existing hyperscaler relationships. Its Jericho3-AI, a 5nm Ethernet switch targeting AI cluster interconnect, illustrates how Broadcom is pursuing this build-out simultaneously across compute ASICs and the networking fabric that binds large-scale GPU and ASIC pods together.

The market's reaction has provided a useful counterweight to uncritical enthusiasm. Despite the revenue acceleration, AVGO shares fell as much as 17–23% in the period surrounding recent earnings, with analysts attributing the selloff primarily to margin compression concerns rather than any doubt about top-line trajectory. Custom silicon engagements are engineering-intensive and often involve aggressive pricing to win flagship hyperscaler accounts, a combination that can compress returns even as revenues surge. Broadcom's own capital expenditure data reflects the investment intensity: SEC filings show the company spent $481 million in cumulative CapEx through the first two quarters of fiscal 2026 ending May 2026, compared to $244 million over the same period in fiscal 2025—a near-doubling year-on-year. A simultaneous packaging technology partnership announced with Applied Materials signals that Broadcom is also investing in the supply chain infrastructure required to support next-generation chip density, a necessary step but one that carries its own cost. Whether gross margins can expand as AI revenue scales, or whether sustained hyperscaler pricing pressure erodes them, remains the central financial question the company has yet to answer convincingly.

The Jalapeño chip also crystallizes a strategic realignment underway at OpenAI that mirrors moves already made by its peers. After years of near-total reliance on Nvidia's GPU ecosystem, OpenAI is hedging its supply risk through custom silicon—a path Google pioneered with TPUs, Amazon followed with Trainium and Inferentia, and Microsoft pursued with Maia. Choosing Broadcom as a design partner rather than building entirely in-house reflects both the complexity of delivering competitive ASIC silicon and OpenAI's apparent preference for speed-to-market over full vertical integration. Several sources describe Jalapeño as rivaling Nvidia's Blackwell and Google's TPU on LLM inference efficiency, though these are again OpenAI's own characterizations. Nvidia retains dominant share of AI training compute and significant inference share; displacing that position requires not just competitive silicon but the software ecosystem—CUDA libraries, runtime tooling, developer familiarity—that Nvidia has spent more than a decade building and that no single chip announcement can replicate.

Three concrete signals will determine how this partnership matures over the next 18 months. First, whether Jalapeño delivers on its claimed 50% inference cost reduction at genuine gigawatt-scale production by late 2026—the difference between a well-engineered prototype and a validated platform that reshapes procurement decisions. Second, Broadcom's gross margin trajectory over the next two to three fiscal quarters, which will reveal whether AI custom silicon is genuinely accretive at scale or whether the cost of winning landmark accounts like OpenAI is being absorbed quietly in margin dilution. Third, the depth of OpenAI's long-term commitment to Broadcom versus its stated intent to diversify across multiple semiconductor suppliers—a distinction that matters enormously for Broadcom's revenue visibility beyond the initial program. Jalapeño gives Broadcom a marquee validation and OpenAI a credible Nvidia hedge, but both parties have more to prove before this architecture becomes the infrastructure reference point the announcement implies.

Based on 78 archived reports · Broadcom
Broadcom and OpenAI's Jalapeño Chip Tests the Limits of Custom Silicon Speed and Scale · Slicast