Thursday, June 25, 2026
EN·DarkSubscribe
AI Infrastructure · News & Analysis
Commentary · trigger: OpenAI与博通推出Jalapeño自定义AI推理芯片,历时九个月开发,目标在2026年底前在

Broadcom and OpenAI's Jalapeño Chip Tests Whether Custom Silicon Can Redefine AI's Cost Economics

The Jalapeño announcement confirms Broadcom as OpenAI's primary silicon partner and marks a significant step in the industry's effort to move LLM inference off Nvidia's GPU platform, even as margin pressures and trade uncertainty cloud the company's near-term financial picture.

OpenAI and Broadcom on June 24, 2026 unveiled Jalapeño, a custom large-language-model inference processor that the two companies say can reduce LLM inference costs by roughly 50% relative to current GPU-based approaches. The chip, reportedly developed in nine months with OpenAI's own models accelerating the design and validation cycle, is targeted for gigawatt-scale deployment before the end of 2026. It is OpenAI's first internally designed silicon product — a milestone that reflects how far the AI lab has moved from being solely a software and model company. For Broadcom, the announcement is the latest and most concrete evidence of a partnership that was first reported in October 2025 as encompassing a 10-gigawatt commitment for custom AI chips and networking, and that had reportedly carried a headline deal value of $10 billion when its outlines were first disclosed in September 2025.

Broadcom's AI revenue trajectory provides the financial context for understanding why the company is a credible partner for such an undertaking. The company's AI-related semiconductor revenue reportedly surged 143% in its most recent reported period, and analysts tracking the June 2026 earnings cycle noted the figure was running on a pace to approach or exceed $16 billion in a single quarter — against a total fiscal 2025 revenue base of $63.9 billion across all segments. Broadcom has set a public target of more than $100 billion in AI semiconductor revenue by 2028, a figure that implicitly assumes demand for custom ASIC designs and purpose-built networking will remain structural rather than cyclical. That thesis has been reinforced by the company's move into Ethernet-based AI fabrics — a segment where it is co-developing technology with partners such as FuriosaAI — and by the June 2026 launch of its AI XPV platform, a broad infrastructure stack backed by Apollo and Blackstone that integrates custom networking, storage, and interconnect capabilities.

The foundation for Broadcom's current position was laid across several years of strategic partnership-building. The company's custom silicon practice grew partly through early collaboration with Google's TPU program, which established a design template for co-developed inference silicon at hyperscale. In April 2026, Broadcom announced multi-year supply agreements with both Google and Anthropic extending through 2031 — deals described at announcement as landmark commitments that effectively lock those companies into Broadcom's ASIC pipeline for the medium term. The September 2025 OpenAI partnership, now given a face with Jalapeño, brings a third major AI frontier lab into that customer set. For OpenAI, the move mirrors strategies already underway at Amazon with Trainium and Inferentia, at Google with the TPU series, and at Meta with its MTIA program: develop purpose-built inference silicon to reduce unit cost at scale and reduce structural dependence on Nvidia's pricing.

The financial picture carries a nuance that equity markets surfaced sharply in recent weeks. Despite triple-digit growth in AI revenue, Broadcom's stock fell — with reports citing declines of between 17% and 23% around its most recent earnings — as the prevailing concern shifted to margin compression in the custom silicon and AI networking supply chain. Capital expenditure data from SEC filings tells part of that story: cumulative capex in the first half of fiscal 2026 reached $481 million, nearly double the $244 million spent in the equivalent period of fiscal 2025, and already equivalent to 77% of the $623 million spent across all of fiscal 2025. A June 2026 SEC 8-K filing disclosed the pricing and completion of a cash tender offer for certain debt securities, indicating active balance sheet management running in parallel with the revenue expansion. The cost dynamics of serving gigawatt-class AI infrastructure do not invalidate the revenue growth narrative, but they complicate the translation from top-line momentum to bottom-line expansion.

Structural risks shadow the opportunity on multiple fronts. Trade policy remains unresolved: Broadcom's shares slid in May 2026 after a Trump-Xi summit concluded without an agreement on chip-related export restrictions, a concrete reminder that the company's supply chain — anchored in TSMC's advanced process nodes — is exposed to geopolitical disruption. On the competitive dimension, Nvidia CEO Jensen Huang has publicly argued that general-purpose GPU platforms remain irreplaceable because they offer flexibility across training, inference, and the emerging requirements of agentic workloads — a flexibility that custom ASICs sacrifice in exchange for cost efficiency on well-defined, stable inference tasks. That trade-off introduces model-architecture risk: if LLM designs evolve at a pace faster than a chip's useful production life, a nine-month design cycle can lose economic value more quickly than a general-purpose platform would. Applied Materials and Broadcom announced a joint packaging technology partnership in June 2026, which may partially address supply chain complexity, but the structural exposure to both regulatory and architectural change remains.

Three signals merit close attention over the coming quarters. First, margin trajectory: Broadcom's next two quarterly earnings reports will reveal whether the company can sustain AI revenue momentum toward its 2028 target while stabilizing or improving gross margins — the gap between stated ambition and current margin reality is the central financial question hanging over the AVGO thesis. Second, deployment confirmation for Jalapeño: a public announcement of volume production at OpenAI's gigawatt-scale facilities before the end of 2026 would validate both the chip design and the supply chain execution across Broadcom, TSMC, and packaging partners including Applied Materials. Third, trade and regulatory developments: any tightening of semiconductor export controls limiting access to TSMC's advanced nodes or to advanced packaging capacity would force a reassessment of the multi-year customer commitments — including the 2031 deals with Google and Anthropic — that currently underpin the revenue model. Broadcom has built a strategically important and increasingly validated position as the hyperscaler-friendly alternative to Nvidia's integrated stack; how durable that position proves depends as much on the regulatory and geopolitical environment as on the silicon itself.

Based on 49 archived reports · Broadcom
Broadcom and OpenAI's Jalapeño Chip Tests Whether Custom Silicon Can Redefine AI's Cost Economics · Slicast