AI Chips
Not investment advice. This page is for informational and educational purposes only. Sterling is not a registered investment advisor. Nothing here is a recommendation to buy, sell, or hold any security. Markets carry risk and you may lose money. Do your own research and consult a licensed professional before making any investment, securities purchase, or trading decision.

Thesis

Why this sector matters to investors right now. Structural, not market timing.

AI infrastructure has become the single largest capex line item in the global economy. Microsoft, Amazon, Alphabet, and Meta plan a combined 725 billion dollars of capex in 2026, up 77 percent from a record 410 billion in 2025, with roughly 75 percent (~450 billion) directed at AI infrastructure (Tom's Hardware citing analyst figures; CNBC February 6, 2026; Statista). NVIDIA Q4 fiscal 2026 (ending January 25, 2026) reported 68.1 billion in revenue, with data center revenue alone of 62.3 billion (up 75 percent year over year) and full-year fiscal 2026 data center revenue of 197.3 billion versus 115.2 billion the prior year (NVIDIA Q4 FY26 release). All four hyperscalers report being supply-constrained, not demand-constrained.

The investible structure splits into five layers: the merchant GPU duopoly (NVIDIA dominant, AMD ramping), the foundry oligopoly (TSMC dominant, Samsung struggling at advanced nodes, Intel Foundry re-emerging), the HBM memory triopoly (SK Hynix leading, Micron rising, Samsung lagging), the lithography monopoly (ASML), and the custom ASIC design layer (Broadcom plus Marvell) that builds the hyperscalers' internal alternatives to NVIDIA. Custom silicon shipments are projected to grow 45 percent in 2026 versus 16 percent for merchant GPUs (Broadcom and Marvell coverage via heygotrade and trefis), meaning the ASIC layer is taking share inside the larger pie. The compute supercycle is the structural story. The competitive question is which layer captures the value as the spend keeps expanding.

Structural drivers

Forces that shape long-run demand and economics. Each driver is sourced.
  • Hyperscaler capex is unprecedented and rising. 725 billion dollars combined 2026 capex across Microsoft (190 billion), Amazon (200 billion), Alphabet (190 billion), Meta (145 billion-plus), up 77 percent year over year (Tom's Hardware November 2025; CNBC February 6, 2026; Statista). All four report supply-constrained, not demand-constrained.
  • NVIDIA data center revenue confirms the demand. Q4 FY26 data center revenue of 62.3 billion, up 75 percent year over year, with networking up 263 percent year over year. Full-year FY26 data center 197.3 billion versus 115.2 billion prior year. Source: NVIDIA Q4 FY26 press release, February 2026.
  • Memory is the next-tightest bottleneck. The global HBM market is forecast to grow from 38 billion in 2025 to 58 billion in 2026 (Introl Blog). Microsoft attributed roughly 25 billion of its 190 billion 2026 capex specifically to rising memory and component costs (CNBC February 6, 2026).
  • TSMC N2 (2 nanometer) entered high-volume manufacturing in Q4 FY25 with capacity ramping toward 100,000 wafers per month by end of 2026 and 200,000 per month in 2027. Q4 FY25 advanced nodes (7 nanometer and below) were 77 percent of wafer revenue. Source: TSMC Q4 2025 earnings transcript and TrendForce.
  • ASML EUV shipments expand. 60-plus units in 2026 (versus 48 in 2025), including 10 High-NA EUV scanners at ~380 million each, with Intel targeting 14A node deployment in 2027. Source: ASML Q4 FY25 results and Q3 2025 Press Release; TrendForce July 2025.
  • Custom AI ASICs are growing faster than merchant GPUs. Projected 45 percent growth in 2026 versus 16 percent for GPU shipments. Broadcom AI revenue ~12 billion in FY25 with Q3 25 AI revenue 5.2 billion (up 63 percent year over year). Marvell data center revenue 6.1 billion in FY ending February 2026. Source: heygotrade; Broadcom Q3 FY25 earnings.
  • Compute platform consolidation reduces customer optionality at the silicon layer. Anthropic alone has committed to up to 1 million TPUs via Google Cloud and is also a major Trainium2 customer at AWS (CNBC November 21, 2025). Hyperscaler workloads are becoming the primary buyers of advanced silicon.

Structural risks

Forces that could compress demand, change economics, or break the thesis.
  • Hyperscaler capex digestion risk. 725 billion of 2026 capex implies revenue scaling that has to materialize. Fortune (April 30, 2026) and CNBC (February 6, 2026) reported the cash impact is starting to show in free cash flow profiles. A multi-quarter air pocket in AI infrastructure spending would compress every layer simultaneously.
  • China export controls remain volatile. US restrictions on the H20 were tightened in 2025 (requiring licenses) and then partly relaxed (licenses granted). Huawei has plans to double Ascend output in 2026, with US Commerce estimating 200,000 Chinese AI chips produced in 2025 versus Huawei sources at 300,000 to 400,000 (CFR; Council on Foreign Relations; SemiAnalysis). National security policy can swing demand on short notice.
  • TSMC concentration and Taiwan exposure. TSMC sits at the structural choke point of advanced logic. Any disruption (cross-strait, infrastructure, equipment-export licensing) would cascade through the entire stack. Geopolitical risk is unhedged in the current valuation of the sector's compute layer.
  • HBM supply is concentrated and qualification-dependent. SK Hynix held 57 to 62 percent HBM market share through 2025; Samsung fell from 41 percent in Q2 2024 to 17 percent in Q2 2025 as it struggled to pass NVIDIA's HBM3E qualification, partly recovering to 22 percent in Q3 2025. Single-supplier exposure to NVIDIA's roadmap is real (Astute Group; Counterpoint Research).
  • Memory cost inflation is the hidden line item. Microsoft's 2026 capex outlook explicitly attributed 25 billion to memory and component costs. If HBM and DRAM prices keep rising, even at constant unit demand, hyperscaler returns compress (CNBC February 6, 2026).
  • Custom silicon erodes merchant GPU TAM. Broadcom holds roughly 70 percent of the custom AI accelerator design market (Google TPU, OpenAI's custom chip, Meta MTIA). Marvell ASIC revenue projected at 9 to 11 billion in 2026 (AWS Trainium, Microsoft Maia, Meta DPU). AWS has deployed 500,000-plus Trainium2 chips; Google's Ironwood (TPU v7) shipped November 2025. If hyperscalers internalize more inference and training, NVIDIA growth slows even if total compute keeps rising.
  • Samsung Foundry has not won meaningful leading-edge AI silicon customers, leaving TSMC as a single source for most advanced AI chips. Intel Foundry has Microsoft Maia 3 on 18A (Tom's Hardware) and possible discussions with Google and Apple, but the timeline is multi-year.
  • ASML China revenue concentration is unwinding. China fell to 19 percent of ASML system sales in Q1 2026 from 36 percent in Q4 2025 (ASML Q1 2026 results). ASML still guides total 2026 sales of 36 to 40 billion, flat to modestly higher than 2025, supported by non-China EUV demand. A faster-than-expected China step-down would change the equipment-cycle thesis.

Competitive landscape

How to think about the players. Framing along axes (pure play vs diversified, incumbent vs challenger, etc). Not stock picking.

The investible sector sorts into six layers, each with different economics.

1. Merchant accelerators. NVIDIA is the runaway leader: Q4 FY26 data center 62.3 billion (+75 percent), Blackwell in volume production since Q3 FY26, Rubin platform upcoming. AMD Instinct revenue is ramping (MI350 launched 2025; MI400 in 2026 with 432 GB HBM4); AMD targets data center 60-plus percent annual growth and tens of billions in AI revenue by FY27. AMD is projected to take 12 to 15 percent of AI accelerator share by Q4 2026 (Tech Insider).

2. Custom AI silicon designers. Broadcom is the dominant design partner (Google TPU, OpenAI custom chip, Meta MTIA), with ~70 percent of custom ASIC design and a guided 60 to 90 billion serviceable AI opportunity from three hyperscalers by FY27. Marvell is the second-source design partner (AWS Trainium, Microsoft Maia, Meta DPU, Google Axion ARM CPU). Custom silicon volume growing 45 percent in 2026 (heygotrade; The Next Platform).

3. Hyperscaler internal silicon. Google TPU v7 'Ironwood' (November 2025) is the most mature internal stack; Anthropic up to 1 million TPU commitment. AWS Trainium2 has 500,000-plus chips deployed. Microsoft Maia is still internal-only; Maia 3 will tape out at Intel Foundry on 18A. Meta MTIA continues with Broadcom. OpenAI is building custom silicon with Broadcom. These programs reduce hyperscaler dependence on NVIDIA but require multi-year ramp.

4. Foundries. TSMC dominates leading-edge logic: N2 in HVM Q4 FY25, N2 capacity ramping to 100,000 wafers per month by end 2026, A16 in 2H 2026. Samsung Foundry lags at advanced nodes for AI silicon. Intel Foundry 18A wins (Microsoft Maia 3 'Griffin'); Google and Apple are in discussion. SMIC is stuck at 7 nanometer (Council on Foreign Relations).

5. HBM and memory. SK Hynix leads (57 to 62 percent HBM share through 2025), reached HBM4 development completion. Micron gained share (21 percent Q2 2025) shipping HBM3E and sampling HBM4 at 11 Gbps. Samsung struggled with HBM3E qualification, lost share from 41 percent (Q2 2024) to 17 percent (Q2 2025), partly recovered to 22 percent (Q3 2025). UBS projects SK Hynix at ~70 percent share of HBM4 for NVIDIA's Rubin platform.

6. Equipment. ASML holds the EUV lithography monopoly: 60-plus EUV scanners planned 2026, 10 High-NA EUV at 380 million each, Intel 14A node a key customer for High-NA. Applied Materials, Lam Research, KLA, and Tokyo Electron supply etch, deposition, and process control.

7. China alternatives (not investable for most international capital). Huawei Ascend 910B and 910C produce more than the H20 on raw compute and energy but lag on HBM bandwidth. SMIC is the captive logic supplier. Stockpiled foreign HBM remains a constraint (SemiAnalysis; CFR).

Cross-cutting framing: every layer is currently benefiting from the same demand wave. The differentiated bets are within layers (who in HBM, which custom-ASIC designer, which foundry node). The macro question of when (not whether) hyperscaler capex normalizes is the single biggest cross-layer risk.

Key metrics to watch

The operational and financial metrics that matter most in this sector. Each one names its source and update cadence.
MetricSourceFrequencyWhy it matters
Hyperscaler combined capex (Microsoft, Amazon, Alphabet, Meta)Quarterly 10-Q filings and earnings calls; consolidated by S&P Global, Statista, CreditSights, FuturumQuarterlyThe single most important demand-side metric for the entire sector. 725 billion 2026 capex is the baseline; any guide-down across the four would compress every layer.
NVIDIA Data Center revenue and Compute versus Networking mixNVIDIA quarterly press releases and 10-QQuarterlyDefines the top line of merchant-GPU economics. Q4 FY26 data center 62.3 billion plus networking 11 billion is the current bar. Mix toward networking is the early signal of system-level (not just chip-level) buying.
AMD Data Center segment revenue and Instinct mixAMD quarterly earnings (10-Q and CFO commentary)QuarterlyCleanest read on whether AMD is taking share inside the GPU duopoly. AMD's Q4 FY25 Data Center revenue was 5.4 billion (+39 percent); MI350 versus MI400 commentary indicates the trajectory.
TSMC monthly revenue and advanced-node wafer mixTSMC monthly revenue release and quarterly earningsMonthly revenue, quarterly node mixLeading indicator for the entire foundry-driven supply chain. Q4 FY25 advanced nodes were 77 percent of wafer revenue (N3 at 28 percent, N5 at 35 percent). N2 ramp pace through 2026 is the central foundry-cycle metric.
HBM market share and shipment volume by supplierCounterpoint Research, TrendForce, supplier earnings (SK Hynix, Micron, Samsung)QuarterlySingle biggest near-term constraint on AI chip output. SK Hynix versus Samsung qualification status with NVIDIA, plus Micron share gains, are the variables to watch.
ASML EUV bookings and High-NA EUV deliveriesASML quarterly press releases and capital-markets-day disclosuresQuarterlyLeading indicator for foundry investment intent 12 to 18 months out. 60-plus EUV planned 2026 versus 48 in 2025 is the baseline; bookings (not just shipments) signal forward demand.
Custom AI ASIC revenue (Broadcom, Marvell)Broadcom and Marvell quarterly press releases and CFO commentary on AI-specific revenueQuarterlyDirect measure of hyperscaler in-house silicon traction. If custom-ASIC revenue grows faster than NVIDIA's data center revenue, the structural shift away from merchant GPU is accelerating.
China AI chip production volume and US export-control changesUS Department of Commerce announcements; CFR, SemiAnalysis, Rhodium Group reportingContinuous (event-driven), with quarterly synthesisGeopolitical wildcard. Each policy turn changes which suppliers can sell into which markets and at what license cost. Huawei's stated plan to double Ascend output in 2026 is one of several signals.

Catalysts and milestones

Known upcoming events that could move the sector. Dated where possible.
  • NVIDIA Rubin platform launch (post-Blackwell). Uses HBM4 from SK Hynix (projected ~70 percent share). Volume timing through 2026. Source: NVIDIA roadmap; UBS via Astute Group.
  • AMD MI400 series ramp through 2026 with 432 GB HBM4. AMD targets 20-billion-dollar data center GPU revenue annual run rate over two years (Digitimes). Source: AMD investor day November 2025.
  • TSMC N2 monthly capacity reaching ~100,000 wafers by end of 2026 and A16 volume production in 2H 2026. Source: TSMC Q4 2025 earnings; TrendForce April 27, 2026.
  • ASML High-NA EUV deliveries (10 in 2026). Intel using High-NA for 14A in 2027. Source: ASML Q3 2025 release; TrendForce July 17, 2025.
  • Hyperscaler 2027 capex guidance announcements through 2026 earnings cycle. Confirmation or moderation of the 725 billion 2026 base. Source: Microsoft, Amazon, Alphabet, Meta quarterly earnings.
  • SK Hynix HBM4 mass production once qualification completes. Samsung HBM3E and HBM4 qualification with NVIDIA. Source: SK Hynix newsroom; Samsung investor releases.
  • US export-control updates on China AI chips. H20 license posture, allowed memory configurations, and HBM export rules. Source: US Department of Commerce; Rhodium Group analysis.
  • Intel Foundry 18A external customer announcements. Microsoft Maia 3 'Griffin' is confirmed; Google and Apple in discussion. Source: Tom's Hardware November 2025; Semicon Electronics.
  • EU Chips Act and US CHIPS Act award disbursements and milestone-based payouts through 2026. Influences where new capacity lands.
  • Anthropic, OpenAI, and other AI lab compute commitments. Anthropic's stated up-to-1-million-TPU commitment is a single contract that affects Google TPU economics.

What would change the view

Conditions or evidence that would invalidate the thesis or materially shift the risk picture.
  • Hyperscaler capex pauses or contracts. If Microsoft, Amazon, Alphabet, or Meta materially guides down 2027 capex versus 2026, the supercycle thesis weakens immediately and every layer reprices.
  • An AI compute breakthrough reduces training and inference cost-per-token by an order of magnitude beyond expected algorithmic gains. Could shift demand from training to inference, change accelerator mix, and pressure NVIDIA pricing power.
  • Custom ASIC share crosses 50 percent of accelerator TAM. If Broadcom plus Marvell plus internal hyperscaler programs collectively exceed half the dollar value of AI accelerators shipped, the merchant-GPU thesis structurally weakens. Current custom-versus-GPU split is well below 50 percent.
  • China achieves sub-7 nanometer logic at scale domestically. Would remove a binding constraint on Huawei Ascend and weaken US leverage. Watch SMIC public disclosures and Huawei chip teardowns.
  • A geopolitical event affecting Taiwan. The single biggest unhedged risk. Watch shipping insurance rates, strait-transit volumes, and US-China policy posture.
  • HBM oversupply emerges. If Samsung passes qualification and Micron accelerates HBM4 sampling and shipments, HBM ASPs could compress materially, helping accelerator margins but hurting memory-supplier margins.
  • NVIDIA loses a major hyperscaler design slot. Watch for any major hyperscaler announcement that internal silicon replaces a previously-NVIDIA workload at scale.
  • ASML loses meaningful 2026 or 2027 EUV bookings. The leading-indicator metric for the equipment cycle. Sustained book-to-bill below 1 would signal an inflection.

What we are not covering

Sub-areas, technologies, or companies we are deliberately excluding from the analysis, and why.
  • PC and consumer CPUs (Intel client, AMD Ryzen client). Different demand cycle, different competitive dynamics. Covered in adjacent broader semiconductors coverage if and when added.
  • Smartphone and mobile SoCs (Qualcomm Snapdragon, MediaTek Dimensity, Apple A-series). Different end market with handset-cycle exposure.
  • Standalone networking silicon below the data-center fabric layer (Cisco, Arista merchant silicon, NIC chips beyond DPUs). The DPU layer (Broadcom, Marvell, NVIDIA networking) is in scope; lower-layer Ethernet silicon is not.
  • Power semiconductors (silicon carbide and gallium nitride for power conversion). Important for AI data centers but separate market with different drivers.
  • Analog, mixed-signal, and microcontroller suppliers (Texas Instruments, Analog Devices, Microchip, NXP, STMicro). Industrial and automotive end-market dynamics not relevant to AI compute.
  • Display drivers and image sensors. Different end markets.
  • Wafer-level material commodities and consumables (silicon wafers, photoresist, specialty gases). Adjacent but operates as commodity-cycle businesses.
  • Quantum and neuromorphic computing. Adjacent technology with much smaller current commercial footprint. Covered in separate Quantum sector when launched.

Sources

Primary sources cited in this analysis. Links open in a new tab.

Audit trail

Record of the last review and what changed. Required on every refresh.
Last reviewed: 2026-05-14
Change log
  • 2026-05-14Initial publication. All eight required SOP components populated using sourced 2025 full-year and 2026 year-to-date data. Primary sources: NVIDIA Q4 FY26 press release (February 2026), TSMC Q4 2025 earnings transcript, ASML Q3 2025 press release and Q1 2026 results, AMD Q4 FY25 earnings, Counterpoint Research and TrendForce HBM share data, Broadcom and Marvell quarterly disclosures, CNBC and Tom's Hardware coverage of hyperscaler capex, NVIDIA newsroom on DRIVE/Drive Thor, Mobileye and other adjacent OEM disclosures, Council on Foreign Relations and SemiAnalysis coverage of China and Huawei, Rhodium Group on export-controls. All sources accessed 2026-05-14.
Unresolved questions
  • Confirm whether hyperscaler 2027 capex guidance materially exceeds, matches, or steps down from the 2026 725-billion-dollar baseline as Q3 and Q4 2026 earnings disclose forward views.
  • Samsung HBM4 qualification timeline with NVIDIA. If Samsung passes qualification, share-recovery thesis activates. If it fails, SK Hynix dominance hardens.
  • Intel Foundry 18A yield and customer ramp. Microsoft Maia 3 is the first major external workload; Google and Apple are in discussion. Each conversion (or non-conversion) materially repositions Intel Foundry.
  • TSMC N2 customer mix beyond Apple. AMD, NVIDIA, and the hyperscaler custom ASICs are the demand engines for advanced foundry capacity; allocation between them determines who can ship at peak performance through 2027.
  • China AI chip output reality in 2026. US Commerce projected 200,000 in 2025 versus Huawei sources at 300,000 to 400,000. Actual production is opaque. The number affects whether US export controls are working.
  • Cash flow versus accrual capex. Hyperscalers report capex on different bases (announcements, accrual, cash). Watch free cash flow and debt issuance to confirm announced capex is actually deployed.

Ask Sterling

Register for a premium account to gain access to Sterling AI.

Get Started

Things you can ask Sterling:

Summarize Tesla's latest earnings reportWhy did NVIDIA's margins expand?Compare Apple vs Microsoft's cash flowWhat's driving the EV sector growth?
AI Chips - Strategy | Sterling