| Infrastructure layer | Thesis | Names to watch |
|---|---|---|
| Networking — east-west fabric | Agentic shifts traffic inside the DC. Backend NIC + switch silicon scales with port count, not GPU FLOPS. | ANET (Tomahawk/Jericho merchant silicon), AVGO, MRVL (custom-ASIC tailwind) |
| Networking — "scale-across" (DC-to-DC) | Brand-new category NVIDIA just named. Long-haul + metro fiber moves from optional to mandatory once GPU clusters exceed single-site power. | CIEN (TD Cowen "No DC Is An Island"), DY (JPM fiber-build read), GLW as fiber substrate |
| Networking — optical in-rack | 400G → 800G → 1.6T transceivers; co-packaged optics is the credible next step when copper runs out of room. | COHR, LITE, FN; Celestial AI (private — optical interconnect for new memory tier) |
| KV-cache offload tier (CMX) | Brand-new line item. Bluefield-4 + DRAM + NAND SSDs holding "warm context" between HBM and bulk storage. Didn't exist nine months ago. | NVIDIA Bluefield; MU / SK Hynix / Samsung (HBM + GDDR7 + NAND); Solidigm, PSTG for SSD layer |
| CPU pull-through | Per-token economics are GPU-led; per-cluster capex is not. Agents pull CPU for orchestration, RAG, KV-spill management. | AMD (server CPU), INTC (Xeon 6+); Arm-custom angle via MRVL (AWS Graviton silicon), ARM licensing |
| Disaggregated prefill/decode | Rubin CPX puts GDDR7 onto AI-server bills alongside HBM4 — first time commodity memory enters the inference TAM. | Same memory names — but GDDR7 is a new line, incremental volume for MU / SK Hynix not in prior DC mix |
| Voice/agent application stack | Voice agents reroute call audio to 3 new endpoints per turn (STT/LLM/TTS) on sub-200 ms latency budget. | ElevenLabs, Deepgram (private); TWLO ConversationRelay as orchestration layer |
| Vector DB + indexing | RAG and codebase indexing are persistent new infrastructure layers — Cursor indexes every customer codebase. | Pinecone, Weaviate, Qdrant (all private); MDB Atlas Vector as public adjacency |
| Agent security | OWASP LLM06 ("Excessive Agency") is a real category; agent-driven egress + supply-chain risk through MCP servers. | PANW, ZS, CRWD; Protect AI, Lasso private-side |
| Model labs themselves | If SemiAnalysis is right, the labs are the highest-margin link — not the wrappers (Cursor/Devin run negative GM today). | Anthropic, OpenAI secondaries; wrapper exposure is the wrong end of the bar-bell |
The two bull cases stack rather than cancel:
Capex pool is 1.5–1.9× consensus → networking, fiber, CPU, KV-tier memory all under-counted in published TAMs.
Model-lab margins keep widening → value capture concentrates with Anthropic / OpenAI; wrapper/app layer is a trap.