industry analysis

On CPO Pt 4 | Today's Major CPO Products

A look at Nvidia (Quantum-X / Spectrum-X) and Broadcom (Humboldt / Bailly / Davisson) production CPO product lineups, Nvidia's generation-by-generation adoption timeline, and a comparison of the two vendors' entry paths.

On CPO

8. Today's Major CPO Products

The vendors actually driving the production CPO roadmap are essentially two — Nvidia and Broadcom.
Both ultimately converge on the TSMC COUPE platform, but their paths to get there and their architectural choices differ sharply.

(1) Nvidia (NASDAQ: NVDA)

Switch	Standard	Launch	Aggregate BW	OE BW
Quantum X800-Q3450	InfiniBand	2H 2025	115.2 Tbps	1.6 Tbps
Spectrum 6810	Ethernet	2H 2026	102.4 Tbps	3.2 Tbps
Spectrum 6800	Ethernet	2H 2026	409.6 Tbps	3.2 Tbps

Nvidia's CPO roadmap splits in two phases. Quantum-X is a first-gen product aimed at supply-chain validation; Spectrum-X is closer to a more aggressive architectural bet.

1) Quantum X800-Q3450

Quantum X800-Q3450 — Nvidia's first-gen production CPO switch.

Standard: InfiniBand
Launch: 2H 2025
Aggregate BW: 115.2 Tbps
OE BW: 1.6 Tbps (Gen 1)
Architecture: 4 monolithic ASICs, multi-plane

Multi-plane architecture

The crux of Quantum-X is multi-plane architecture.

Each 800G physical port splits into 4 × 200G lanes, with each lane connecting to a different ASIC.
Data is sprayed across all four ASICs and recombined at the destination port.
The result reaches the same maximum cluster size as four discrete 28.8T switches (746,496 GPUs at 3 tiers), but the shuffle happens inside the box, dramatically reducing external fiber complexity.

Inside the package

Each of the four ASICs is surrounded by six detachable optical sub-assemblies, with each sub-assembly containing three optical engines.
- Per ASIC: 18 OE × 1.6 Tbps = 28.8 Tbps.
- Across the package: 4 ASIC × 18 OE = 72 OE, 4 × 28.8T = 115.2 Tbps aggregate.
The detachable sub-assembly matters — technically it leans closer to NPO and lowers the serviceability burden vs. full CPO.

The 200G MRM breakthrough

Quantum-X's most important engineering claim is that each MRM operates at 200G PAM4 (8 modulators × 200G = 1.6 Tbps per engine).
That matches the fastest MZMs and overturns the industry assumption that "MRM is stuck at NRZ."
It is an engineering achievement by Nvidia + TSMC, and it simultaneously justifies the MRM-centric strategy.

Process-node split

The PIC is fabricated on N65 (a mature node). Optical components gain little from scaling and tend to perform more stably at larger geometries.
The EIC is fabricated on N6 (an advanced node). For driver, TIA, and control logic, the density and power-efficiency benefits of advanced nodes are clear-cut.
The two dies are connected at sub-micron interconnect granularity via TSMC COUPE's hybrid bonding.

Cooling

Two copper cold plates sit on top of the ASIC in a closed-loop liquid-cooling configuration.
It isn't only for ASIC thermals; it doubles as a mechanism to keep the temperature-sensitive MRM inside its narrow operating window.

Quantum-X implications

Quantum-X is a supply-chain pipe-cleaner.
Detachable sub-assemblies, smaller monolithic ASICs, and a relatively conservative 1.6T OE all make it a fault-tolerant first-gen product.
If something fails, swap the sub-assembly. Nvidia gathers real-world reliability data without committing to a full-CPO architecture.

2) Spectrum-X Photonics

Spectrum-X Photonics — Nvidia's second-gen CPO switch (Spectrum 6810 / 6800).

Standard: Ethernet
Launch: 2H 2026
Aggregate BW: 102.4 Tbps (6810) / 409.6 Tbps (6800)
OE BW: 3.2 Tbps (Gen 2)
Architecture: Multi-chip module (MCM)

Spectrum-X has a fundamentally different architecture from Quantum-X. Where Quantum-X is a multi-plane configuration of four monolithic ASICs, Spectrum-X is built around a much larger reticle-class multi-chip module (MCM).

The 6810 is a single-package product using one MCM.
The 6800 stitches together four 6810 MCMs (Spectrum-6 + 8 SerDes chiplets + OE set) inside a single chassis with built-in inter-package fiber shuffle, delivering 4× aggregate bandwidth.

Why MCM matters

The central Spectrum-6 switch ASIC is a 102.4 Tbps reticle-class die.
Around it sit eight SerDes I / O chiplets (two on each side), each handling 64 × 200G lanes for 12.8T.
This separation lets the design allocate far more shoreline to I / O than monolithic — the core reason for the 4× per-package bandwidth vs. Quantum-X.

3.2 Tbps optical engine (Gen 2)

Each OE: 16 optical lanes × 200G.
The package carries 36 OEs, but only 32 are active; the remaining 4 are redundancy.
Unlike Quantum-X, the OEs are soldered to the substrate, so in-place replacement isn't possible. If an OE fails in the field, a spare OE has to take over.
Each I / O chiplet feeds 4 OEs (12.8T per chiplet → 4 × 3.2T, a clean fan-out).

6810 vs. 6800

Spec	Spectrum 6810	Spectrum 6800
Aggregate BW	102.4 Tbps	409.6 Tbps
MCM count	1	4
Physical ports	128 × 800G	512 × 800G
OE attachment	Soldered (true CPO)	Soldered (true CPO)
External lasers	16 ELS	64 ELS

Spectrum-X implications

Spectrum-X is an architectural bet.
Soldered OEs (no field swap), 3.2T engines, MCM packaging accompanied by 8 SerDes chiplets — a far more aggressive design.
The design is premised on the supply chain Quantum-X is currently building.
The four redundant OEs are essentially an architectural acknowledgement that "in the field, some fraction will fail, but in-place fixes are hard."
200G MRM is the foundational technology for both products and locks Nvidia onto the TSMC COUPE platform.

3) Quantum-X vs. Spectrum-X side by side

Feature	Quantum X800-Q3450	Spectrum 6810	Spectrum 6800
Standard	InfiniBand	Ethernet	Ethernet
Launch	2H 2025	2H 2026	2H 2026
Architecture	4 monolithic ASICs (multi-plane)	1 MCM	4 MCMs in chassis (with fiber shuffle)
Per-ASIC / package BW	28.8 Tbps per ASIC	102.4 Tbps	102.4 Tbps × 4
Aggregate BW	115.2 Tbps	102.4 Tbps	409.6 Tbps
OE bandwidth	1.6 Tbps (Gen 1)	3.2 Tbps (Gen 2)	3.2 Tbps (Gen 2)
OE count	18 per ASIC × 4 = 72	36 (32 active)	36 (32 active) × 4
OE attachment	Detachable (NPO-like)	Soldered (true CPO)	Soldered (true CPO)
Physical ports	144 × 800G	128 × 800G	512 × 800G
External lasers	18 ELS	16 ELS	64 ELS

4) What the roadmap implies

Quantum-X is the supply-chain pipe-cleaner.
- Detachable sub-assemblies, smaller monolithic ASICs, and a relatively conservative 1.6T OE make it fault-tolerant as a first-gen product.
- Failures get fixed by swapping the sub-assembly. Nvidia accumulates real-world reliability data without committing to a full-CPO architecture.
Spectrum-X is the architectural bet.
- Soldered OEs (no field swap), 3.2T engines, MCM packaging with 8 SerDes chiplets — a much more aggressive design.
- The design depends on the supply chain Quantum-X is now building. The four redundant OEs are essentially an architectural acknowledgement that "in the field, some fraction will fail, but in-place fixes are hard."
200G MRM is the foundational technology for both products.
- Without that breakthrough, neither product hits its bandwidth target.
- At the same time, Nvidia is the first to ship 200G PAM4 MRM in production CPO — locking it onto the TSMC COUPE platform.

(2) Broadcom (NASDAQ: AVGO)

Broadcom is the longest-running production CPO vendor, having shipped three generations over four years.

Generation	Switch ASIC	Announced	Aggregate BW	OE BW	# OEs	Key shift
Humboldt (TH4)	Tomahawk 4	2021	25.6 Tbps (half-electrical)	3.2 Tbps	4	Proof of concept
Bailly (TH5)	Tomahawk 5	OFC 2023 (2024 production)	51.2 Tbps (all-optical)	6.4 Tbps	8	Full CPO + CMOS EIC + FOWLP
Davisson (TH6)	Tomahawk 6	October 2025	102.4 Tbps	6.4 Tbps	16	TSMC N3, 200G / lane

It's worth noting that all three generations are scale-out Ethernet switches. Broadcom has not yet entered InfiniBand or scale-up CPO production.

1) Generation 1: Humboldt (TH4) — Proof of Concept

Humboldt (TH4) — Broadcom's first-gen production CPO switch.

Switch ASIC: Tomahawk 4
Announced: 2021
Aggregate BW: 25.6 Tbps (12.8T pluggables + 12.8T via 4 × 3.2T OE)
OE BW: 3.2 Tbps × 4

Hybrid design

First-gen Humboldt was a hybrid design, splitting bandwidth in half between traditional electrical I / O and CPO.

Why hybrid? A ToR switch needs both short copper links to servers (electrical) and long uplinks to spine switches (optical) at the same time. Splitting the switch this way let Broadcom validate CPO without committing the whole system to it.

Key technical choices

PIC + SiGe EIC stacked via TSV process.
SiGe carries a power penalty of about 3 pJ / bit vs. CMOS — one of the biggest reasons the next generation (Bailly) moved the EIC to CMOS.
Roughly 250 optical components per OE — far fewer than the ~1,000 of subsequent generations.

Humboldt implications

Humboldt proved CPO works in a real Ethernet switch.
It also exposed the SiGe EIC's power penalty, giving Broadcom a clear motivation to move to 7 nm CMOS in the next generation.
It's closer to a proof of concept than the formal start of production CPO.

2) Generation 2: Bailly (TH5) — Full Optical CPO

Bailly (TH5) — Broadcom's second-gen production CPO switch.

Switch ASIC: Tomahawk 5
Announced: OFC 2023, production 2024
Aggregate BW: 51.2 Tbps (all-optical)
OE: 8 × 6.4 Tbps (each OE 64 × 100G FR4)

Architectural leap to all-optical CPO

Bailly was an architectural leap. Going all-optical doubled bandwidth to 51.2 Tbps and moved the EIC from SiGe to 7 nm CMOS.

CMOS EIC enabled the scaling

7 nm CMOS allowed more complex per-OE control logic (Humboldt 250 → Bailly ~1,000 optical components).
Lane count per OE rose from 32 to 64.
With the SiGe penalty gone, power dropped together.

Packaging shift to FOWLP

Bailly moved to fan-out wafer-level packaging via ASE / SPIL. The PIC sits on top, TMVs (Through-Mold Vias) route signals down to the EIC, and copper-pillar bumps connect to the substrate.
Why FOWLP? It's already a proven technology in the mobile-handset market and is broadly supported across the OSAT ecosystem. It scales better than custom TSV processes.

Bailly implications

Bailly is the generation that simultaneously validated three changes: all-optical, CMOS EIC, and FOWLP.
It demonstrated FOWLP working at production scale and built a packaging foundation that could extend to the customer-ASIC business.
That said, the FOWLP / TMV structure imposes a 100G / lane ceiling. That limit is exactly why the next generation (Davisson) had to move to 200G / lane and is also the motivation for pivoting to TSMC COUPE.

3) Generation 3: Davisson (TH6) — 102.4 Tbps

Davisson (TH6) — Broadcom's third-gen (current-gen) production CPO switch.

Switch ASIC: Tomahawk 6 (TSMC N3)
Announced: October 2025, with deployment running 2025–2026
Aggregate BW: 102.4 Tbps
OE: 16 × 6.4 Tbps
SerDes: 200G / lane

A 102.4 Tbps switch

Unveiled in October 2025 and rolling out for deployment in 2025–2026, Davisson once again doubles bandwidth vs. Bailly to reach 102.4 Tbps.

Spec	Davisson (TH6)
Switch ASIC	Tomahawk 6 (TSMC N3)
Aggregate bandwidth	102.4 Tbps
Optical engines	16 × 6.4T
SerDes speed	200G / lane (up from 100G)
Optics	Davisson DR OE (200G / channel)
Port configurations	128 × 800G or 64 × 1.6T
Packaging	Liquid-cooled, 10U chassis

Key shift: 200G / lane SerDes

The same generational leap Nvidia made on Quantum-X.
Doubling lane speed reaches 102.4 Tbps without doubling fiber count.
That said, in the FOWLP / TMV-based Bailly structure, 200G / lane is near the ceiling — the direct reason Broadcom needs to move the next generation to TSMC COUPE.

Open-ecosystem play

Broadcom outsources box assembly to contract manufacturers (Micas, Celestica).
Some customers like Japan's NTT are reportedly exploring buying Broadcom switch silicon and building proprietary CPO systems with their own OEs on top.
That's a far more open-vendor approach than Nvidia's vertically integrated model.

Davisson implications

Davisson is Broadcom's "mature production CPO" stage.
It's the first generation that combines 200G / lane + TSMC N3 + a multi-vendor box ecosystem all at once.
It's also close to FOWLP's last generation. The next generation flips the stack itself to TSMC COUPE.

4) Beachfront vs. Oceanfront — a fan-out approach toward scale-up

The architectural approach Broadcom is pushing to bond CPO onto customer ASICs (XPUs / GPUs) is a fan-out-based "beachfront vs. oceanfront" structure.

Metaphor:
- "Beachfront" is the package substrate edge — limited shoreline.
- "Oceanfront" is the fan-out region extended beyond it, creating additional space to attach optical engines.
Why it matters for scale-up:
- Pushing OEs farther from a hot xPU reduces the thermal stress on temperature-sensitive optics.
- OEs can be attached at the very last step of the assembly process → only known-good OEs touch the package, dramatically lifting yield.
- Four OEs can be lined up along a single oceanfront edge → today, 51.2 Tbps offload from a single edge.
Roadmap trajectory (per Broadcom roadmap slides):
- 2025: 12.8 Tbps per oceanfront (1 OE)
- 2027: 51.2 Tbps per oceanfront (2 OEs)
- 2028: 102.4 Tbps per oceanfront (4 OEs)
The CoWoS-S → CoWoS-L transition unlocks substrate edges greater than 100 mm, accommodating larger OE arrays.

5) Strategic positioning

The customer-ASIC business is the real prize.
- Broadcom isn't simply a switch vendor. It sits in the seat designing hyperscalers' custom ASICs.
- The production experience accumulated in CPO makes Broadcom a natural design partner the moment those customers want to bond CPO onto their own AI ASICs.

Customer	Adoption stance
OpenAI	Picked Broadcom for the ASIC partnership. CPO experience is one of the core factors.
Google	The most hesitant hyperscaler. Negative on CPO adoption due to reliability concerns, even though it's Broadcom's largest ASIC customer.
NTT, etc.	Reportedly exploring buying Broadcom switch silicon and building proprietary CPO systems with their own OEs on top.

The Google paradox is worth flagging. Google relies on Broadcom for custom TPU silicon but rejects CPO deployment.
- Its infrastructure philosophy prioritizes reliability over peak performance.
- CPO's blast radius (one OE failure → 64+ ports lost) clashes with Google's operations model.
- A strong signal that even within the deepest customer relationship, CPO adoption isn't universal — it's workload-specific.

6) The pivot to the TSMC COUPE platform

Even Broadcom — the vendor with the most production experience — is moving its next-generation product to the TSMC COUPE platform. That means fundamentally revising some of the existing design decisions.

Component	Existing Broadcom approach	COUPE approach
Packaging	FOWLP via SPIL	TSMC SoIC hybrid bonding
Coupling	Edge coupling	Grating coupling
Modulator	MZM	MRM
Speed ceiling	100G / lane (TMV parasitics)	200G / lane and beyond

Why the change?
- FOWLP can't push beyond 100G / lane because of TMV parasitics.
- Edge coupling doesn't fit cleanly with structures that put the OE on an interposer.
- As long as the existing approach holds, Broadcom falls permanently behind on the speed roadmap.
The cost: even as the most production-experienced vendor, the IP accumulated in MZM / FOWLP doesn't transfer cleanly to MRM / SoIC. In effect, the design in the core area has to start over.
The bet: the design support and tooling TSMC provides (PDK, EDA workflow, reference designs) offset the reset cost, and the long-term performance ceiling justifies the short-term pain. It's a strong signal of TSMC's leverage inside the CPO ecosystem — even the most experienced CPO vendor is committing to the TSMC platform.

7) Bottom line

Broadcom's Humboldt → Bailly → Davisson three-generation arc has built the most mature portfolio in production CPO. Each generation left a clear lesson.

Humboldt proved CPO works in a real Ethernet switch but exposed SiGe's power penalty.
Bailly transitioned to all-optical with a CMOS EIC and validated FOWLP at scale, but ran into the 100G / lane ceiling.
Davisson doubles bandwidth via 200G / lane and TSMC N3, and it kicks off an open-ecosystem play through box-maker partnerships and a silicon-only sales channel.

The future is split.

Broadcom continues its leadership in scale-out switches with Davisson,
but the larger opportunity is scale-up CPO bonded inside customer AI ASICs — the beachfront / oceanfront fan-out approach is the on-ramp.
The TSMC COUPE pivot is a signal that "even the most experienced CPO vendor finds it hard to out-engineer the TSMC manufacturing platform."
In the end, the future of CPO isn't defined by who carries the most legacy CPO IP — it's defined by whoever masters the TSMC stack first.

(3) Nvidia's generation-by-generation CPO adoption timeline

This isn't a one-off 2026 event but a multi-year roadmap in which Nvidia, generation after generation, pushes CPO into deeper layers.

Generation	Timing	CPO adoption surface	What it means
Quantum-X / Spectrum-X	2H25–2H26	Scale-out switch (back-end network)	First production CPO — supply-chain validation phase
Rubin	2H26+	Full-scale scale-out CPO deployment	Begins shifting some scale-out traffic from optical transceivers to CPO
Rubin Ultra	2H27	Phase 1 scale-up CPO — rack-to-rack	Pod-level domains like NVL576 / NVL1152 connect over fiber. Just the copper → optical swap delivers ~3–4× performance per unit of traffic.
Feynman	2H28+	Phase 2 scale-up CPO — intra-rack	Past 400G per lane, copper hits its limit even inside the rack. Optics fills intra-rack, rack-to-rack, spine, and DCI. Optical-lane demand jumps ~10×.

Nvidia chose a conservative path: start with the "least sensitive" surface and expand inward toward the most sensitive surface near the GPU as the supply chain matures.
The follow-the-leader pattern, with Broadcom and the ASIC camp trailing Nvidia by 6–12 months, holds.

(4) Composite implications

Nvidia and Broadcom converge on the same platform (TSMC COUPE), but their entry paths are opposite.
- Nvidia enters via a vertically integrated end-to-end system (switch + OE + ELS + fiber routing + cassette) — fast adoption is possible, but customer lock-in grows.
- Broadcom builds a more open ecosystem via bare die + contract manufacturer + customer-ASIC partnerships — adoption pace is slower but fits a multi-vendor world better.
Across both, 200G / lane + an MRM-class modulator + hybrid bonding is settling in as the production CPO baseline.
That said, both vendors are in a market-validation phase rather than mass adoption. Quantum-X uses detachable sub-assemblies; Bailly / Davisson run without multi-OE redundancy; neither has yet reached an architecture that recovers cleanly when one OE fails in the field.
So the 2026 CPO market is shaped less by "who builds the fastest OE" and more by "who can transplant the serviceability and multi-vendor ecosystem of pluggables onto CPO fastest."

Link resiliency — the hyperscaler value beyond power saving

CPO's stability story isn't a single dramatic improvement — it's a system-level removal of the variables that produce soft failures: fewer interfaces, tighter tolerance management, more uniform ports, and the elimination of human-touch steps.
In large AI clusters, link resiliency isn't a nice-to-have — it ties directly to the useful training time you can extract from the fleet.
A dead link halts training for a GPU group or drops throughput as routing detours; at the scale of tens of millions of GPU-hours, even sub-percent differences in link uptime translate into real losses.
CPO replaces a still-disorganized, hand-touched optical supply chain with a more systematized, predictable, factory-sealed structure.
That's why some hyperscalers say "CPO's biggest value isn't power saving — it's resiliency."
One caveat: CPO's blast radius grows. A single OE failure can affect 64+ ports, so whether CPO's resiliency model can be reliably detected, isolated, and recovered in real datacenter operating environments remains a separate, central challenge.

On CPO

8. Today's Major CPO Products

(1) Nvidia (NASDAQ: NVDA)

1) Quantum X800-Q3450

Multi-plane architecture

Inside the package

The 200G MRM breakthrough

Process-node split

Cooling

Quantum-X implications

2) Spectrum-X Photonics

Why MCM matters

3.2 Tbps optical engine (Gen 2)

6810 vs. 6800

Spectrum-X implications

3) Quantum-X vs. Spectrum-X side by side

4) What the roadmap implies

(2) Broadcom (NASDAQ: AVGO)

1) Generation 1: Humboldt (TH4) — Proof of Concept

Hybrid design

Key technical choices

Humboldt implications

2) Generation 2: Bailly (TH5) — Full Optical CPO

Architectural leap to all-optical CPO

CMOS EIC enabled the scaling

Packaging shift to FOWLP

Bailly implications

3) Generation 3: Davisson (TH6) — 102.4 Tbps

A 102.4 Tbps switch

Key shift: 200G / lane SerDes

Open-ecosystem play

Davisson implications

4) Beachfront vs. Oceanfront — a fan-out approach toward scale-up

5) Strategic positioning

6) The pivot to the TSMC COUPE platform

7) Bottom line

(3) Nvidia's generation-by-generation CPO adoption timeline

(4) Composite implications

Link resiliency — the hyperscaler value beyond power saving

On CPO

Read next