NVIDIA Rubin CPX Accelerates Inference for Million‑Token Context AI

NVIDIA
Published September 9, 2025, 15:19
Some of today’s most advanced AI workloads, like code and video generation, demand context processing at unprecedented scale, often exceeding one million tokens.

This is why NVIDIA is launching Rubin CPX: a GPU purpose‑built for the compute‑intensive context phase of inference.

Together with NVIDIA Dynamo orchestration and the Vera Rubin NVL144 CPX platform, Rubin CPX is ushering in a new era of end‑to‑end disaggregated inference architecture, setting benchmarks in performance, efficiency, and ROI for these high-value workloads.

Learn more: nvidianews.nvidia.com/news/nvi...