NVIDIA Hopper, 141 GB HBM3e, SXM
The NVIDIA H200 is the Hopper GPU with 141 GB of HBM3e memory, 1.76× the capacity of the H100. It holds the FP16 weights of a 70B model on a single card and delivers 4.8 TB/s of memory bandwidth for fast token generation at scale. Coming soon to packet.ai.
Pricing to be confirmed at launch
The H200 is the H100 with 141 GB HBM3e memory — the same Hopper compute engine, 1.76× the capacity and 1.43× the bandwidth.
Same 80-billion-transistor die as the H100: 4th-gen Tensor Cores, MIG support, and NVLink 4.0, with HBM3e delivering 1.43× the bandwidth.
76% more memory than H100. Long-context LLMs, multi-modal pipelines, and large-batch training fit without sharding.
FP8 training and inference with per-tensor scaling, for up to 2× the peak Tensor Core throughput of FP16 on Hopper.
Scale across NVSwitch-connected nodes for large training runs where gradient communication is the bottleneck.
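To make the per-tensor scaling mentioned for FP8 concrete, here is a minimal sketch in plain Python. It shows only the scale-and-clamp step: one factor per tensor maps the largest magnitude onto the top of the FP8 E4M3 range (448). The function names and sample values are hypothetical, rounding to the FP8 grid is left out, and production libraries typically track the maximum over several steps rather than recomputing it per tensor.

```python
# Sketch of per-tensor scaling for FP8 (E4M3). Runs on CPU, no GPU required.
# Assumption: rounding to the FP8 grid is omitted; only scaling and clamping are shown.

E4M3_MAX = 448.0  # largest finite value representable in FP8 E4M3


def per_tensor_scale(values):
    """Return the factor that maps the tensor's largest magnitude onto E4M3_MAX."""
    amax = max(abs(v) for v in values)
    return E4M3_MAX / amax if amax > 0 else 1.0


def scale_and_clamp(values, scale):
    """Scale into the FP8 range and clamp; a real kernel would then cast to FP8."""
    return [max(-E4M3_MAX, min(E4M3_MAX, v * scale)) for v in values]


def unscale(scaled, scale):
    """Undo the scaling once the low-precision work is done."""
    return [v / scale for v in scaled]


activations = [0.002, -0.5, 3.5, -7.0]        # hypothetical tensor values
scale = per_tensor_scale(activations)         # 448 / 7
scaled = scale_and_clamp(activations, scale)  # largest magnitude now sits at 448
restored = unscale(scaled, scale)             # back in the original range
```

Because a single factor covers the whole tensor, small values sit next to one large outlier and lose resolution once rounded to FP8, which is why finer-grained scaling schemes also exist.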
Dedicated or monthly — plus multi-node clusters.
Full H200 card reserved exclusively for you. 99.99% SLA, zero noisy-neighbour risk.
Join waitlist →
Reserved H200 at a flat monthly rate. Full single-tenant isolation and predictable cost. 99.99% SLA, zero noisy-neighbour risk.
Get a wholesale quote →
Scale frontier inference across multiple H200 nodes with NVLink 4.0 and InfiniBand interconnect.
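141 GB holds the FP16 weights of a 70B model (about 140 GB) on one card, with no quantisation or model sharding. Headroom for KV cache is tight at FP16, so long contexts or large batches may still call for FP8 or a second GPU.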
4.8 TB/s of memory bandwidth gives the fastest per-GPU token generation in NVIDIA's lineup outside Blackwell.
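141 GB gives room for parameter-efficient fine-tuning (such as LoRA on a quantised base) of 70B models on a single card. Full-parameter fine-tuning at that size also has to hold gradients and optimiser state, which runs to several times the weight footprint and needs multiple GPUs.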
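The sizing claims above can be checked with back-of-envelope arithmetic. The sketch below uses decimal gigabytes and counts weights only. It treats batch-1 decode as purely bandwidth-bound (every weight read once per token) and assumes about 16 bytes per parameter for mixed-precision Adam training. Both are simplifying assumptions, and the function names are illustrative. Real throughput and memory use depend on the serving stack, KV cache, and activations.

```python
# Rough sizing arithmetic in decimal GB (1e9 bytes). Weights only unless stated.

H200_MEMORY_GB = 141.0        # capacity quoted on this page
H200_BANDWIDTH_GBPS = 4800.0  # 4.8 TB/s quoted on this page

BYTES_PER_PARAM = {"fp16": 2, "fp8": 1}


def weight_gb(params_billion, dtype):
    """Memory taken by the model weights alone."""
    return params_billion * BYTES_PER_PARAM[dtype]


def decode_ceiling_tokens_per_s(params_billion, dtype):
    """Upper bound for batch-1 decode if every weight is read once per token."""
    return H200_BANDWIDTH_GBPS / weight_gb(params_billion, dtype)


def full_finetune_gb(params_billion, bytes_per_param=16):
    """Assumed mixed-precision Adam: FP16 weights and gradients (2 + 2 bytes)
    plus FP32 master weights and two optimiser moments (4 + 4 + 4 bytes)."""
    return params_billion * bytes_per_param


fp16_weights = weight_gb(70, "fp16")                    # 140 GB
headroom = H200_MEMORY_GB - fp16_weights                # 1 GB left for everything else
ceiling_fp16 = decode_ceiling_tokens_per_s(70, "fp16")  # about 34 tokens/s
ceiling_fp8 = decode_ceiling_tokens_per_s(70, "fp8")    # about 69 tokens/s
train_gb = full_finetune_gb(70)                         # 1120 GB, far beyond one card
```

Under these assumptions, FP16 weights for a 70B model leave about 1 GB free, and halving the bytes per parameter doubles the bandwidth-bound decode ceiling.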
Hopper GPU with 141 GB HBM3e. Same FP8 compute as H100, 1.76× the memory and 1.43× the bandwidth.
Coming soon to packet.ai. Join the waitlist for early access.
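Yes, with a caveat. At FP16 the weights of Llama 3.1 70B take roughly 140 GB, so they only just fit in 141 GB of HBM3e and leave very little room for KV cache. FP8 weights (about 70 GB) leave ample headroom on a single card.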
Same FP8 Tensor Cores, 1.76× the memory (141 vs 80 GB), and 1.43× the bandwidth (4.8 vs 3.35 TB/s). The gain shows up mainly in memory-bound inference, where larger batches and longer contexts fit on one card.
Yes. Up to 7 isolated MIG instances for multi-tenant inference.
141 GB HBM3e. Join the waitlist for early access on packet.ai.
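The multipliers quoted in the H100 comparison follow directly from the raw figures given there (141 vs 80 GB, 4.8 vs 3.35 TB/s). A short check, with dictionary keys chosen purely for illustration:

```python
# Spec figures as quoted on this page.
H100 = {"memory_gb": 80, "bandwidth_tbps": 3.35}
H200 = {"memory_gb": 141, "bandwidth_tbps": 4.8}

memory_ratio = H200["memory_gb"] / H100["memory_gb"]
bandwidth_ratio = H200["bandwidth_tbps"] / H100["bandwidth_tbps"]
extra_memory_pct = (memory_ratio - 1) * 100

print(f"{memory_ratio:.2f}x memory, {bandwidth_ratio:.2f}x bandwidth, "
      f"{extra_memory_pct:.0f}% more memory")
# prints "1.76x memory, 1.43x bandwidth, 76% more memory"
```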
On-demand · hourly billing · US & EU regions