NVIDIA Blackwell, 32 GB GDDR7, PCIe Gen5
The RTX 5090 is NVIDIA’s flagship Blackwell consumer GPU: 32 GB of GDDR7 memory with 1.79 TB/s of bandwidth, 5th-generation Tensor Cores with native FP4 support, and 21,760 CUDA cores. Coming soon to packet.ai, with dedicated instances from $0.59/GPU-hour.
Dedicated $0.59/hr · Pricing to be confirmed at launch
The RTX 5090 brings 5th-gen Tensor Cores, native FP4, and 1.79 TB/s GDDR7 to a PCIe card — the most capable consumer GPU for AI inference ever made.
Native FP4 precision delivers 3,352 AI TOPS, more than 2× the rated AI throughput of the RTX 4090. First consumer GPU with FP4 support.
1.79 TB/s of memory bandwidth, 78% more than the RTX 4090. Fits 13B models at FP16 and 32B models at Q4 on a single card.
Drops into any PCIe Gen5 system — no SXM motherboard required. Accessible Blackwell for any server environment.
4th-gen RT Cores and hardware AV1 encode make the RTX 5090 the best consumer GPU for AI video and image generation.
Dedicated single-tenant — plus multi-node clusters.
Full RTX 5090 reserved exclusively for you. Zero noisy-neighbour risk, 99.99% SLA.
Join waitlist →
Reserved RTX 5090 at a flat monthly rate. Full single-tenant isolation and predictable cost, with a 99.99% SLA and zero noisy-neighbour risk.
Launching soon
Scale inference across multiple RTX 5090 nodes with InfiniBand interconnect.
1.79 TB/s bandwidth and 32 GB VRAM make the 5090 the fastest consumer GPU for token generation on 7B–13B models at FP16.
32 GB gives you headroom for LoRA and QLoRA fine-tuning of 13B models without sharding — at a fraction of H100 cost.
DLSS 4 Multi Frame Generation, 4th-gen RT Cores, and hardware AV1 encode make the RTX 5090 the best consumer GPU for FLUX, SDXL, and video AI.
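The token-generation claim above rests on single-stream decoding being limited by memory bandwidth: each generated token requires reading every weight once, so bandwidth divided by model size gives a rough ceiling on tokens per second. The sketch below works through that arithmetic for the 1.79 TB/s figure. The function name and the simplifications (batch size 1, no KV-cache traffic, no compute or kernel overhead) are assumptions for illustration, so real throughput will be lower than these ceilings.

```python
# RTX 5090 memory bandwidth from the spec above (1.79 TB/s), in GB/s.
BANDWIDTH_GB_S = 1790.0


def decode_ceiling_tokens_per_s(params_billion: float,
                                bytes_per_weight: float,
                                bandwidth_gb_s: float = BANDWIDTH_GB_S) -> float:
    """Rough upper bound on batch-1 decode speed.

    Assumes every weight is read once per token and nothing else
    limits throughput (hypothetical simplification).
    """
    # (params_billion * 1e9 params) * bytes_per_weight / 1e9 = size in GB
    model_gb = params_billion * bytes_per_weight
    return bandwidth_gb_s / model_gb


if __name__ == "__main__":
    for size in (7, 13):
        ceiling = decode_ceiling_tokens_per_s(size, bytes_per_weight=2)  # FP16
        print(f"{size}B at FP16: at most ~{ceiling:.0f} tokens/s")
```

Under these assumptions the ceiling is about 128 tokens/s for a 7B model and about 69 tokens/s for a 13B model at FP16. Batching, quantization, and the inference runtime all move the measured number, so treat this as a sizing aid rather than a benchmark.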
Blackwell flagship consumer GPU: 32 GB GDDR7, 1.79 TB/s, 3,352 AI TOPS. The most powerful consumer GPU ever built.
Coming soon. Join the waitlist to be notified the moment capacity opens.
Dedicated from $0.59/GPU-hour, single-tenant. Monthly pricing TBC at launch. See pricing →
Models up to 13B run natively at FP16, and models up to 32B fit at Q4 on a single card. For 70B+, use H100 or H200.
H100 has more memory (80 GB vs 32 GB), ECC, NVLink, and 24/7 datacenter reliability. RTX 5090 wins on cost-per-token for sub-30B inference and generative AI workloads.
No, the RTX 5090 does not support NVLink. It is PCIe Gen5 only. For NVLink multi-GPU, use H100 SXM or B200.
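The model-size guidance above (13B at FP16, 32B at Q4, 70B+ on larger cards) can be checked with simple weight-memory arithmetic: parameters times bits per weight, divided by eight. The sketch below is illustrative only. The 2 GB overhead allowance is a hypothetical placeholder for KV cache, activations, and runtime buffers, and Q4 is treated as exactly 4 bits per weight, although real 4-bit formats carry some extra bits for scales.

```python
VRAM_GB = 32.0      # RTX 5090 memory from the spec above
OVERHEAD_GB = 2.0   # hypothetical allowance for KV cache, activations, runtime


def weight_gb(params_billion: float, bits_per_weight: float) -> float:
    """Memory needed for the weights alone, in GB (1 GB = 1e9 bytes)."""
    return params_billion * bits_per_weight / 8


def fits(params_billion: float, bits_per_weight: float,
         vram_gb: float = VRAM_GB, overhead_gb: float = OVERHEAD_GB) -> bool:
    """True if weights plus the assumed overhead fit in VRAM."""
    return weight_gb(params_billion, bits_per_weight) + overhead_gb <= vram_gb


if __name__ == "__main__":
    cases = [("13B FP16", 13, 16), ("32B Q4", 32, 4), ("70B Q4", 70, 4)]
    for name, params, bits in cases:
        verdict = "fits" if fits(params, bits) else "does not fit"
        print(f"{name}: {weight_gb(params, bits):.0f} GB of weights, {verdict} in 32 GB")
```

With these assumptions, 13B at FP16 needs 26 GB of weights and 32B at Q4 needs 16 GB, both within 32 GB, while 70B at Q4 needs 35 GB and does not fit. Long contexts or large batches raise the real overhead well above the placeholder used here.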
The most powerful consumer GPU ever built. Join the waitlist for early access on packet.ai.
No commitment · we’ll notify you by email