NVIDIA Ada Lovelace, 24 GB GDDR6X, PCIe Gen4
The NVIDIA RTX 4090 is the flagship Ada Lovelace consumer GPU with 24 GB of GDDR6X memory at 1.01 TB/s of bandwidth, 4th-generation Tensor Cores with FP8, and 16,384 CUDA cores. Available on packet.ai from $0.39/GPU-hour dedicated.
Dedicated $0.39/hr · Monthly $263/mo
The RTX 4090 brings 4th-gen Tensor Cores, FP8 support, and 24 GB GDDR6X to a PCIe card. The most capable consumer GPU for inference and generative AI.
Ada Tensor Cores with FP8 deliver up to 1,321 AI TOPS. Strong enough for 7B model serving and LoRA fine-tuning.
Fits 7B models at FP16 and 13B at 4-bit on one card. The sweet spot for consumer-grade LLM inference.
No SXM board required. Drops into any Gen4 server for fast, flexible deployment.
Hardware AV1 encode and DLSS 3 Frame Generation make the RTX 4090 ideal for FLUX, SDXL, and video AI pipelines.
Dedicated single-tenant, plus multi-node clusters.
Full RTX 4090 reserved exclusively for you. Zero noisy-neighbour risk, 99.99% SLA.
Deploy Hourly →Reserved RTX 4090 at a flat monthly rate. Predictable cost, full single-tenant isolation.
Deploy Monthly →Scale inference across multiple RTX 4090 nodes with InfiniBand interconnect.
24 GB fits 7B at FP16 natively and 13B at Q4. The most popular model sizes at the lowest cost per token.
24 GB gives headroom for LoRA and QLoRA fine-tuning of 7B models at the lowest hourly rate on packet.ai.
DLSS 3 Frame Generation, hardware AV1 encode, and 24 GB VRAM make the RTX 4090 a workhorse for FLUX, SDXL, and video AI.
Ada Lovelace flagship consumer GPU: 24 GB GDDR6X, 1.01 TB/s, 1,321 AI TOPS.
Dedicated from $0.39/GPU-hour, or $263/month flat rate. Single-tenant, 99.99% SLA.
7B at FP16 natively; 13B at Q4 on a single card. For larger models, use L40S or H100.
RTX 5090 has more memory (32 GB vs 24 GB), faster bandwidth, and 5th-gen Tensor Cores with FP4. RTX 4090 wins on price at $0.39/hr vs $0.59/hr.
Yes. LoRA and QLoRA fine-tuning of 7B models fits comfortably in 24 GB at the lowest hourly rate on packet.ai.
No. PCIe Gen4 only. For NVLink multi-GPU, use H100 SXM or B200.
The best value consumer GPU on packet.ai. $0.39/hr dedicated, or $263/mo flat.
On-demand · hourly billing · US & EU regions