
NVIDIA RTX 5080
NVIDIA RTX 5080
The RTX 5080 is NVIDIA's high-end Blackwell GPU offering 16 GB of GDDR7 memory at 960 GB/s bandwidth for a $999 MSRP. It slots between the RTX 5070 Ti and the flagship RTX 5090, offering solid 4K gaming performance and respectable local AI inference capabilities. While the 16 GB VRAM limits which models you can run at higher quants, the GDDR7 memory bandwidth means what does fit runs fast.
Quick verdict
| If you... | Then... |
|---|---|
| ...already have an RTX 4080 Super | Skip - only ~10-15% raw uplift. Not worth $1000+ |
| ...have an RTX 3080 or older | This is a massive upgrade. ~2× the performance, 60% more VRAM |
| ...want to run Qwen3-30B comfortably | Perfect match at Q4_K_M with ~22 tok/s |
| ...need 70B models | Look at the RTX 5090 or a dual-GPU setup |
| ...primarily do image gen (SDXL/Flux) | Excellent throughput, but 16GB is tight for Flux |
Spec breakdown
- VRAM: 16 GB GDDR7
- Memory bandwidth: 960 GB/s (30 Gbps effective)
- TDP: 360 W (recommend 850W+ PSU)
- PCIe: 5.0 ×16
- Architecture: Blackwell GB203-400
- CUDA cores: 10,752
- Tensor cores: 336 (5th gen)
- Power connector: 1× 12V-2×6 (16-pin)
Real-world AI inference
Tested on common model configurations (Q4_K_M, 2k context unless noted):
| Model | Tokens/sec | Source |
|---|---|---|
| Qwen3-30B Q4_K_M | ~22 tok/s | r/LocalLLaMA |
| Mistral Small 3 Q5_K_M | ~48 tok/s | Community benchmarks |
| Llama 3.3 70B Q3_K_M (offload) | ~11 tok/s | r/LocalLLaMA |
| ComfyUI SDXL (1024×1024) | ~9 s/image | Community |
| ComfyUI Flux Dev | ~28 s/image | Community |
Best models that fit
The 16GB VRAM ceiling puts the RTX 5080 in a specific sweet spot:
- Q4_K_M: Qwen3-30B (~18 GB) - fits comfortably at modest context
- Q5_K_M: Mistral Small 3 (~17 GB) - good fit
- Q8_0: 13-14B models like Qwen3-14B - plenty of room
- Q4_K_M (tight): Llama 3.3 70B at Q3_K_M fits via partial offload
Cost vs cloud
At $1,000-1,300, if you spend $50/month on API calls for coding assistance and LLM queries, the RTX 5080 pays back in ~20-26 months. For heavy users spending $150+/month, it's more like 7-9 months.
Where to buy
Affiliate disclosure: links below earn us a small commission at no cost to you.
- Amazon: Check current pricing via button above
- Newegg: Sometimes has better stock availability
- B&H Photo: No tax in most states, good for workstation builds
Honest alternatives
- RTX 5090 (~$2,200): Double the VRAM (32 GB), 70B models fit at Q4
- RTX 4090 used (~$1,300): 24 GB VRAM, slower bandwidth but more model flexibility
- RTX 5070 Ti (~$900): Same 16 GB, slightly slower, cheaper
What the community says
"The RTX 5080 is exactly what I expected - a solid generational bump over the 4080 Super. Qwen3-30B at 22 tok/s is my daily driver now. I just wish it had 20GB for that extra headroom."
- u/blackwell-early on r/LocalLLaMA, 156 upvotes
Frequently asked
Quick answers to common questions
How much VRAM does the NVIDIA RTX 5080 have?
The NVIDIA RTX 5080 has 16 GB of VRAM with 960 GB/s memory bandwidth. MSRP was $999.
What local AI models can run on the NVIDIA RTX 5080?
The NVIDIA RTX 5080 with 16 GB VRAM can run many models depending on quantization. Models up to ~24B params may fit at Q4_K_M. Use our VRAM calculator to check specific models.
Is the NVIDIA RTX 5080 good for local AI inference?
NVIDIA RTX 5080 is best for llm-inference, image-gen, gaming. With ample VRAM it handles most open models well.
Where can I buy the NVIDIA RTX 5080?
Check our buy links above for the best current prices on Amazon, Newegg, and B&H. Prices vary by retailer and availability.
How does the NVIDIA RTX 5080 compare to other GPUs?
NVIDIA RTX 5080 has 16 GB VRAM and 960 GB/s bandwidth. It is a mid-to-high-range card capable of running most 7B–30B models. Browse our hardware directory for side-by-side comparisons.
Is the NVIDIA RTX 5080 worth buying right now?
The current price is $1299 vs the MSRP of $999. The price is at or above MSRP. Consider waiting for sales events like Prime Day or Black Friday.
What power supply do I need for the NVIDIA RTX 5080?
The NVIDIA RTX 5080 has a TDP of 360W. This requires a high-wattage PSU (850W+ recommended). Always check the manufacturer's recommendations for your specific build.
Nearby options
Similar hardware and models that fit
Similar hardware
Comments coming soon
Configure NEXT_PUBLIC_GISCUS_REPO_ID and NEXT_PUBLIC_GISCUS_CATEGORY_ID at giscus.app to enable.