RX 7900 XTX
Our complete AI-workload review of the RX 7900 XTX: VRAM analysis, Flux & ComfyUI throughput, local LLM performance, power efficiency, and workstation fit.
Overview
The RX 7900 XTX delivers 49/100 on our composite AI workload index — a balanced view of Flux generation throughput, SDXL inference, and quantized local LLM performance. With 24GB of VRAM and a 355W TDP, it's positioned for serious AI creators running large workflows locally.
Pros & Cons
Pros
- 24GB VRAM handles most Flux & SDXL workflows
- 1.2 it/s on Flux.1 dev FP16
- Excellent CUDA ecosystem support
- Strong resale value
Cons
- 355W TDP requires serious PSU
- Premium pricing
- Limited stock at MSRP
Performance benchmarks
| Flux.1 dev FP16 · 1024² · 25 steps | 1.2 it/s | |
| SDXL Base · 1024² · 20 steps | 5.8 it/s | |
| Llama 3 70B · Q4_K_M · 2k ctx | 11 tok/s | |
| Hunyuan Video · 720p · 5s | 0.36 it/s |
VRAM analysis
With 24GB of VRAM, the RX 7900 XTX can comfortably run: Flux.1 dev FP16 with all loaders resident, SDXL with multiple LoRAs, and Llama 3 70B at Q4 quantization.
Flux performance
At 1.2 it/s, a standard Flux.1 dev 25-step generation completes in ~20.8 seconds. For batch workflows, expect linear scaling up to VRAM limits.
ComfyUI performance
SDXL throughput of 5.8 it/s makes the RX 7900 XTX viable for iterative ComfyUI workflows. Block-swap and tiled VAE are rarely needed.