What I'd Buy in 2026 for €4,000: A Mid-Tier Sovereign AI Build
Here is what I would buy at €4,000 today, knowing what I know in 2026-05. There are two honest paths at this budget, and the choice is binding on workload rather than aesthetics. Path A is a new RTX 4090 24 GB on an upper-mid AM5 platform with 128 GB of DDR5, optimized for throughput on dense and MoE language models that fit in 24 GB. Path B is a used RTX A6000 48 GB on a Threadripper-class workstation board, optimized for the model classes that need more than 24 GB in one card.
I have not personally tested either build end to end. I run a DGX Spark at roughly the same euro outlay, which is the third honest option at this price point and gets its own comparison below. The Path A and Path B picks are conservative, sourced from current Geizhals listings, and the prices below are captured 2026-05-22.
Path A: new 4090 build
| Component | Pick | Price | Source |
|---|---|---|---|
| GPU | Gainward RTX 4090 24 GB | €2,689.99 | geizhals.eu Gainward 4090 Phantom |
| CPU | AMD Ryzen 7 7800X3D (boxed) | €349.00 | geizhals.de Ryzen 7 7800X3D boxed |
| Mainboard | MSI MAG B650 Tomahawk WIFI | €163.59 | geizhals.de MSI MAG B650 Tomahawk |
| RAM | 2× Crucial Pro 64 GB DDR5-5600 (128 GB total) | €1,260.40 | geizhals.de Crucial Pro 64GB Kit |
| NVMe | Samsung 990 PRO 4 TB | €499.99 | geizhals.de Samsung 990 PRO 4TB |
| PSU | be quiet! Pure Power 12 M 850 W ATX 3.1 | €151.67 | geizhals.de Pure Power 12 M 850W |
| Case | Fractal Design Meshify 2 | €124.90 | geizhals.de Meshify 2 |
| Path A total | €5,239.54 | over budget |
That total is over budget at the 4090’s current Geizhals floor of €2,690. To come in under €4,000, drop the RAM to a single 64 GB kit (saves €630) and drop the NVMe to a 2 TB Samsung 990 PRO at roughly €280 (saves €220). Adjusted total: €4,389. Still over budget by €389. To land at €4,000 exactly, you either accept a slightly lower-end 4090 SKU (the floor moves week to week), buy the GPU on a Mindfactory sale (historically a 5 to 10 percent discount window appears monthly), or accept that the build is €4.4k rather than €4.0k. I prefer the third option; the budget envelope is not the constraint that matters, the workload-fit is.
Path B: used A6000 build
| Component | Pick | Price | Source |
|---|---|---|---|
| GPU | Used NVIDIA RTX A6000 48 GB (Ampere) | €2,200 to €2,800 | ebay.com RTX A6000 48GB shop |
| CPU | AMD Ryzen 7 7800X3D (boxed) | €349.00 | geizhals.de Ryzen 7 7800X3D boxed |
| Mainboard | ASUS TUF Gaming B650-Plus WIFI | €146.73 | geizhals.de ASUS TUF B650-Plus |
| RAM | 2× Crucial Pro 64 GB DDR5-5600 (128 GB total) | €1,260.40 | geizhals.de Crucial Pro 64GB Kit |
| NVMe | Samsung 990 PRO 2 TB | ~€280 (estimate, verify before buying) | geizhals.de Samsung 990 PRO range |
| PSU | be quiet! Pure Power 12 M 850 W ATX 3.1 | €151.67 | geizhals.de Pure Power 12 M 850W |
| Case | Fractal Design Meshify 2 | €124.90 | geizhals.de Meshify 2 |
| Path B total | €4,512 to €5,112 | over budget on high end |
The used A6000 is the price-mover. eBay completed listings range from roughly €2,200 to €2,800 in 2026-05, with the low end being cards from data-center decommissioning and the high end being lightly-used workstation pulls with the original box. The 2 TB NVMe is estimated because the current Geizhals listing for the 4 TB at €499.99 implies the 2 TB at approximately €280; I have not pulled a specific 2 TB SKU’s current price and want to mark that line honestly as estimate, verify before buying.
Prices captured 2026-05-22 from Geizhals.de, Geizhals.eu, and eBay. They will drift. Re-verify before you buy.
Why each pick, the short version
4090 over 5090 at this tier. The 5090 at €3,469 to €3,889 is faster but pushes the build well past €5k for the same VRAM envelope. The 4090 is the price-correct dense-inference card at €2,690 because the per-token throughput delta to the 5090 does not justify the €800 to €1,200 cost delta unless you are specifically planning to use NVFP4 quantization. For the NVFP4 trade-offs see NVFP4 Quantization Explained; short version, the format is real and the speedup is real, but it is a Blackwell-only path and 24 GB caps you well below the model classes where NVFP4 actually changes the workflow.
A6000 48 GB over 4090 24 GB. This is the workload-fit pivot. Models that need more than 24 GB in one card become first-class citizens. Llama 3.1 70B at Q6 or Q8 quantization fits in 48 GB with headroom for context. Mistral Large dense fits at moderate quant. Fine-tuning small LoRAs has scratch space. The A6000 is the cheapest path to 48 GB of NVIDIA VRAM in a single card; the alternative is two 3090s with NVLink, which is mechanically feasible but operationally noisier and harder to cool.
7800X3D over 7700. At this tier the €100 cost delta is rounding error and the 3D V-cache helps the rare workloads that mix gaming with inference on the same box. If this is strictly an inference workstation, drop to the €2k tier’s Ryzen 7 7700 pick and pocket €110. I included the X3D here because the readers writing in at the €4k tier more often run mixed workloads (one box, used for both day-job development and inference experimentation).
128 GB DDR5. Two times the €2k build’s RAM. The reason is the second card and the room for CPU-offloaded layers when a model just barely overflows VRAM. 128 GB is the threshold below which model loading at the 70B+ class starts to feel slow because of page-cache churn.
4 TB NVMe (Path A) or 2 TB NVMe (Path B) plus a 4 TB SATA backup. Models accumulate fast. At this tier I assume you are running three to five models concurrently, each 30 to 80 GB on disk. The 4 TB primary plus an external backup is the smallest config that does not constantly trip over itself. The backup-without-bankruptcy approach lives in Backing Up 119B Parameters Without Bankruptcy; the same logic applies one tier down.
850 W PSU. A 4090 alone pulls up to 450 W under load. The A6000 is gentler at roughly 300 W. The 7800X3D plus board plus drives add 130 W headroom. 850 W is the safe floor for the 4090 path; the A6000 path could drop to 750 W but the saved cost is €30 and the headroom is worth it.
Path A versus Path B versus Spark at €4k
| Dimension | Path A (4090) | Path B (used A6000) | DGX Spark |
|---|---|---|---|
| VRAM | 24 GB | 48 GB | 128 GB unified |
| Best workload | dense ≤ 24 GB | dense to 48 GB | MoE 100B+ |
| 70B Q6 fit | tight, spills | clean | clean |
| 119B MoE fit | spills heavily | spills moderately | native fit |
| Image generation | best of the three | strong | weak |
| Quietness | acceptable | acceptable | moderate fan ramp |
| Warranty | new card, full | none (used) | NVIDIA |
| Resale (24 months) | strong | weak (data-center pulls) | unknown |
The Spark at €4,769 from the NVIDIA Founders listing on Geizhals is the third honest option and gets its own deep-dive in the Spark decision tree. The decision among the three pivots on whether your model roadmap is dense (Path A or Path B) or MoE (Spark), and whether you want a single box that you administer as a Linux server (Spark) or a desktop with a discrete card (Path A and Path B). See also the DGX Spark vs Mac Studio comparison for the workstation-versus-server framing.
What this runs, what it does not
Path A runs well: Llama 3.1 70B at Q4 (very fast), Mistral Small 3.x at FP16, Qwen 30B-class at Q8, Stable Diffusion XL and Flux at production resolution, dense models up to the 24 GB ceiling. Does not run well: Qwen 3.6 119B MoE (the active-parameter footprint plus the routing table do not fit in 24 GB cleanly), Mistral Large dense at usable quant, anything labelled 100B+ dense.
Path B runs well: Llama 3.1 70B at Q6 or Q8 (clean), Mistral Large dense at Q4, dual-model serving (a 7B plus a 70B on the same card), small LoRA fine-tunes (real, not symbolic). Does not run well: 119B MoE class with full context (still spills the routing table to system RAM), latest Blackwell-only quantization formats (no NVFP4 path on Ampere).
For the model-class trade-offs that decide which path wins for your workload, see Mistral Small 4 vs Qwen 3.6 vs GLM 5 on DGX Spark. The relative model rankings translate; the absolute throughput numbers do not because those were measured on Blackwell.
Monthly power cost, three jurisdictions
The 4090 inference-idle is roughly 20 W. Under load it pulls 400 W to 450 W. The A6000 idles at 15 W and loads to 280 W to 300 W. A realistic mixed-use profile (eight hours active, sixteen hours idle) averages around 180 W for Path A and 140 W for Path B. I will use 180 W as the conservative centerline; that is 131 kWh per month.
| Jurisdiction | €/kWh | Monthly cost at 131 kWh |
|---|---|---|
| Germany | €0.34 | €45 |
| United States (national avg) | €0.16 | €21 |
| India | €0.07 | €9 |
Hardware amortization over three years is €112 (Path A €4,000 envelope) to €126 (Path B €4,500 envelope) per month. Power adds €9 to €45. Total cost of operation: €120 to €170 per month, still well below cloud-API for sustained workloads. The break-even math is in Self-Hosted AI vs Cloud APIs: The Real Total Cost.
Compare to the other tiers
Below this tier, the €2k beginner build is the right answer for workloads that fit in 24 GB and do not need new-card warranty coverage. Above this tier, the €8k premium build is the right answer for sustained MoE workloads and for the operators who want the Spark’s unified-memory architecture. The €15k pro-studio build is the floor for two-card parallel jobs and serious fine-tuning.
If I had it to do again
The mistake I see most often at this tier is buying Path A when the workload was Path B (or vice versa). The trap is the GPU’s VRAM number on the spec sheet, which the buyer treats as a binary check (does the model fit yes or no) when it is actually a continuous variable (how much context, what quant, what batch size, what serving framework). Spend two evenings before you buy this build doing a paper exercise on three specific models you intend to run, at three specific quantization levels, with three specific context lengths. If all nine cells fit in 24 GB, Path A is correct. If three or more cells need 48 GB, Path B is correct. If any cell needs 80 GB or more, you are in the €8k tier and the €4k tier is going to disappoint.
The other discipline is to read Five DGX Spark Disasters I Survived before buying any of these paths. The disasters are operational, not architectural; they happen to every local-inference box, not just the Spark. Knowing what they look like in advance saves at least one weekend.
Book a Stack Audit
If you want a second pair of eyes on which of Path A, Path B, or Spark matches your actual workload, the Stack Audit is two hours, fixed-fee, ends with a configuration recommendation. About a third of audits end with “rent cloud for six months, here is what to measure.” The honesty is the product.
Contact via the footer (Nostr or email). Or read the €8k version next if your workload is past the 48 GB ceiling.