RkBlog

Threadripper 1920X versus Ryzen 5900X on Linux and Windows

2020-12-08

First generation Threadripper opened the workstation and HEDT markets to a wider audience. Some time after it was released I made a 1920X based PC for work on Linux and some gaming on Windows. With the release of Zen 3 I got the opportunity to compare 12 core Threadripper 1920X to a consumer 12 core Ryzen 5900X in productivity and game on Linux and Windows.

Test systems

1920X 5900X
RAM 4 x 8 GB G.Skill Ripjaws V (3200MHz, CL15, F4-3200C15D) 4 x 8 GB Patriot Viper Steel (4400MHz, CL19, PVS416G440C9K)
Running at 3800 CL16
Motherboard Asrock X399M Taichi micro-ATX Gigabyte B550 Aorus Master
GPU Sapphire Vega 64 + Alphacool Eiswolf 240 GPX Pro Gainward RTX 3070
PSU Chieftec PROTON BDF-1000C Enermax Revolution 87+ 850W (80+ Gold)
CPU Cooler Alphacool Eisbaer 240 (water AiO) Alpenfohn Brocken 3

Both systems were installed in open-bench cases (Streacom BC1 for the Threadripper and custom V-Slot for the Ryzen platform). Tests were done on Windows 10 (latest version at the time of writing) and Ubuntu 20.10 and with a daily build of 21.04 (more on that later). For more accurate comparison some Linux Ryzen 5900X tests were also done with the Ripjaws RAM kit at 3200 CL15.

Windows benchmarks

Each of the CPUs and GPUs have already been reviewed. I wanted to put them head to head in a set of benchmarks to get a rough idea how much improvement 5900X offers over 1920X (and then how good old Vega performs versus latest Nvidia GPU). Here is a set of synthetic CPU benchmark results for both systems:

1920X 5900X Performance delta
5900X vs 1920X
V-Ray 10381 16888 162,7%
Corona (seconds) 93 57 163,2%
Cinebench R15 Single 162 271 167,3%
Cinebench R15 2392 3608 150,8%
Cinebench R20 Single 397 616 155,2%
Cinebench R20 5245 8222 156,8%
Cinebench R23 Single 1044 1667 159,7%
Cinebench R23 13133 21788 165,9%
Novabench CPU Score 2155 3579 166,1%
PassMark CPU Score 24661,8 41590 168,6%
Aida64 memory latency (ns) 91,1 64,6 141,0%
Aida64 memory read (MB/s) 75524 58139 77,0%
Geekbench 5 Single 1035 1748 168,9%
Geekbench 5 10338 14859 143,7%
Time Spy CPU Score 9854 14263 144,7%
Time Spy Extreme CPU Score 4865 7496 154,1%
Night Raid CPU Score 10093 16092 159,4%

Quite solid wins of the Ryzen 9 5900X. The memory read is lower as Ryzen uses dual channel RAM configuration while Threadrippers use quad-channel configuration. Next we have some system benchmarks:

1920X + Vega 64 5900X + RTX 3070 Performance delta
PCMark 10 5837 8021 137,4%
PassMark 5815,6 9200 158,2%
Novabench 3547 5856 165,1%
Basemark 1481 1566 105,7%

Basemark is a browser based benchmark and it surfaced when some latest hardware results leaked to the public. The scores did differ a bit and even that 105% delta is questionable so I’m not so sure if Basemark can be used to put leaked scores into perspective vs other devies. Both systems were tested via MS Edge using Chrome engine. GFXBench didn't want to run on the Ryzen system due to some unspecified network error - that benchmark was also used for testing in some recent leaks. It's quite lightweight, even for Vega 64 or some iGPUs.

To end synthetic benchmarks - GPU/gaming benchmarks:

1920X + Vega 64 5900X + RTX 3070 Performance delta
Time Spy 7572 13708 181,0%
Time Spy Extreme 3634 6804 187,2%
Night Raid 39399 64704 164,2%
Superposition 1080p Medium 14120 24542 173,8%
Superposition 4K Optimized 6026 11018 182,8%
DX12 Draw calls per second 27263082 58248594 213,7%
DX11 MT Draw calls per second 1416229 4643475 327,9%

So in summary R9 5900X is around 60% better than 1920X. The Threadripper is 3 years older than the Zen 3 part so you can say AMD managed to pull 20% per year spent on Zen architecture. That's way more than what we got from Intel in the past years.

RTX 3070 is around 70%-80% faster than Vega 64. TechPowerUp database gives it 64% - and all of that while skipping on accelerated Ray Tracing, DLSS, VRS support and other features that the older GPU is missing.

I don't play many games, so I won't go over many games here, just those I focus on.

Starcraft 2

Starcraft 2 is really dated DX9 game and needs a high end CPU to perform well, which is very important for competitive players. All others playing Coop can suffer from stutter to nearly ~0 FPS if they engage in a clash of a large amount of units. If you want examples of bad frametimes this is the game to get data from.

I use MSI afterburner to record FPS data from start of a Coop games and from end-game combat moments where large armies clash. The results do differ from game to game but it's clear that 5900X offers around 60-100% better performance:

1920X Stacraft 2 performance
5900X Stacraft 2 performance

1% low or 0.1% low FPS isn't the best picture here. If we look at frametimes (recorded with CapFrameX) you can see some spikes on start of a game but way more unstable frametime in the end-game. The more variance in frametime the less fluid the experience may be (especially with sudden big spikes):

Frametimes - start of a SC2 coop game
Frametimes - start of a SC2 coop game
Frametimes - endgame combat
Frametimes - endgame combat

The game was running on default high settings without any FPS limiter or VSync.

World of Warcraft

WoW is quite hard to benchmark as depending on scenario you are stressing a different part of the system. The most CPU bound element of the game is mass-actor combat, like raids, battlegrounds and sometimes multiple mobs casting simultaneously etc. I'm planning to do a separate WoW article for this system but here is a quick comparison of the two CPUs (mode 7, 3440x1440):

WoW combat benchmark
WoW combat benchmark

The performance is pretty much doubled in this edge-case scenario. To get consistent and comparable data for combat performance I use TBC Karazan large mob count pull that recreates worst case scenario of actual raids quite well - by the sheer quantity of mobs and world state complexity they create, it's not about assets - unless you are running on an iGPU.

WoW uses few cores to offload draw calls and alike but world state and other tasks are still handled on the main core so single core performance (and latency/memory timings) is extremely important for any large scale end game content. In a dungeon or even average medium-small raid group the performance will be better. In terms of core scaling you get nearly max performance on 4 cores. 2 cores is noticeably less while 6 cores sees barely any scaling - although that's just for the game running without any other app and addons. At 4C/4T i5-9400F configuration the all core load is already close to 100%. 6-8 core Zen 3 parts will be really good for this game, just like similar parts from Intel.

Temperature effect

Ryzen performance depends on the CPU temperature. If it has thermal headroom it will clock bit higher giving bit better results. GPU can also lower the clocks based on current temperature. As the system is in an open case there is no airflow restrictions but there is also no case airflow which may or may not be optimal to some components.

Using the winter air I tested the Ryzen system at home at room temperature and on my balcony with ambient close to 0C. Time Spy score increased by 0,6% (if it's not within the margin of error), Superposition 1080p medium perform better by 2,3% when in cold air. Cinebench did improve, up to 6% on R23 single core (from 1572 to 1667 with the CPU set to Auto OC in Ryzen Master).

The Unigine scores: hot and cold show that the Gainward RTX 3070 reached up to 2010MHz in cold air and 1965 MHz in warmer home air.

The CPU score differences are likely due to insufficient mounting pressure of the radiator as later on it turned out some mounting screws weren't really tight (and the radiator wasn't that much warmer when the CPU was under full load - or the cooler is just not as good even though Alpenfohn Brocken 3 is massive and with two fans.

RAM benchmarks

The Patriot Viper Steel with XMP at 4400MHz CL19 isn't optimal for Ryzen. This high bin of Samsung B-die is intended for RAM OC pretty much. Even though the 2x8 GB kit is on the QVL list for the motherboard it did not boot when attempted with that XMP profile. Next auto-timing were tested at various frequencies. Final settings were timings given by the Ryzen DRAM calculator.

RAM latency
Time Spy CPU score
Superposition score

The auto timings were really slow/loose (like 3600 CL26) even though Superposition score doesn't change as much as a more CPU direct Time Spy CPU score. 4 sticks of single rank RAM are better than two. I you use two dual rank DIMMs (like some 16GB or bigger) then two should be the optimal choice.

Aida64 RAM benchmarks 4x8 Patriot Viper Steel

Linux benchmarks

For Linux I used Phoronix benchmark suite. There was no problem running Linux on the Threadripper platform while Ryzen on Gigabyte B550 Aorus Master had some problems. First of all - 2.5Gbit Ethernet did not work on any tested Ubuntu version (20.04, 20.10 and daily build of 21.04). On LTS and 21.04 WiFi did work, while 20.10 and recent Manjaro Live USB did not fully boot stopping at semi-working terminal without desktop running. This isn't likely only related to RTX 3070 as the problem wasn't at the end of the boot process. Sadly couldn't investigate more at the time.

Benchmarks I've picked focused around latency and efficiency in specific workloads - to get a picture of how Zen 3 differs from Zen 2, Threadripper and Intel designs. You can check the raw results on openbenchmarking. Sadly I couldn't run NASA parallel benchmarks as they didn't compile on Ubuntu 21.04 due to to new compiler - and those benchmarks did gave big differences for Ryzen 3500X vs i5-9400F before (some of them rely on inter-core communication latency and so on).

So lets start with quick comparison, both systems running the same 3200 MHz CL15 RAM:

1920X vs 5900X Linux benchmarks
1920X vs 5900X Linux benchmarks

And with 5900X running 3800 CL16 RAM:

1920X vs 5900X Linux benchmarks
1920X vs 5900X Linux benchmarks

Threadripper wins only in Stream memory bandwidth benchmarks due to quad channel memory, but 3800MHz RAM does cut that loss by a lot. Now lets look at some results in detail.

IPC_benchmark is a Linux inter-process communication benchmark. Here we see Intel noticeably behind it counterpart - 3500X. 5900X is the clear winner while Threadripper is the worst among AMD CPUs.

IPC_benchmark

Core-Latency is a test of core-latency, which measures the latency between all core combinations on the system processor. Here Intel also clearly wins with Zen 3 showing some improvement over Zen 2 (and TR4 far behind)

Core-Latency

Pmbench is a benchmark aimed to profile system paging and memory subsystem performance. This test profile will report the average page latency of the system. We can see monolithic Intel design winning while Zen 3 very close by. Older Zen 2 and TR4 CPUs with higher latency get much worse scores:

Pmbench

In Botan cryptography benchmarks 5900X wins in both AES-256 and Blowfish cyphers. It's kind of odd that 12-core 1920X achieves nearly same score as 6-core CPUs. Similarly in the AOMedia AV1 encoder as well as VP9/WebM conversion benchmark the 1920X looses versus all CPUs while 5900X wins by some margin depending on settings used:

AOMedia AV1 encoder benchmark

In Kvazaar, an CPU-based H.265 video encoder, Threadripper regains some performance while Intel struggles here. Maybe this benchmarks likes larger amounts of cache on the CPU?:

Kvazaar

7-Zip compression benchmark looks more like something we would expect. It scales with amount of cores so 12-core win with 6-core ones. 5900X does improve by a lot over the older Threadripper:

7-Zip

Compilation benchmarks are tricky as some projects like Linux Kernel scale really well with core count and cache size ending up being a cache benchmark and not so much a CPU performance metric. Here we have two examples:

Kernel compilation
Apache compilation

To end the Linux benchmarks overview let's look at Blender rendering, which is an all-core load. 12-core wins with 6-core while Zen 3 is the clear winner:

Blender benchmark

Aside of some rough edges with the newer B550 motherboard the Zen 3 Linux performance is really good. We can see noticeable improvements over previous generations. Those that do need Threadripper features will have to wait for the Zen 3 based versions, where as even 5900X can handle various compute and development workloads easily.

The new CCD/CCX design does improve latency although in extreme tests the monolithic Intel design still wins. This is less likely to translate into performance but for complex compute workloads in some edge case scenarios it may matter.

Conclusions

With Zen 2 going up to 16-cores it was obvious that some HEDT/workstation tasks reserved for more expensive higher end parts will go into the consumer row. For pure gaming 3900X and 3950X weren't the most optimal choice when considering the price but if you streamed/made some videos or used the PC also for some heavy duty work then such CPU was a cheaper-Threadripper option.

After 3 years of development that passed between 1920X and 5900X launches the performance improvement is quite sizable. We get around 60% more performance and new features like PCIe 4.0, newer USB standards and even Thunderbolt 3 on some of the AM4 motherboards.

Also gaming performance got a nice performance boost. The design changes that decreased the latency as well as IPC and clocks increases all benefited this performance metric. There are still cases where Intel will be better but there should be no outliers where you loose noticeable amount on any of the platforms.

If you have tasks that can use 12 or 16 cores then go for the high-end 5900X and 5950X chips, if not then just save some money and go with 5600X/5800X... but that all depends on what's in stock.

Comment article