Tuesday, July 23, 2024
HomeFeatureStudying the hype behind Radeon RX 7900 series: Can it really deliver...

Studying the hype behind Radeon RX 7900 series: Can it really deliver the promise?

Studying the hype behind Radeon RX 7900 series

Note: This feature was first published on 15 November 2022.

The Radeon RX 7900 XTX is the crown of AMD's RDNA 3 GPU-based offerings at this juncture. (Image source: AMD)

Note: This is atwo-part story that continues further in a dedicated RDNA 3 GPU engineering article.

Picking the right battle

Let's get one thing straight. AMD's latest RDNA 3-powered graphics cards, the RadeonRX 7900 series is clearly not targeted to duel against NVIDIA's flagship, the GeForce RTX 4090. And rightly so, as the green camp's top enthusiast graphics card is priced north of US$1,599, while AMD has decided to battle where serious gamers are really going to spend their money – top-tier options just shy of US$1,000.

This is why the RadeonRX 7900 XTX is priced at US$999 and comes with an equally large 24GB frame buffer. Sure, it's not using higher-speed GDDR6X memory, but AMD has chosen to go with existing GDDR6 memory to conserve power utilised, improve the product's cost-effectivenessand still get the most out of it from a throughput perspective. That would mean gamers get to toggle higher resolutions if the rest of their rig is ready for it, and a boon for creators with access to that much frame buffer on a product that costs 37% less.

That's not all, the US$899 Radeon RX 7900 XT is also just as much enticing with more firepower (RDNA 3 vs. RDNA 2), a larger frame buffer (20GB vs. 16GB) and higher memory throughputthan its predecessor (higher clocks and wider memory bus).

Radeon RX 7900 vs. RX 6900 series
RX 7900 XTX RX 7900 XT RX 6950 XT RX 6900 XT RX 6800 XT
GPU Navi 31(RDNA 3) Navi 31(RDNA 3) Navi 21 XT(RDNA 2) Navi 21 XT(RDNA 2) Navi 21 XT(RDNA 2)
Compute Units 96 84 80 80 72
Game Clock 2,300MHz 2,000MHz 2,100MHz 2,015MHz 2,015MHz
Boost Clock 2,500MHz 2,400MHz 2,310MHz 2,250MHz 2,250MHz
VRAM 24GB GDDR6 20GB GDDR6 16GB GDDR6 16GB GDDR6 16GB GDDR6
Memory Bus 384-bit 320-bit 256-bit 256-bit 256-bit
Memory Speed 20Gbps 20Gbps 18Gbps 16Gbps 16Gbps
Memory Bandwidth 960GB/s 800GB/s 576GB/s 512GB/s 512GB/s
Typical Board Power (TBP) 355W 300W 335W 300W 300W
Price US$999 US$899 US$1,099 US$999 US$649

Did we mention that it's not going to sip any more board power than its predecessor? Well perhaps the RX 7900 XTX, but it's only by a small degree. Plus, the Radeon RX 7900 series is a more forward-looking card designed with the latest DisplayPort standard in mind to support really high-res and high-refresh output. Here's how the red camp thinks it's positioning against NVIDIA, but nothing is fixed until the green camp announcesthe RTX 4080 tonight:-

Is AMD going to cramp NVIDIA's style? The completion has never been this heated in a long while. In a month's time, all will be clear. (Image source: AMD)

There's a lot else in store, but these high-level aspects should give you a better understanding of why the Radeon RX 7900 series seem so promising. The Radeon RX 7900 series will hit retail on13th December 2022, and while there's still time for the competition to cough up options, let's focus on what AMD's new graphics powerhouse is capable of in this feature.

Performance Expectations

Since NVIDIA's suitably priced RTX 40 series competitor has yet to launch at the time of writing (the GeForce RTX 4080 16GB), AMD could only officially share performance figures across their own offerings. As can be expected, the Radeon RX 7900 series were tested on AMD's very own top platform offering with aRyzen9 7900X CPU AM5 processor and 32GB of RAM. Having both the CPU and GPU on AMD's ecosystem further amplifies the performance benefits since it can unleash AMD SmartShift Max for power shift optimization uplift between the CPU and GPUwhere the power and temperature envelope has room to spare and AMD Smart Access Memory (SAM) technologythat allows the CPU to directly access the graphics card's full memory space to reduce latency and buffering.

(Image source: AMD)

Now let's dissect what AMD has shared thatwe can analyse before receiving the review unit. Looking at the above performance figures at max settings in 4K resolution this is how we can break it down when comparing against the Radeon RX 6000 series flagship:-

Radeon RX 7900 vs. RX 6900 series in 4K gaming
Games Resident Evil Village COD: MW2 Cyberpunk 2077 Watch Dogs: Legion
GPUs 6950 XT 7900 XT 7900 XTX 6950 XT 7900 XT 7900 XTX 6950 XT 7900 XT 7900 XTX 6950 XT 7900 XT 7900 XTX
FPS 124 157 190 92 117 139 43 60 72 68 85 100
% Increase 26.6% 53.2% 27.2% 51.1% 39.5% 71.4% 25% 47%
GPU cost (USD) $1,099 $899 $999 $1,099 $899 $999 $1,099 $899 $999 $1,099 $899 $999
Cost per FPS (USD) $8.86 $5.72 $5.25 $11.94 $7.68 $7.18 $25.55 $14.98 $13.88 $16.16 $10.58 $9.99

AMDclaims up to 67% performance uplift on titles using pure rasterization. While it could be true based on a wider set of game titles tested, it doesn't seem that way when we focus on these four leading game title varieties that AMD chose to showcase. Based on these titles, the Radeon RX 7900 XTX manages over 50% performance uplift over a Radeon RX 6950 XT and is more cost-effective than the Radeon RX 7900 XT when comparing the cost to churn per frame in the respective games.

Meanwhile, the Radeon RX 7900 XT seemed to fare between 25 to 30% better than its flagship predecessor but cost far less than it.

Let's study what happens when your turn on ray tracing and enable Fidelity FX Super Resolution (FSR), the NVIDIA DLSS equivalent for AMD:-

(Image source: AMD)

At 4K resolution, it's tough to get high performance with ray tracing and at maximum graphics settings. As such, we'll have to rely on AMD FSRto get some decent boost figures which help propel AMD's proposition that the Radeon RX 7900 series isnow a great optionfor those who wish to game at 4K resolution. Meanwhile, let's dissect the FSR-enabled figures as shared by AMD:-

Radeon RX 7900 vs. RX 6900 series in 4K HDR gaming (with FSR)
Games Resident Evil Village Dying Light 2 Cyberpunk 2077 Hitman 3
GPUs 6950 XT 7900 XT 7900 XTX 6950 XT 7900 XT 7900 XTX 6950 XT 7900 XT 7900 XTX 6950 XT 7900 XT 7900 XTX
FPS 146 171 201 39 64 72 42 57 62 57 81 89
% Increase 17.1% 37.7% 64.1% 84.6% 35.7% 47.6% 42% 56.1%
GPU cost (USD) $1,099 $899 $999 $1,099 $899 $999 $1,099 $899 $999 $1,099 $899 $999
Cost per FPS (USD) $7.53 $5.25 $4.97 $28.18 $14.04 $13.88 $26.16 $15.77 $13.88 $19.28 $11.10 $11.22

In this comparison, AMD claims the new Radeon RX 7900 XTX can perform up to 82% better at ray-traced games at 4K resolution with AMD FSR enabled and it does seem to be the case with Dying Light 2 with High RT settings. Taking an average across the four featured games, you get a little over 50% better ray-traced performance with the Radeon RX 7900 XTX and nearly 40% better with the XT variant when compared with the Radeon RX 6950 XT predecessor.

To be honest,AMD's launch event for Radeon RX 7900 series is the first time we're experiencing AMD FSR in action (since games didn't yet support it during the launch ofthe RX 6900 series), and it's looking great.Now, if only more games adopt FSR 2.0, which offers some terrific imaging quality, but this is definitely a work in progress.

Can you tell which screen has AMD FSR enabled? Neither could we and we had to move the viewpoint of the screen quite vehemently to see which was a natively 4K rendered ray-traced image (which would result in a little more jerking due to the load) versus one that was upscaling using AMD FSR tech at 1440p to generate 4K ray-traced visuals. (Click to view a larger image.)

What makes the Radeon RX 7900 series so much faster than its predecessor?

AMD's new RDNA 3 GPU architecture is crucial to the success of the Radeon RX 7000 series, and it touched on these four key areas of advancement. (Image source: AMD)

The Radeon RX 7900 series are the first graphics cards to feature AMD's RDNA 3GPUarchitecture that has been engineered for higher frequencies (designed to achieve3GHz) while consuming less power than RDNA 2. According to AMD's estimate, it has improved frequencies by 15%, but performance per watt has gone up by a staggering 54%.

That said, while the clocks for the Radeon RX 7900 series arehigher than the RX 6900 series, the highest boost clock is still 2.5GHz on the RX 7900 XTX. So why does AMD say it's designed to achieve 3GHz? When we asked AMD engineers about this mismatch, they shared that the rest of the headroom is provided for overclocking and for extreme tinkerers to take the GPU to new performance heights.

While RDNA 2 already made over 50% gain in performance per watt standings, RNDA 3 achieved that same degree once more. (Image source: AMD)

To go about achieving this, RDNA 3 enhanced the computation capabilities of the graphics core to double itspeak operations output per cycle thanks to anew RDNA3 unified Compute Unit (CU). To recap, the Radeon RX 7900 XTX and RX 7900 XT have 96 and 84 of these new CUs, respectively. It might not sound like it's a lot more than the Radeon RX 6900 series, but as you'll see below, the updated CUs pack a lot of changes.

A game-optimized unified compute unit (CU)

Note that this is just a snapshot of one of the new unified CU units - the Radeon RX 7900 XTX has 96 of them, 20% more than than its predecessor's old CU units. (Image source: AMD)

Each CU possesses64 dual-issue, multi-purpose arithmetic logic units (ALU)SIMDstreaming processorswithin that support not only dual-issue Wave 32 instructions but also a single-clock Wave 64 FMA(fused multiple accumulation) instruction for increased utilisation. The flexible utilization of these streaming processors for FP, integer and AI operations is what makes it a unified CU.

Coupled withimproved pipelining throughout, each of these unified CUs now has two dedicated AI acceleratorsthat are focused on enhancingmatrix operationsto boost machine learning, HPC and AI tasks more effectively- up to 2.7x more to be exact.

(Image source: AMD)

Efficiency of the entire graphics core has also been bolstered through upsizing of various caches (3MB L0 vector caches – 240% larger, 3MB L1 – 300% more, 6MB of L2 – 50% increase)and an updated AMD Infinity Cache to improve its bandwidth though the capacity has gone down from 128MB on RDNA 2 to 96MB on RX 7900 XTX and just 80MB on RX 7900 XT. AMD attributes this to a more balanced design that's suitable to the new RDNA 3 architecture.

(Image source: AMD)

Improving ray tracing performance was an important agenda for RDNA 3 (which would be AMD'ssecond generation of ray tracing as RDNA 2 was first to introduce RT) and AMD has stuck to using accelerators embedded in each of its CUs just like in RDNA 2.

If you notice the specs table earlier, Radeon RX 7900 series easily boasts many more CU units than its predecessors, and this directly allows the new GPUsto harbour20% more ray tracing units. Secondly, the AMD engineers have been very judiciouson how to use the raysbyexecutingthemefficiently and studying the process and outcome to dump unnecessary instruction to improve ray tracing logic, incorporate new box testing rules to exact more efficiency from each ray (there's no upside to wasting precious cycles through inefficient rendering) and added hardware managed DirectX ray-tracing (DXR) flags. The result of all these improvements is that AMD says it has achieved up to 1.8x performance uplift on heavy ray-traced workloads.

So all this sounds promising and RDNA 3 could be the real deal we've been expecting from AMD. When can we actually see any real-world performance from actual in-house testing? Stay tuned as that's likely going to be closer to the actual availability of the Radeon RX 7900 series that's slated for availability on13th Decemeber 2022.

Meanwhile, this story continues further in a dedicated RDNA 3 GPU engineering article.

< PrevPage 1 of 1 – Studying the hype behind Radeon RX 7900 seriesPage 1 of 1 – Studying the hype behind Radeon RX 7900 seriesPage 1 of 1 Page 1 of 1 – Studying the hype behind Radeon RX 7900 seriesNext >

RELATED ARTICLES
- Advertisment -

Most Popular