RTX 4090 could be first GPU to reach 100+tflops

DriedMangoes

Hunting down all the witches
Sep 12, 2013
13,834
4,378
3,930

Recent rumors regarding the next-generation NVIDIA GeForce RTX 4090 series suggest that the AD102-powered graphics card might be the first gaming product to break past the 100 TFLOPs barrier.

NVIDIA GeForce RTX 4090 Class Graphics Cards Might Become The First Gaming 'AD102' GPU To Break Past the 100 TFLOPs Barrier​

Currently, the NVIDIA GeForce RTX 3090 Ti offers the highest compute performance amongst all gaming graphics cards, hitting anywhere between 40 to 45 TFLOPs of FP32 (Single-Precision) GPU compute. But with the next-generation GPUs arriving later this year, things are going to take a big boost.



As per rumors from Kopite7kimi and Greymon55, the next-generation graphics cards, not only from NVIDIA but AMD too, are expected to reach the 100 TFLOPs mark. This would mark a huge milestone in the consumer graphics market which has definitely seen a major performance and also a power jump with the current generation of cards. We went straight from 275W being the limit to 350-400W becoming the norm and the likes of the RTX 3090 Ti are already sipping in over 500W of power. The next generation is going to be even more power-hungry but if the compute numbers are anything to go by, then we already know one reason why they are going to sip that much power.


As per the report, NVIDIA's Ada Lovelace GPUs, especially the AD102 chip, has seen some major breakthrough on TSMC's 4N process node. Compared to the previous 2.2-2.4 GHz clock speed rumors, the current estimates are that AMD and NVIDIA will have boost speeds similar to each other and that's around 2.8-3.0 GHz. For NVIDIA specifically, the company is going to fuse a total of 18,432 cores coupled with 96 MB of L2 cache and a 384-bit bus interface. These will be stacked in a 12 GPC die layout with 6 TPCs and 2 SMs per TPC for a total of 144 SMs.


Based on a theoretical clock speed of 2.8 GHz, you get up to 103 TFLOPs of compute performance and the rumors are suggesting even higher boost clocks. Now, these are definitely sounding like peak clocks, similar to AMD's peak frequencies which are higher than the average 'Game' clock. A 100+ TFLOPs compute performance means more than double the horsepower versus the 3090 Ti flagship. But one should keep in mind that compute performance doesn't necessarily indicate the overall gaming performance but despite that, it will be a huge upgrade for gaming PCs and an 8.5x increase over the current fastest console, the Xbox Series X.

zYeEJNC.png