AMD's RDNA4 Architecture (Video)

George Cozma

Mar 5, 2025

Hello you fine Internet folks,

Read →

7 Comments

David. Hellyx

Mar 8, 2025

I'd love to see a detailed analysis of architectures between RDNA3 vs RDNA4 vs ADA vs Blackwell

Reply (1)

KozakMaks

Apr 2, 2025

Amd ai accelerators vs tensor cores

jozsef

Mar 5, 2025

Could you analyze this out of order memory more?

If i understand correctly, this is not a traditional cpu like out of order resource, because it doesnt exploit instruction level parallelism, only inter warps memory parallelism.

Do I see it right?

Jul 1, 2025

Hopefully RDNA4 still can implement SER, OMM, and Cooperative Vector support with Firmware and Driver updates.

Although recent work on Mesa/Linux seems to indicate Nvidia's proposed extensions stop the ways that would allow AMD hardware to be on par :/

jozsef

Mar 9, 2025

I'd like to see more rdna 4 architecture analysis, because IMHO rdna 4 is the most interesting gpu architecture since gcn. I am curiously interested in rdna 4 dynamic register allocation and out of order capapilities. Especially dynamic register allocation from software perspective, thinking about deadlocks, what are mentioned in rdna 4 instruction set architecture pdf.

Thanks in advance!

valentin

Mar 7, 2025

could you put charts up that compare it directly to the following dies N33,N32,N31 and N21 ?

N33 - the only monolithic big RDNA3 GPU (Except Viola but that is on N4P and on a Platform (PS5 Pro) where you cant do micro benchmarking and analysis)

N32 - the revised version of N31 (smaller caches per WGP and Array than N31), and overall the roughly the same size as N48

N31 - specifically the 7900GRE as it has almost the same number of transistors.

N21 - 2x the L3 cahce, monolithic and just for an overall overview on how RDNA developed over the course of the last 3 gens.

most interestingly would be N33 or N32 vs N48 when it comes to the caches

jozsef

Mar 8, 2025

So, do i see it wrong, or this out of order memory does not exploit instruction level parallelism within a thread? Anyone an answer???

Chips and Cheese

AMD's RDNA4 Architecture (Video)