Would the out-of-order memory access capability that AMD added to RDNA4 have any applications in a CDNA4-based design? I know UDNA is rumored to represent a fusion of CDNA and RDNA, implying that some design ideas will be kept from each. I know RDNA4's OOO abilities are unique, but I don't know if there would even be a theoretical benefit to adopting a similar approach for AI or HPC workloads.
Would the out-of-order memory access capability that AMD added to RDNA4 have any applications in a CDNA4-based design? I know UDNA is rumored to represent a fusion of CDNA and RDNA, implying that some design ideas will be kept from each. I know RDNA4's OOO abilities are unique, but I don't know if there would even be a theoretical benefit to adopting a similar approach for AI or HPC workloads.
The core (CU) count for MI300X is incorrect, it should be 304 instead of 288 in the first table
Yep, my mistake when moving the article from the Google Docs to the Substack and Wordpress! Has been fixed!