It’s not a single die, and it’s extremely simple compared to the packaging tech on AMD’s massive CDNA setups aimed at FP64-heavy, government-research-class supercomputers.
Thanks for the impressive testing, as always. I have an article idea/request that seems like it would be well within your expertise: looking at the new CUDA tile programming model and how it may or may not bring other APIs closer to native CUDA performance on NVIDIA’s GPUs.
"Nvidia’s conservative hardware" This is a f***ing 1600mm² die, it's not conservative. lol
I meant it's a conservative multi-die setup compared to using two or four base dies and stacking eight compute dies on top.