Discussion about this post

User's avatar
Fredrik Tolf's avatar

I'm interested in how much a fairly light AVX-512 workload (say, a small but highly optimized loop that just runs for a couple of microseconds) affects core behavior. If I recall correctly, a common criticism with Skylake-X was that the core dropped completely as soon as *any* AVX-512 instructions were executed, causing many developers to just avoid AVX-512 completely since it left their code running actively worse than AVX2 code that should nominally be slower.

The "rapid switching" graph seems to indicate that it shouldn't be nearly as big of a deal on Zen 5, since the core at least seems to recover immediately when small-ish AVX-512 sequences end, but it does also clearly shows IPC throttling immediately , but this could of course just be due to measurement granularity. Do you think there might be a "maximum size" of AVX-512 workload that would cause the core not to throttle at all?

Expand full comment
StoykovK's avatar

Very interesting!

I was wondering whether the frequency behavior could actually be impacted by max possible current that core can handle as mentioned: https://numberworld.org/blogs/2024_8_7_zen5_avx512_teardown/

I've noticed that running heavy AVX512 code on 9955hx3d is limitted by 125A current rather than thermal/voltage or wattage.

Expand full comment
15 more comments...

No posts

Ready for more?