1 Comment
⭠ Return to thread

My interpretation is the sequence with AVX-512 instructions gets throttled. If you had an AVX-512 sequence that lasts for a few microseconds surrounded by lighter code that runs for orders of magnitude longer, that AVX-512 sequence would get get throttled but everything else runs at full speed.

Because the AVX-512 sequence is so short, dropping a bit of performance there is preferable to running everything at a lower clock

Expand full comment