Discussion about this post

User's avatar
nicball's avatar

Great article! Although I believe ice lake has up to 40 cores, not 28 :P

Expand full comment
Evan R.'s avatar

In the 4th graph titled "Single Core Bandwidth", does anyone here have a guess about why the part of the curve for the L2 bandwidth starts to increase after a test size of 256 KBytes and reaches a local maximum at a test size of about 600 to 800 KBytes for the EPYC 9355P (1 MB L2 per core)? In other words, for the EPYC 9355P, why does using more of the L2 capacity cause the L2 bandwidth to increase, from 420 to 550 GB/sec for "EPYC 9355P, Add" and from 190 to 280 GB/sec for "EPYC 9355P, Read"? This is about a 30% to 50% increase in L2 bandwidth just from using more of the L2 capacity.

The section titled "Single Core Performance: SPEC CPU2017" says "I’ve been running the rate suite with a single copy to summarize single-thread performance." Wouldn't it be better to use the SPECspeed suite for single-thread performance? My understanding is that SPECrate is designed for measuring the throughput of multiple cores while SPECspeed is designed for measuring latency.

The "System Overview" section says the Amazon r8i instance uses a 6985P-C processor but it's actually a 6975P-C processor. The graphs in the article are labeled correctly.

Expand full comment
5 more comments...

No posts

Ready for more?