And it’s so painful for me since my workload is completely memory bandwidth bound but does not scale across multiple sockets, much less multiple nodes.
A specialized computational fluid dynamics program. And nominally CFD is great to scale across many sockets/systems but it seems this particular program wasn’t written to be able to do that.
And it’s so painful for me since my workload is completely memory bandwidth bound but does not scale across multiple sockets, much less multiple nodes.
96 cores per socket? 😎
12 channels of ram per socket? 🤩
DDR5-4800? 🫠
Just curious, but what workload doesn’t scale across multiple sockets? Gaming?
A specialized computational fluid dynamics program. And nominally CFD is great to scale across many sockets/systems but it seems this particular program wasn’t written to be able to do that.