AMD Zen 5 Motor de ejecución filtrado, Cuenta con FPU real de 512 bits


AMD “Zen5” CPU microarchitecture will introduce a significant performance increase for AVX-512 workloads, with some sources reported as high as 40% performance increases over “Zen 4” in benchmarks that use AVX-512. A Moore’s Law is Dead report detailing the execution engine of “Zen5” holds the answer to how the company managed this—using a true 512-bit FPU. Actualmente, AMD uses a dual-pumped 256-bit FPU to execute AVX-512 workloads on “Zen 4.” The updated FPU should significantly improve the core’s performance in workloads that take advantage of 512-bit AVX or VNNI instructions, such as AI.

Giving “Zen5” a 512-bit FPU meant that AMD also had to scale up the ancillaries—all the components that keep the FPU fed with data and instructions. The company therefore increased the capacity of the L1 DTLB. The load-store queues have been widened to meet the needs of the new FPU. The L1 Data cache has been doubled in bandwidth, and increased in size by 50%. The L1D is now 48 KB in size, desde 32 KB in “Zen 4.” FPU MADD latency has been reduced by 1 cycle. Besides the FPU, AMD also increased the number of Integer execution pipes to 10, desde 8 en “Zen 4.” The exclusive L2 cache per core remains 1 MB in size.

Actualizar 07:02 UTC: Moore’s Law is Dead reached out to us and said that the slide previously posted by them, which we had used in an earlier version of this article, is fake, but said that the information contained in that slide is correct, and that they stand by the information.