AMD Instinct MI200: Chiplet a doppia GPU; Architettura CDNA2; 128 GB HBM2E -

AMD today announced the debut of its 6 nm CDNA2 (Compute-DNA) architecture in the form of the MI200 family. The new, dual-GPU chiplet accelerator aims to lead AMD into a new era of High Performance Computing (HPC) applicazioni, the high margin territory it needs to compete in for continued, sustainable growth. A tal fine, AMD has further improved on a matured, compute-oriented architecture born with Graphics Core Next (GCN) – and managed to improve performance while reducing total die size compared to its MI100 family.AMD’s MI250X accelerator features two compute dies with 58 Billion transistors built out of TSMC’s 6 processo nm. Each of these chips features a total of 110 Unità di calcolo (CU) per un totale di 220 CUs on a single accelerator. The new CDNA2 architecture also incorporates new, improved Matrix Cores to the tune of 880 Unità (440 per chip). And as the MI250X is configured, that incredible amount of GPU power is paired with 128 GB di memoria HBM2E, in esecuzione a 3.2 GB/s. AMD’s performance estimates against NVIDIA’s current-gen A100 are blowouts. Compared to the A100, the MI250X is quoted as being: 4.9 times faster at FP64 vector compute; 2.5 times as fast in FP 32 Vettore; 4.9 times faster in FP64 Matrix; a more meager 1.2 times faster performance on FP16 and BF16 Matrix operations; 1.6 times bigger memory capacity (128 GB on the MI 250X compared to the A100’s 80 GB); and a 1.6 times faster memory bandwidth (though that’s derived from the faster HBM2E memory).The two CDNA2 dies are linked together AMD’s Infinity Fabric, which makes its debut on a graphics architecture. This link provides a series of 25 Gbps links offering up to 100 GB/s of bi-directional bandwidth between both GPUs. There are eight available links in the MI200’s distribution module – built to the specifications of an OAM (OCP Accelerator Module, where OCP stands for “Open Compute Platform”) configurazione. In totale 800 GB/s of bandwidth are available for on-the fly communication between the two chiplets. AMD already announced that a PCIe version of the MI200 is launching in the future, catering to those who just want drop-in replacements or upgrades.

AMD’s usage of TSMC’s N6 fabrication technology can certainly account for part of the performance and die size improvement. As any manufacturer will do, AMD are employing yield optimizing efforts. This becomes clear when we look at the other product AMD is introducing alongside the MI250X: the MI250 accelerator. The MI250 takes a hit on computational resources by dropping from the MI250X’s 110 CUs to 104 CUs per chiplet. That’s actually the only change; it should be around 5% slower than the fully-enabled MI250X.

All in all, the MI200 series is a marked improvement for AMD performance-wise. Eppure, NVIDIA is sure to announce their own next-generation compute solution soon. How will the updated, CDNA2-powered MI200 series stand?

Notizie

Baldur’s Gate 3 dev continues to champion indie games, begging Steam to let him overpay the creators of his favorite games

I migliori download di PlayStation Store di 2024 – PlayStation.Blog

15 Best Sidequests In Switch Games

Assassin’s Creed Shadows had to have 2 different characters because players want both stealth and combat, and you couldn’t have both with just one

Prepare for Monster Hunter Wilds with a look back at the series’ evolution on PlayStation – PlayStation.Blog

How to get Diablo 4 Coven’s Favor fast

The Dark Ages launches on PS5 May 15 – id Software reveals new gameplay details – PlayStation.Blog

Azione della settimana: Wuthering Waves – PlayStation.Blog

Sonic X Shadow Generations Speeds Past Two Million Sales Worldwide

AMD Instinct MI200: Chiplet a doppia GPU; Architettura CDNA2; 128 GB HBM2E