Meta Shows Open-Architecture NVIDIA "Blackwell" GB200 System for Data Centers

During the Open Compute Project (OCP) Summit 2024, Meta, one of the prime members of the OCP project, showed its NVIDIA “Blackwell” GB200 systems for its massive data centers. We previously covered Microsoft’s Azure server rack with GB200 GPUs, which dedicates one-third of the rack space to computing and two-thirds to cooling. A few days later, Google showed off its smaller GB200 system, and today Meta is showing off its GB200 system, the smallest of the bunch. To train a dense transformer large language model with 405B parameters and a context window of up to 128k tokens, like Llama 3.1 405B, Meta had to redesign its data center infrastructure to run a distributed training job on two 24,000-GPU clusters. That is 48,000 GPUs used for training a single AI model.

Called “Catalina,” it is built on the NVIDIA Blackwell platform, emphasizing modularity and adaptability while incorporating the latest NVIDIA GB200 Grace Blackwell Superchip. To address the escalating power requirements of GPUs, Catalina introduces the Orv3, a high-power rack (HPR) capable of delivering up to 140kW. The comprehensive liquid-cooled setup encompasses a power shelf supporting various components, including a compute tray, switch tray, the Orv3 HPR, a Wedge 400 fabric switch with 12.8 Tbps switching capacity, a management switch, battery backup, and a rack management controller. Interestingly, Meta also upgraded its “Grand Teton” system for internal usage, such as deep learning recommendation models (DLRMs) and content understanding, with AMD Instinct MI300X. Those are used for inference on internal models, and MI300X appears to provide the best performance per dollar for inference. According to Meta, the computational demand stemming from AI will continue to increase exponentially, so more NVIDIA and AMD GPUs are needed, and we can’t wait to see what the company builds.
