Apple Trained its Apple Intelligence Models on Google TPUs, Not NVIDIA GPUs -

Apple has disclosed that its newly announced Apple Intelligence features were developed using Google’s Tensor Processing Units (TPUs) rather than NVIDIA’s widely adopted hardware accelerators like H100. This unexpected choice was detailed in an official Apple research paper, shedding light on the company’s approach to AI development. The paper outlines how systems equipped with Google’s TPUv4 and TPUv5 chips played a crucial role in creating Apple Foundation Models (AFMs). These models, including AFM-server and AFM-on-device, are designed to power both online and offline Apple Intelligence features introduced at WWDC 2024. For the training of the 6.4 billion parameter AFM-server, Apple’s largest language model, the company utilized an impressive array of 8,192 TPUv4 chips, provisioned as 8×1024 chip slices. The training process involved a three-stage approach, processing a total of 7.4 trillion tokens. Meanwhile, the more compact 3 billion parameter AFM-on-device model, optimized for on-device processing, was trained using 2,048 TPUv5p chips.

Apple’s training data came from various sources, including the Applebot web crawler and licensed high-quality datasets. The company also incorporated carefully selected code, math, and public datasets to enhance the models’ capabilities. Benchmark results shared in the paper suggest that both AFM-server and AFM-on-device excel in areas such as Instruction Following, Tool Use, and Writing, positioning Apple as a strong contender in the AI race despite its relatively late entry. However, Apple’s penetration tactic into the AI market is much more complex than any other AI competitor. Given Apple’s massive user base and millions of devices compatible with Apple Intelligence, the AFM has the potential to change user interaction with devices for good, especially for everyday tasks. Hence, refining AI models for these tasks is critical before massive deployment. Another unexpected feature is transparency from Apple, a company typically known for its secrecy. The AI boom is changing some of Apple’s ways, and revealing these inner workings is always interesting.

News

Payday 3, High on Life, Pac-Man World Re-Pac – PlayStation.Blog

Random: Local Supermarket Wins Trademark Battle Against Nintendo

Gorgeous JRPG homage Clair Obscur sells out its collector’s edition months before launch, dev says it didn’t think “the demand for our physical editions would be so high”

Introducing MLB The Show x Homage apparel partnership – PlayStation.Blog

What Do We Actually Want From ‘Mario Kart 9’? – Talking Point

As No Man’s Sky adds “billions of new solar systems and trillions of new planets,” Hello Games says it’s also “extremely busy” with its open-world survival RPG Light No Fire

Marvel’s Spider-Man 2 PC features and ray-tracing options detailed, out tomorrow – PlayStation.Blog

GTA 6 will run at 30fps on console, former GTA 5 and Red Dead Redemption 2 animator expects, and your best shot at 60fps may be hoping for PC

No Man’s Sky’s latest update introduces billions of new stars, planets, and more today – PlayStation.Blog

Apple Trained its Apple Intelligence Models on Google TPUs, Not NVIDIA GPUs