The company has previously said that this generation of NVLink delivers up to 1.8 TB/s of bidirectional throughput per GPU. Nvidia said the GB200 NVL4 Superchip features 1.3 TB of coherent memory ...
Iniysa on X reports that Apple's M4 Max completed an audio transcription with Whisper V3 Turbo in half the time taken by Nvidia's Ampere-based RTX A5000 GPU while drawing roughly one-eighth of the power.
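
Because energy is average power multiplied by time, the two figures in that report can be combined into a rough energy-per-job comparison. The sketch below is only illustrative: it assumes the ratios are exactly 1/2 for time and 1/8 for power (the report gives rounded figures), treats the power figure as an average over the whole run, and uses a hypothetical helper name, `relative_energy`.

```python
def relative_energy(time_ratio: float, power_ratio: float) -> float:
    """Energy used by device A relative to device B.

    Both arguments are A's value expressed as a fraction of B's.
    Energy = average power x time, so the ratios multiply.
    """
    return time_ratio * power_ratio


# Assumed round figures: half the time, about one-eighth of the power.
m4_vs_a5000 = relative_energy(time_ratio=0.5, power_ratio=1 / 8)
print(f"Relative energy per job: {m4_vs_a5000:.4f}")  # 0.0625, about 1/16
```

Under those assumptions, the M4 Max would use on the order of one-sixteenth of the energy for the same job. The actual figure depends on how power was measured, which the report excerpt does not specify.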