We’ve partnered with @AMD, @Broadcom, @Intel, @Microsoft, and @NVIDIA to release Multipath Reliable Connection (MRC), a new open networking protocol that helps large AI training clusters run faster and more reliably, with less wasted GPU time.
https://openai.com/index/mrc-supercomputer-networking/