Nvidia, Intel Show First Tests to Train GPT-3
| Date | 28th, Jun 2023 |
|---|---|
| Source | IEEE Spectrum - Scientific and Educational Websites |
DESCRIPTION
For the first time, a large language model—a key driver of recent AI hype and hope—has been added to MLPerf, a set of neural-network training benchmarks that have previously been called the Olympics of machine learning. Computers built around Nvidia’s H100 GPU and Intel’s Habana Gaudi2 chips were the first to be tested on how quickly they could perform a modified train of GPT-3, the large language model behind ChatGPT.