AMD EPYC™ Processors and New AMD Instinct™ MI100 Accelerator

During this year’s SC20 virtual tradeshow, AMD (NASDAQ: AMD) is showcasing its leadership in the high performance computing (HPC) industry. It launched the new AMD Instinct™ MI100 accelerator with ROCm™ 4.0 open ecosystem support and showcased a growing list of AMD EPYC™ CPU and AMD Instinct accelerator based deployments, and highlighted its collaboration with Microsoft Azure for HPC in the cloud. AMD also remains on track to begin volume shipments of the 3^rd Gen EPYC processors with “Zen 3” core to select HPC and cloud customers this quarter in advance of the expected public launch in Q1 2021, aligned with OEM availability.

The new AMD Instinct™ MI100 accelerator, is the world’s fastest HPC GPU accelerator for scientific workloadsand the first to surpass the 10 teraflops (FP64) performance barrier [i]. Built on the new AMD CDNA architecture, the AMD Instinct MI100 GPU enables a new class of accelerated systems for HPC and AI when paired with 2^nd Gen AMD EPYC processors. Supported by new accelerated compute platforms from Dell, HPE, Gigabyte and Supermicro, the MI100, combined with AMD EPYC CPUs and ROCm 4.0 software, is designed to propel new discoveries ahead of the exascale era.

“No two customers are the same in HPC, and AMD is providing a path to today’s most advanced technologies and capabilities that are critical to support their HPC work, from small clusters on premise, to virtual machines in the cloud, all the way to exascale supercomputers,” said Forrest Norrod, senior vice president and general manager, Data Center and Embedded Solutions Business Group, AMD. “Combining AMD EPYC processors and Instinct accelerators with critical application software and development tools enables AMD to deliver leadership performance for HPC workloads.”

AMD and Microsoft Azure Power HPC In the Cloud

Azure is using 2^nd Gen AMD EPYC processors to power its HBv2 virtual machines (VMs) for HPC workloads. These VMs offer up to 2x the performance of first-generation HB-series virtual machines[ii], can support up to 80,000 cores for MPI jobs[iii], and take advantage of 2^nd Gen AMD EPYC processors’up to 45% more memory bandwidth than comparable x86 alternatives[iv].

HBv2 VMs are used by numerous customers including The University of Illinois at Urbana-Champaign’s Beckman Institute for Advanced Science & Technology which used 86,400 cores to model a plant virus that previously required a leadership class supercomputer and the U.S. Navy which rapidly deploys and scales enhanced weather and ocean pattern predictions on demand. HBv2 powered by 2^nd Gen AMD EPYC processors also provides the bulk of the CPU compute power for theOpenAI environment Microsoft announced earlier this year.

AMD EPYC processors have also helped HBv2 reach new cloud HPC milestones, such as a new record for Cloud MPI scaling results with NAMD, Top 20 results on the Graph500, and the first 1 terabyte/sec cloud HPC parallel filesystem. Across these and other application benchmarks, HBv2 is delivering 12x higher scaling than found elsewhere on the public cloud.

Adding on to its existing HBv2 HPC virtual machine powered by 2^nd Gen AMD EPYC processors, Azure announced it will utilize next generation AMD EPYC processors, codenamed ‘Milan’, for future HB-series VM products for HPC.

You can see more about the AMD and Azure collaboration in this video with Jason Zander of Azure and Lisa Su of AMD.

AMD Continues to Be the Choice for HPC

AMD EPYC processors and Instinct accelerators have the performance and capabilities to support numerous HPC workloads across a variety of implementations. From small clusters at research centers, to commercial HPC, to off premise and in the cloud, to exascale computing, AMD continues to provide performance and choice for HPC solutions.

Hewlett Packard Enterprise (HPE), CSC Finland and EuroHPC recently introduced a new pre-exascale system, LUMI. Based on the HPE Cray EX supercomputer architecture, LUMI will use next generation AMD EPYC CPUs and Instinct accelerators and is expected to provide a peak performance of 552 petaflops when it comes online in 2021, making it one of the fastest supercomputers in the world.

Beyond LUMI, AMD powered HPC systems continue to grow in volume. Since SC19, there have been more than 15 supercomputing systems announced using AMD EPYC CPUs, Instinct GPUs, or both. A highlight of the systems includes

Chicoma – Los Alamos National Laboratory– this system is based on the HPE Cray EX supercomputer architecture and uses 2^nd Gen AMD EPYC CPUs, combined with 300 terabytes of system memory for COVID-19 research,
Corona – Lawrence Livermore National Laboratory – this system was recently upgraded with funding from the Coronavirus Aid, Relief and Economic Security (CARES) Act, adding nearly 1,000 AMD Instinct MI50 accelerators, pushing peak performance to more than 11 petaFLOPS,

Mammoth – Lawrence Livermore National Laboratory– the “big memory” cluster uses 2^nd Gen AMD EPYC Processors to perform genomics analysis and graph analytics required by scientists working on COVID-19.
Northern Data – a distributed computing system in Europe that is using AMD EPYC CPUs and Instinct accelerators for large scale HPC applications such as rendering, artificial intelligence and deep learning,
Pawsey Supercomputing Centre – Using the HPE Cray EX supercomputer architecture and future AMD EPYC CPUs and AMD Instinct accelerators, the supercomputer at Pawsey will be Australia’s most powerful supercomputer.

In addition, AMD is also powering the following supercomputers: Anvil and Bell – Purdue University, Big Red 200 – Indiana University, Bridges 2 – Pittsburgh Supercomputing Center, CERN, European Centre for Medium-Range Weather Forecasts, Expanse – San Diego Supercomputer Center, Goethe University Frankfurt, IT4Innovations National Supercomputing Center, Jetstream 2 – Indiana University, Mahti– CSC, Mangi – University of Minnesota, National Oceanic and Atmospheric Administration,Red Raider – Texas Tech University, TinkerCliffs– Virginia Tech.

“With the Expanse supercomputer, our goal is to give scientists and researchers cloud-like access to a high-performance machine that can handle everything from astrophysics to zoology,” said Michael Norman, Director of the San Diego Supercomputer Center. “The 2nd Gen AMD EPYC processors have helped us achieve fantastic performance with Expanse, enabling our researchers to do more science than before. We also have a great collaboration with AMD and have worked together to create a forum for AMD HPC customers to share experiences, information and more, to better benefit HPC research.”

Paving the Path to Exascale Computing

To help researchers start on the path to exascale, AMD has provided Oak Ridge National Labs access to the new AMD Instinct MI100 accelerator, which delivers a giant leap in compute and interconnect performance. The Instinct MI100 accelerator enables a new class of accelerated systems and delivers true heterogeneous compute capabilities from AMD for HPC and AI. Designed to complement the 2^nd Gen AMD EPYC processors, and built on the AMD Infinity Architecture, the AMD Instinct MI100 delivers true heterogeneous compute capabilities from AMD for HPC and AI.

“Frontier, powered by AMD, represents a huge increase in computational power compared to today’s systems. It’s going to allow scientists to answer questions that we didn't have the answer to before,” said Bronson Messer, director of science, Oak Ridge Leadership Computing Facility. “The ability to run molecular simulations that aren't just a few million atoms, but a few billion atoms, provides a more realistic representation of the science, and to be able to do that as a matter of course and over and over again will lead to a significant amount of important discoveries.”

AMD continues to provide the performance, capabilities and scale needed to power current and future HPC workloads, no matter if they are helping students at a research center, improving aerodynamic efficiency for an auto manufacturer, or providing valuable insights for critical medical breakthroughs. Read more about the AMD presence at SC20 and its HPC capabilities here.

[i] Calculations conducted by AMD Performance Labs as of Sep 18, 2020 for the AMD Instinct™ MI100 (32GB HBM2 PCIe® card) accelerator at 1,502 MHz peak boost engine clock resulted in 11.54 TFLOPS peak double precision (FP64), 46.1 TFLOPS peak single precision matrix (FP32), 23.1 TFLOPS peak single precision (FP32), 184.6 TFLOPS peak half precision (FP16) peak theoretical, floating-point performance. Published results on the NVidia Ampere A100 (40GB) GPU accelerator resulted in 9.7 TFLOPS peak double precision (FP64). 19.5 TFLOPS peak single precision (FP32), 78 TFLOPS peak half precision (FP16) theoretical, floating-point performance. Server manufacturers may vary configuration offerings yielding different results. MI100-03

[ii] Source: https://azure.microsoft.com/en-us/blog/introducing-the-new-hbv2-azure-virtual-machines-for-high-performance-computing/

[iii] Source: https://azure.microsoft.com/en-us/blog/azure-hbv2-virtual-machines-eclipse-80000-cores-for-mpi-hpc/

[iv] AMD EPYC™ 7002 Series processors have 45% more memory bandwidth than Intel Scalable processors in the same class.