DOE and Cray Announce Exascale Supercomputer Frontier

Here at Cray, we’re charging into the exascale era. We followed March’s announcement of the United States' first exascale system ― Argonne National Laboratory’s Aurora supercomputer ― with another last week. At an Oak Ridge National Laboratory-hosted event, the U.S. Department of Energy (DOE) revealed Frontier, an even more powerful exascale supercomputer expected to land at ORNL in 2021. Frontier will be based on our new Shasta™ architecture and Slingshot™ interconnect and will feature high-performance AMD EPYC™ CPU and AMD Radeon Instinct™ GPU technology. And the stats on the system are impressive: 1.5 exaflops of performance (more operations per second than 160 of today’s top supercomputers ... [ Read More ]

How Hyperparameter Optimization Improves Machine Learning Accuracy

In January, a team of Cray developers and researchers published a paper, “Recombination of Artificial Neural Networks,” on arXiv.org, highlighting the hyperparameter optimization (HPO) capability Cray announced in November. We cover their findings in this blog post. Using a variety of high-performance computing systems and neural network models, the Cray team demonstrated that the hyperparameter optimization capabilities introduced in the Cray® Urika®-CS and Urika®-XC AI and analytics software suites improve the time-to-accuracy as well as final accuracy of machine learning models trained on Cray systems. The table below, excerpted from the paper, highlights the improvements achieved using Cray’s HPO capability across a range of ... [ Read More ]

Meet Slingshot: An Innovative Interconnect for the Next Generation of Supercomputers

Today Cray is announcing our next-generation “Shasta™” supercomputer, featuring our latest generation of scalable interconnect, code-named “Slingshot.” As a long-time network architect, I’m pretty excited. Slingshot is our 8th major generation of scalable HPC network, and there have been some great milestones along the way. We started back in 1992 with the Cray T3D, Cray's first massively parallel system. Implemented in BiCMOS, its network latency was just 12 ns per hop. That was followed in 1996 by the pioneering Cray T3E system, which had the first-ever implementation of adaptive routing in an HPC network (by a long shot!). In 2005, Cray pioneered the design of high-radix switches. Our YARC switch for the Cray X2 implemented ... [ Read More ]

3 Reasons Why CosmoFlow on a Cray System is a Big Deal

Today, Cray, NERSC (the National Energy Research Scientific Computing Center at Lawrence Berkeley National Laboratory) and Intel announced the results of a three-way collaboration around CosmoFlow, a deep learning 3D convolutional neural network (CNN) that can predict cosmological parameters with unprecedented accuracy using the Intel-powered Cray® XC™ series “Cori” supercomputer at NERSC. Supercomputers are unique in their ability to be instruments of discovery for problems on the smallest and largest of scales — from subatomic scale to the cosmos. Cosmologists who study the origin, evolution and eventual fate of the universe use a combination of empirical observations and theoretical computer simulations to define and refine a ... [ Read More ]

Can LS-DYNA Scale Any Higher?

Processing and memory bottlenecks can run but they can’t hide. Not indefinitely, at least. And especially not when four technology leaders combine efforts against them. Cray, Livermore Software Technology Corporation (LSTC), the National Center for Supercomputing Applications (NCSA) and Rolls-Royce are partnering on an ongoing project to explore the future of implicit finite element analyses of large-scale models using LS-DYNA, a multiphysics simulation software package, and Cray supercomputing technology. As the scale of finite element models — and the systems they run on — increase, so do scaling issues and the amount of time it takes to run a model. Understanding that, ultimately, only time and resource constraints limit the size ... [ Read More ]