How Hyperparameter Optimization Improves Machine Learning Accuracy

In January, a team of Cray developers and researchers published a paper, “Recombination of Artificial Neural Networks,” highlighting the hyperparameter optimization (HPO) capability Cray announced in November. We cover their findings in this blog post. Using a variety of high-performance computing systems and neural network models, the Cray team demonstrated that the hyperparameter optimization capabilities introduced in the Cray® Urika®-CS and Urika®-XC AI and analytics software suites improve both the time-to-accuracy and the final accuracy of machine learning models trained on Cray systems. The table below, excerpted from the paper, highlights the improvements achieved using Cray’s HPO capability across a range of ...

Deep Learning at Scale Using Cray Distributed Training

This article was written by the following Cray and NERSC contributors: Steve Farrell, Machine Learning Engineer; Thorsten Kurth, Application Performance Specialist; Jacob Balma, Performance Engineer; Peter Mendygral, Performance and Software Engineer; Nick Hill, Software Engineer. Deep neural networks (DNNs) are revolutionizing science across many domains, including high energy physics, cosmology, biology, and climate. As the field of deep learning advances, DNN architectures grow more sophisticated and more capable of solving complex scientific tasks such as classification, regression, and simulation. Training and evaluating such models requires increasingly large datasets and computing resources. Through the NERSC Big Data ...

Supercomputer Shaheen II Flying Full Speed at KAUST

At King Abdullah University of Science and Technology, high-performance computing is in our DNA. We launched our first supercomputer, Shaheen I, at our university’s founding in 2009. Six years later, we put Shaheen II, our second system, into service. Since its launch in 2015, Shaheen II has been the computational engine behind several record-breaking simulations and many significant discoveries. It’s the largest and most powerful supercomputer in the Middle East, and its speed and performance have attracted some of the brightest computationally oriented scholars from all over the world. As such, Shaheen II has been integral in helping KAUST grow as a destination for science and technology research. A fun fact about Shaheen is that ...

Meet Slingshot: An Innovative Interconnect for the Next Generation of Supercomputers

Today Cray is announcing our next-generation “Shasta™” supercomputer, featuring our latest generation of scalable interconnect, code-named “Slingshot.” As a long-time network architect, I’m pretty excited. Slingshot is our 8th major generation of scalable HPC network, and there have been some great milestones along the way. We started back in 1992 with the Cray T3D, Cray’s first massively parallel system. Implemented in BiCMOS, its network latency was just 12 ns per hop. That was followed in 1996 by the pioneering Cray T3E system, which had the first-ever implementation of adaptive routing in an HPC network (by a long shot!). In 2005, Cray pioneered the design of high-radix switches. Our YARC switch for the Cray X2 implemented ...

Bloomsbury AI Uses Cray Tech to Top Artificial Intelligence Record

One of our initiatives here at Cray is to provide supercomputer resources so innovators can advance artificial intelligence in its many forms and, ultimately, human understanding. One way we do this is through a partnership with the UK innovation center Digital Catapult and its Machine Intelligence Garage program. Machine Intelligence Garage helps businesses access the computation power and expertise they need to develop and build machine learning and artificial intelligence solutions. As part of this partnership, one of the program participants, Bloomsbury AI, used a Cray® CS-Storm™ system in the Cray Accel AI™ lab to train and optimize the deep learning models within Cape, an open-source technology that can answer ...