Healthcare & Life Sciences

Cray Solutions for Healthcare and Life Sciences

Get to the answer faster.

INSIGHTS AT THE SPEED OF LIFE

POWERING FASTER R&D ANALYTICS AND BETTER PATIENT OUTCOMES

Advances in affordable sequencing and high-resolution imaging are producing extremely large, varied and complex data sets at ever-higher levels of sample specificity and sensitivity. Combine this with novel analytical methods and new computational tools to explore and collaborate on these data in minutes and at scale, and the stage is set for faster drug discovery, precision disease treatment, improved patient outcomes and more-efficient healthcare.

Cray has leveraged 40 years of dominance in HPC and supercomputing to build a platform that enables all the core components for precision medicine, with the power to take your data scientists from image and sequence analysis to computational modeling; your researchers from analysis tools and machine-learning frameworks to big-graph analytics; and your clinicians from big data store to connectors, query engines and productivity tools. All while minimizing the impact on your IT environment, and protecting your data and your patients.

Areas in which we collaborate with healthcare and life sciences organizations include:

Medical Imaging

  • Analyzing medical images at scale
  • Enabling computational pathology
  • Cryo-electron microscopy

Simulation for Life Sciences

  • Improving the resolution of molecular dynamics

Next-Generation Sequencing for Healthcare & Life Sciences

  • Assembling a large plant or human genome de novo in minutes
  • Finding statistically significant variants in 10,000+ genomes
  • Leveraging big data technologies to contextualize NGS results

Cybersecurity for Healthcare

  • Protecting patients and their data from cyberattacks

Featured Resources

Cray Announces Agile Analytics Platform

The Broad Institute says Cray's new Urika-GX platform “highlights the potential to accelerate delivery of genomic insights to researchers who are making breakthroughs in the fight against disease.”

Enabling Scientific Breakthroughs at the Petascale

If you think the requirements for enterprise storage systems are growing at a dizzying pace, try these numbers on for size...

Cray Storage Solutions for Life Sciences

Genomics, structural biology, informatics, drug discovery, materials science and other life sciences benefit from Cray’s expert solutions. 

Healthcare & Life Sciences Solutions

Imaging Platform to Revolutionize Patient Outcomes

Taking the leap of faith to run code optimized for graphics processing unit (GPU) architectures can provide significant performance benefits for imaging applications, but it can also introduce risks, including reduced overall system reliability, poor upgrade suitability and tedious system management.

Biomedical Imaging Performance at Scale

Migrating to GPU architectures can improve performance by 10 to 20 times for medical imaging applications, but there are potentially many bottlenecks in getting there. Pioneers running medical imaging applications on GPU clusters often face daunting challenges, including:

  • System throttling – Running GPUs at high utilization for extended periods will often create thermal issues, causing systems to throttle and slow down.
  • System instability – Thermal and other performance issues will often impact reliability, causing the system to stutter and ultimately fail, a troubling scenario in a healthcare environment.

Imaging for Digital Pathology Workflows

Cray's CS-Storm cluster supercomputer is well-suited to accelerating digital pathology workflows at scale. Leveraging 40 years of HPC expertise, Cray built the CS-Storm system with balanced I/O, memory and CPU/GPU capabilities to handle today's and tomorrow's most demanding digital pathology workflows. According to NVIDIA, the CS-Storm system is the only cluster capable of running both CPUs and GPUs at maximum sustained utilization, which means throttling or temperature issues won't interrupt your workflow. You'll get the reliable performance you demand in a healthcare environment, instead of the unpredictable performance other GPU clusters provide.

Accelerating Cryo-EM Workflows

Cryo-electron microscopy (cryo-EM) is rapidly becoming a key technology for 3D molecular structure determination. Cray's CS-Storm cluster is ideally suited for cryo-EM workflows in single-particle imaging, where iterative mathematical 3D reconstruction processes are applied to large numbers of 2D particle images to produce quality 3D models.

Improving Pathology Productivity With Machine Learning

As the volume of digital images explodes from myriad sources, it becomes challenging for healthcare professionals to sort through the uninteresting images or sections of images to find that needle in a haystack. Machine learning may be beneficial in this area, not by replacing the function of the healthcare professional, but by improving the signal-to-noise ratio.

Platforms Empowering NGS Workflows for Discovery

As the costs of sequencing continue to drop, there has been a corresponding increase in demand for compute and storage solutions. Many organizations are struggling to keep up. Regardless of whether you're working in a research or a clinical setting, Cray can help future-proof your lab with an upgradable compute infrastructure capable of managing your entire NGS ecosystem and scaling as your needs change.

De Novo Assembly at Scale

De novo assembly represents one of the most computationally challenging methods in biology, and it plays a critical role in metagenomics, agrigenomics and exploring certain regions of the human genome. Learn how Cray has helped researchers scale this difficult computation, delivering a complete de novo assembly of the human genome in under nine minutes.

Leverage Big Data Technologies to Contextualize NGS Results

For many researchers, a fully sequenced sample is where their scientific exploration begins. The process of analyzing and interpreting NGS data can be greatly enhanced by leveraging the emerging collection of big data technologies, including Apache Spark™ and the Cray Graph Engine (CGE).

Apache Spark brings big data analytics to the masses, allowing researchers to quickly process, transform and explore mountains of data (both structured and unstructured) in memory. Cray's Urika®-GX platform has been engineered to deliver industry-leading performance executing Spark workflows.

The Cray Graph Engine allows researchers to take advantage of graph analytics at an unprecedented scale to help support genomic interpretation and contextualization. CGE delivers this unique capability as an open standard SPARQL RDF database, making it easy to get started. By supporting open standards, CGE can immediately leverage important, publicly available data sources like EBI's Uniprot, Reactome and ChEMBL.

Empowering Healthcare Analytics

In recent years, healthcare organizations globally have been facing increasing pressure to refocus their businesses away from volume-based drivers toward value-based metrics and improved patient outcomes. This fundamental shift in priorities often leads to broad organizational and operational changes, after which many organizations find their existing infrastructure lacks the analytical capability to support these important changes.

Cray's Urika®-GX platform, preconfigured with Apache Spark and the Cray Graph Engine (CGE), delivers unparalleled performance and can help support projects focused on:

  • Improving quality of care and patient outcomes
  • Reducing medical errors
  • Monitoring pay-for-performance metrics
  • Optimizing supply chain and reducing costs
  • Reducing fraud and overpayment
  • Improving utilization

Breakthrough Molecular Dynamics Performance at Massive Scale

Molecular dynamics, once one of the world's grand challenges, is now seeing broader adoption and application within the life sciences community. As organizations apply these codes to new problems (pharmaceutical companies tracking rapid processes, for example), they typically find that their molecular dynamics software doesn't run well on commodity clusters.

The Solution: Uncompromising Supercomputers for Molecular Dynamics

Cray® XC™ supercomputers are capable of handling some of the world's largest simulations, scaling nearly linearly up to tens of thousands of nodes. And Cray has decades of experience working with life sciences academic pioneers and software vendors. Together we have optimized simulation performance to better leverage the advanced performance capabilities in the Cray XC series, such as Aries™ interconnects and the DataWarp™ applications I/O accelerator.

Most significantly, organizations don't have to deploy a 10,000-node supercomputer to get many of the benefits of the XC line. The XC system is a modular platform, allowing organizations to start with a very small air-cooled cabinet and grow their footprint if and when their needs demand it. They can also mix and match technologies, adding GPUs should they deploy code that's optimized for them.

Protecting Patients and Their Data

For healthcare organizations, insurance companies, hospitals and other care providers, patient data is a critical asset — and one that often makes these organizations the victims of hackers and other cyber criminals.

To guard against ransomware and similar threats, healthcare and life sciences companies trust Cray to provide reliable, scalable systems for cybersecurity. Cray offers an agile analytics platform with the convenience of an appliance and the openness and flexibility of a custom-built solution.

Challenges of Cybersecurity in Healthcare and Life Sciences

  • Hospitals and other healthcare providers are desirable and often easier targets for hackers and cyberattackers.
  • Data in this space is massive and rapidly growing, including patient care information, genomics data and other personal data — legacy analytics approaches can't uncover the insights needed to identify threats.
  • Many organizations do not have the budget or means to build a powerful big data analytics system on their own.
  • Data often sits in silos across departments and locales, so performing analytics can be time consuming and unproductive.

Protecting the Security of Healthcare Data

  • Identify threats faster – With the Urika®-GX platform's powerful graph engine, healthcare organizations can quickly gain the insights needed to see and respond to threats.
  • Improve responsiveness and agility – With the open, flexible Urika-GX system, analysts can run both Hadoop®/Spark™ and graph workloads to reveal connections that would have been nearly impossible to identify before.
  • Reduce infrastructure TCO – High-performance and scalable Cray solutions help cut the cost of cyberanalytics, from power and upgradability to reduction of data movement and increased efficiency, while helping you manage the risk of provisioning information technology in the face of ever-changing scientific technologies.

The Mayo Clinic leverages Cray analytics to develop advanced capabilities to gain an edge on attackers.

—Mayo Clinic

View Bioinformatics Applications
Application
Owner
Description
Application:AbokiaBLAST
Owner:Commercial

AbokiaBLAST is a parallel implementation of NCBI BLAST created by the inventors of the open-source mpiBLAST project. AbokiaBLAST inherits the super-scalable architecture from mpiBLAST but is re-factored and re-engineered to offer production quality. With intelligent task parallelization and I/O optimization, AbokiaBLAST enables users to massively accelerate large-scale BLAST search on clusters or supercomputers with a single command.

Application:ABySS
Owner:Open Source

ABySS is a de novo, parallel, paired-end sequence assembler designed for short reads. The single-processor version is useful for assembling genomes up to 100 Mbases in size. The parallel version is implemented using MPI and is capable of assembling larger genomes.

Application:ALLPATHS-LG
Owner:Open Source

ALLPATHS-LG is a short-read assembler that works on both small and large (mammalian size) genomes.

Application:BLAST
Owner:Open Source

The Basic Local Alignment Search Tool (BLAST) finds regions of local similarity between sequences. The program compares nucleotide or protein sequences to sequence databases and calculates the statistical significance of matches. BLAST can be used to infer functional and evolutionary relationships between sequences and help identify members of gene families.
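
To make "regions of local similarity" concrete, here is a minimal Python sketch of Smith-Waterman local alignment scoring. This is a swapped-in textbook technique, not BLAST itself: BLAST relies on faster seed-and-extend heuristics and also reports statistical significance, which this sketch does not attempt. The function name and the scoring values (match, mismatch, gap) are illustrative assumptions.

```python
def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-2):
    """Return the best local alignment score between sequences a and b.

    Uses a linear gap penalty and keeps only two rows of the
    dynamic-programming table, so memory is O(len(b)).
    """
    prev = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            # Score for aligning a[i-1] with b[j-1]
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # Local alignment: a cell never drops below zero
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best


if __name__ == "__main__":
    # The shared "ACG" segment contributes three matches
    print(smith_waterman_score("TTACGTTT", "GGACGGG"))  # 6
```

The zero floor in each cell is what makes the alignment local: a poor region resets the score instead of dragging down a good region elsewhere.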

Application:Bowtie
Owner:Open Source

Bowtie is an ultrafast, memory-efficient short read aligner. It aligns short DNA sequences (reads) to the human genome at more than 25 million 35bp reads per hour. Bowtie indexes the genome with a Burrows-Wheeler index to keep its memory footprint small: typically about 2.2 GB for the human genome (2.9 GB for paired-end).
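
As a rough illustration of the Burrows-Wheeler idea mentioned above, the following Python sketch builds the transform of a short string by sorting its rotations, then counts exact pattern matches with backward search. It is a toy under stated assumptions, not Bowtie's implementation: the function names are hypothetical, and a production index would use suffix arrays and precomputed, compressed occurrence tables rather than naive rotation sorting and repeated counting.

```python
from collections import Counter


def bwt(text):
    """Return the Burrows-Wheeler transform of text, with '$' as end marker."""
    assert "$" not in text
    s = text + "$"
    rotations = sorted(s[i:] + s[:i] for i in range(len(s)))
    return "".join(rot[-1] for rot in rotations)


def count_occurrences(bwt_string, pattern):
    """Count exact occurrences of pattern using backward search on the BWT."""
    # first[c] = number of characters in the text that sort strictly before c
    counts = Counter(bwt_string)
    first = {}
    total = 0
    for c in sorted(counts):
        first[c] = total
        total += counts[c]

    def occ(c, i):
        # Occurrences of c in bwt_string[:i]; a real index precomputes this
        return bwt_string[:i].count(c)

    lo, hi = 0, len(bwt_string)
    for c in reversed(pattern):
        if c not in first:
            return 0
        lo = first[c] + occ(c, lo)
        hi = first[c] + occ(c, hi)
        if lo >= hi:
            return 0
    return hi - lo


if __name__ == "__main__":
    index = bwt("banana")
    print(index)                            # annb$aa
    print(count_occurrences(index, "ana"))  # 2
```

The search touches the pattern one character at a time and never scans the original text, which is why an index of this kind can stay compact relative to the genome it represents.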

Application:BWA
Owner:Open Source

BWA is a software package for mapping low-divergent sequences against a large reference genome, such as the human genome. It consists of three algorithms: BWA-backtrack, BWA-SW and BWA-MEM. The first algorithm is designed for Illumina sequence reads up to 100bp, while the other two are for longer sequences ranging from 70bp to 1Mbp.

Application:ClustalW
Owner:Open Source

ClustalW is a general-purpose multiple alignment program for DNA or proteins. ClustalX is a graphical user interface for the ClustalW multiple-sequence alignment program.

Application:Edena
Owner:Open Source

De novo short reads assembler

Application:EULER-SR
Owner:Open Source

The EULER-SR assembly package contains a suite of programs for correcting errors in short reads and assembling them.

Application:FASTA
Owner:Open Source

FASTA is a more sensitive derivative of the FASTP program that can be used to search protein or DNA sequence data bases and can compare a protein sequence to a DNA sequence database by translating the DNA database as it is searched. FASTA includes an additional step in the calculation of the initial pairwise similarity score that allows multiple regions of similarity to be joined to increase the score of related sequences.

Application:HMMER
Owner:Open Source

HMMER is used for searching sequence databases for homologs of protein sequences, and for making protein sequence alignments. It implements methods using probabilistic models called profile hidden Markov models (profile HMMs).

Application:MIRA2
Owner:Open Source

Sequence assembler and sequence mapping for whole genome shotgun and EST/RNASeq sequencing data

Application:MrBayes
Owner:Open Source

MrBayes is a program for Bayesian inference of phylogeny using Markov Chain Monte Carlo methods. MrBayes has a console interface and uses a modified NEXUS format for data and batch files. It handles a wide range of probabilistic models for the evolution of nucleotide and amino acid sequences, restriction sites and standard binary data.

Application:MUMmer
Owner:Open Source

MUMmer is a system for rapidly aligning entire genomes, whether in complete or draft form. 

Application:NCBI BLAST
Owner:Open Source

Genomics/protein analytics, integer/character manipulation intensive, query and database required, shared-memory parallelization

Application:Novoalign
Owner:Commercial

Tool designed for mapping short reads from Illumina, Ion Torrent and 454 NGS platforms onto a reference genome

Application:Phred Phrap Consed
Owner:Commercial (Univ. Wash.)

Phred is base-calling software with quality estimation; Phrap performs shotgun sequence assembly; and Consed is a sequence assembly editor companion to Phrap. Also available are Swat and CrossMatch, sequence alignment tools; and Phrapview, a graphical tool that provides a "global" view of the Phrap assembly, complementary to the "local" view provided by Consed.

Application:RAY
Owner:Open Source

Parallel genome assemblies for parallel DNA sequencing

Application:Sentient Knowledge Explorer
Owner:IO Informatics

The Sentient Knowledge Explorer uses the power of semantic technologies to easily integrate data from virtually any source into coherent, unified knowledge bases.

Application:SeqAn
Owner:Open Source

SeqAn is an open-source C++ library of efficient algorithms and data structures for the analysis of sequences with the focus on biological data.

Application:SHARCGS
Owner:Open Source

Short-read assembler based on robust contig extension for genome sequencing

Application:SHRiMP
Owner:Open Source

SHRiMP is a software package for aligning genomic reads against a target genome. It was primarily developed with the multitudinous short reads of next-generation sequencing machines in mind, as well as Applied Biosystems' colourspace genomic representation.

Application:SOAP
Owner:Open Source

SOAP provides a full solution to next generation sequencing data analysis. It consists of a new alignment tool (SOAPaligner/soap2), a re-sequencing consensus sequence builder (SOAPsnp), an indel finder (SOAPindel), a structural variation scanner (SOAPsv) and a de novo short reads assembler (SOAPdenovo).

Application:SOAPdenovo
Owner:Open Source

SOAPdenovo is a novel short-read assembly method that can build a de novo draft assembly for human-sized genomes. The program is specially designed to assemble Illumina GA short reads. 

Application:SSAKE
Owner:Open Source

SSAKE is a de novo assembler for short DNA sequence reads. It is designed to help leverage the information from short-sequence reads by assembling them into contigs and scaffolds that can be used to characterize novel sequencing targets.

Application:Trinity
Owner:Open Source

A novel method for the efficient and robust de novo reconstruction of transcriptomes from RNA-seq data. Trinity combines three independent software modules — Inchworm, Chrysalis and Butterfly — applied sequentially to process large volumes of RNA-seq reads.

Application:VCAKE
Owner:Open Source

VCAKE is a genetic sequence assembler capable of assembling millions of small nucleotide reads even in the presence of sequencing error.

Application:Velvet
Owner:Open Source

Velvet performs de novo short read assembly using de Bruijn graphs. It can be used for Solexa and 454 sequencing data assembly.
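
The de Bruijn graph approach can be sketched in a few lines of Python. This is a simplified illustration, not Velvet's code: the function names are hypothetical, reads are assumed error-free, and the walk below only follows unambiguous paths, whereas a real assembler also handles sequencing errors, repeats, reverse complements and coverage.

```python
from collections import defaultdict


def de_bruijn_graph(reads, k):
    """Build a de Bruijn graph: each (k-1)-mer maps to the set of
    (k-1)-mers that follow it in some k-mer of the reads."""
    graph = defaultdict(set)
    for read in reads:
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            graph[kmer[:-1]].add(kmer[1:])
    return dict(graph)


def spell_unambiguous_path(graph, start):
    """Extend a contig from start while each node has exactly one successor."""
    contig = start
    node = start
    seen = {start}
    while len(graph.get(node, ())) == 1:
        node = next(iter(graph[node]))
        if node in seen:  # stop on a cycle instead of looping forever
            break
        seen.add(node)
        contig += node[-1]
    return contig


if __name__ == "__main__":
    reads = ["ACGTC", "GTCAG"]  # two reads overlapping on "GTC"
    graph = de_bruijn_graph(reads, k=3)
    print(spell_unambiguous_path(graph, "AC"))  # ACGTCAG
```

Because overlapping reads share k-mers, they collapse onto the same edges of the graph, so the overlap between the two reads is found without comparing the reads to each other directly.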

View Drug Discovery Applications
Application
Owner
Description
Application:AutoDock
Owner:Open Source

AutoDock is a suite of automated docking tools that predicts how small molecules, such as substrates or drug candidates, bind to a receptor of known 3-D structure.

Application:DOCK6
Owner:Commercial (UCSF)

DOCK addresses the problem of "docking" molecules to each other. In general, "docking" is the identification of the low-energy binding modes of a small molecule, or ligand, within the active site of a macromolecule, or receptor, whose structure is known. A compound that interacts strongly with, or binds, a receptor associated with a disease may inhibit its function and thus act as a drug. Solving the docking problem computationally requires an accurate representation of the molecular energetics as well as an efficient algorithm to search the potential binding modes.

Application:FlexX
Owner:Commercial (BioSolveIT)

Predicts protein-ligand interactions.

Application:GLIDE
Owner:Commercial (Schrödinger)

A complete solution for ligand-receptor docking, from virtual screening of millions of compounds to binding mode predictions.

Application:GOLD
Owner:Commercial (CCDC)

GOLD calculates the docking modes of small molecules into protein binding sites.

Application:ICM
Owner:Commercial (Molsoft)

A desktop-modeling environment for molecular structure and function.

Application:LigandFit
Owner:Commercial (Accelrys)

Code for docking ligands into protein active sites. The method employs a cavity detection algorithm for detecting invaginations in the protein as candidate active site regions.

Application:ROCS
Owner:Commercial (OpenEye)

ROCS is a virtual screening tool that can identify potentially active compounds by shape comparison.

View Materials Science Applications
Application
Owner
Description
Application:ABINIT
Owner:Open Source

Main program helps users find the total energy, charge density and electronic structure of systems made of electrons and nuclei (molecules and periodic solids) within density functional theory (DFT), using pseudopotentials and a planewave or wavelet basis. ABINIT also includes options to optimize the geometry according to the DFT forces and stresses, or to perform molecular dynamics simulations using these forces, or to generate dynamical matrices, Born effective charges, and dielectric tensors, based on density-functional perturbation theory, and many more properties.

Application:BigDFT
Owner:Open Source

BigDFT is a DFT massively parallel electronic structure code (GPL license) using a wavelet basis set. Wavelets form a real space basis set distributed on an adaptive mesh. GTH or HGH pseudopotentials are used to remove the core electrons. With a Poisson solver based on a Green function formalism, periodic systems, surfaces and isolated systems can be simulated with the proper boundary conditions.

Application:CASTEP
Owner:Commercial (Accelrys)

CASTEP is a code for calculating the properties of materials from first principles. Using density functional theory, it can simulate a wide range of material properties including energetics, structure at the atomic level, vibrational properties and electronic response properties. Offers a wide range of spectroscopic features that link directly to experiment, such as infrared and Raman spectroscopies, NMR and core level spectra.

Application:CP2K
Owner:Open Source

CP2K performs atomistic and molecular simulations of solid state, liquid, molecular and biological systems. It provides a general framework for different methods such as density functional theory using a mixed Gaussian and plane waves approach (GPW) and classical pair and many-body potentials.

Application:CPMD
Owner:Open Source

The CPMD code is a parallelized plane wave/pseudopotential implementation of density functional theory, particularly designed for ab-initio molecular dynamics.

Application:DMol3
Owner:Commercial (Accelrys)

DMol3 is a commercial (and academic) software package that uses density functional theory with a numerical radial function basis set to calculate the electronic properties of molecules, clusters, surfaces and crystalline solid materials from first principles. It can either use gas phase boundary conditions or 3-D periodic boundary conditions for solids or simulations of lower dimensional periodicity.

Application:LAMMPS
Owner:Open Source

Large-scale atomic/molecular massively parallel simulator (LAMMPS) is a classical molecular dynamics code.

Application:SIESTA
Owner:Open Source (commercial version from Nanotec)

SIESTA is both a method and its computer program implementation for performing electronic structure calculations and ab initio molecular dynamics simulations of molecules and solids. SIESTA uses strictly localized basis sets and linear-scaling algorithms that can be applied to suitable systems. The code can be used for a wide range of applications, from quick exploratory calculations to simulations that match other approaches, such as plane-wave and all-electron methods.

Application:VASP
Owner:Commercial (Univ. of Vienna)

VASP performs ab-initio quantum-mechanical molecular dynamics (MD) using pseudopotentials and a plane wave basis set. The approach is based on a finite-temperature local-density approximation (with the free energy as variational quantity) and an exact evaluation of the instantaneous electronic ground state at each MD-step using efficient matrix diagonalization schemes and an efficient Pulay mixing.

View Proteomics Applications
Application
Owner
Description
Application:Mascot
Owner:Commercial (Matrix Science)

Mascot is a mass spectral search algorithm that uses mass spectrometry data to identify proteins from primary sequence databases.

Application:OMSSA
Owner:Open Source

Mass spectrometry software used for data acquisition, analysis or representation.

Application:ProteinProspector
Owner:Open Source

Proteomics tools for mining sequence databases in conjunction with mass spectrometry experiments.

Application:SEQUEST
Owner:Open Source

SEQUEST correlates uninterpreted tandem mass spectra of peptides with amino acid sequences from protein and nucleotide databases. It will determine the amino acid sequence and thus the protein(s) and organism(s) that correspond to the mass spectrum being analyzed.

Application:X!Tandem
Owner:Open Source

X! Tandem can match tandem mass spectra with peptide sequences. It generates theoretical spectra for peptide sequences using information about intensity associated with amino acids. These spectra are compared with experimental data to generate an expectation value as a threshold score.

View Structural Biology Applications
Application
Owner
Description
Application:ACEMD
Owner:Acellera

ACEMD is a heavily optimized molecular dynamics engine specially designed to run on NVIDIA GPUs.

Application:AMBER
Owner:Open Source

Assisted model building with energy refinement (AMBER) refers to two things: a set of molecular mechanical force fields for the simulation of biomolecules (which are in the public domain, and are used in a variety of simulation programs), and a package of molecular simulation programs which includes source code and demos.

Application:CHARMM
Owner:Commercial (Accelrys)

CHARMM (Chemistry at HARvard Molecular Mechanics) is the academic version of the CHARMM simulation program available through Harvard. CHARMM uses empirical energy functions to describe the forces on atoms in molecules.

Application:DESMOND
Owner:D.E. Shaw Research

DESMOND performs high-speed molecular dynamics simulations of biological systems on conventional commodity clusters. The code uses novel parallel algorithms and numerical techniques and can run on platforms containing a large number of processors, or on a single computer.

Application:GAMESS
Owner:Open Source

General atomic and molecular electronic structure system (GAMESS) is a general ab initio quantum chemistry package. It can compute wave functions ranging from RHF, ROHF, UHF, GVB and MCSCF, with CI and MP2 energy corrections available for some of these.

Application:Gaussian
Owner:Commercial (Gaussian)

Gaussian 09 provides electronic structure modeling, and is licensed for a wide variety of computer systems.

Application:GROMACS
Owner:Open Source

GROMACS performs molecular dynamics, simulating the Newtonian equations of motion for systems with hundreds to millions of particles. It is primarily designed for biochemical molecules such as proteins, lipids and nucleic acids that have complicated bonded interactions, but is also being used for research on nonbiological systems, such as polymers.
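
To show what "simulating the Newtonian equations of motion" involves at its core, here is a minimal velocity Verlet integrator for a single particle in one dimension. It is a sketch under stated assumptions rather than GROMACS code: the function name, the harmonic test force and all parameter values are hypothetical, and a production engine adds force fields, constraints, neighbor lists, thermostats and parallel decomposition on top of a step like this.

```python
def velocity_verlet(x, v, force, mass, dt, steps):
    """Integrate Newton's equation m * a = F(x) for one particle in 1D.

    Returns the final (position, velocity) after the given number of steps.
    """
    a = force(x) / mass
    for _ in range(steps):
        # Advance position using the current velocity and acceleration
        x = x + v * dt + 0.5 * a * dt * dt
        # Recompute the force at the new position
        a_new = force(x) / mass
        # Advance velocity with the average of old and new accelerations
        v = v + 0.5 * (a + a_new) * dt
        a = a_new
    return x, v


if __name__ == "__main__":
    # Harmonic oscillator with spring constant 1 and mass 1 (period 2*pi)
    x, v = velocity_verlet(1.0, 0.0, lambda pos: -pos, 1.0, 0.001, 6283)
    energy = 0.5 * v * v + 0.5 * x * x
    print(x, v, energy)  # close to the start: x near 1, v near 0, energy near 0.5
```

Velocity Verlet is a common choice for this kind of sketch because it is time-reversible and keeps the total energy close to its initial value over long runs, which is the property the harmonic oscillator check exercises.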

Application:Jaguar
Owner:Commercial (Schrödinger)

Jaguar is an ab initio package for both gas and solution phase simulations, with particular strength in treating metal containing systems.

Application:LAMMPS
Owner:Open Source

Large-scale atomic/molecular massively parallel simulator (LAMMPS) is a classical molecular dynamics code.

Application:MOLPRO
Owner:Commercial (University College Cardiff Consultants Ltd.)

Molpro is a complete system of ab initio programs for molecular electronic structure calculations. The emphasis is on highly accurate computations, with extensive treatment of the electron correlation problem through the multiconfiguration-reference CI, coupled cluster and associated methods.

Application:NAMD
Owner:Open Source

NAMD is a parallel molecular dynamics code designed for simulation of large biomolecular systems. Based on Charm++ parallel objects, NAMD scales to hundreds of processors on high-end parallel platforms.

Application:NWChem
Owner:Open Source

NWChem provides many methods to compute the properties of molecular and periodic systems by using standard quantum mechanical descriptions of the electronic wave function or density. NWChem can perform classical molecular dynamics and free energy simulations. These approaches may be combined to perform mixed quantum-mechanics and molecular-mechanics simulations.

Application:Q-Chem
Owner:Commercial (Q-Chem)

Q-Chem is a comprehensive ab initio quantum chemistry package. Its capabilities range from DFT/HF calculations to post-HF correlation methods.

Application:QMPACK
Owner:Open Source

MPACK is a Fortran program package that involves scattering problems of two octet baryons by the quark-model interactions, fss2 and FSS (fssG.f), and their applications to the Faddeev calculations for the triton (triton.f) and the hypertriton (hypt.f).

Application:TeraChem
Owner:Commercial (PetaChem)

TeraChem is general-purpose quantum chemistry software designed to run on NVIDIA GPU architectures under a 64-bit Linux operating system.

Case Studies

Applying Big Data Analysis Techniques to Simulated DNA Data

Researchers from the Centre for Biomolecular Sciences at the University of Nottingham along with the Edinburgh-based Cray Centre of Excellence team have been carrying out a pioneering study applying big data analysis techniques to simulation-generated DNA data.

The Pawsey Centre’s Cray XC40 Supercomputer “Magnus” Gives Researchers a Big Advantage in Fight Against Lung Disease

Chronic respiratory diseases interrupt the airways and other lung structures, affecting hundreds of millions of people worldwide. Researchers created a first-ever 3D model of the lung using “Magnus,” Pawsey Supercomputing Centre’s Cray XC40 supercomputer, with the goal of improved delivery of aerosolized medications.

Enabling Scientific Breakthroughs at the Petascale

If you think the requirements for enterprise storage systems are growing at a dizzying pace, try these numbers on for size...

Boron Nitride and the Nanoribbons of Tomorrow

Materials modeling on ORNL's "Jaguar" shows big future for boron nitride.

ORNL and Purdue Explore Technology at the Nanoscale

A team led by Gerhard Klimeck of Purdue University has broken the petascale barrier while addressing a relatively old problem in the young field of computer chip design.

Application, Solution and Technology Briefs

Removing Bottlenecks to Large-Scale Genetic and Genomic Data Analysis with DISSECT and the Cray XC Supercomputer

Genomic and genetic research have a big data problem. The field is producing ever-increasing amounts of data. But a lack of sufficiently scalable computational tools prevents researchers from analyzing it adequately. The situation leaves the massive opportunities inherent in this data untapped.

Cray Helps Optimize De Novo Assembler Application Trinity for Use on Massively Parallel Supercomputers

To fully realize the benefits of the Cray XC30 system for NGS, Cray is actively collaborating with leading researchers to improve the performance of NGS workflows.

Cray Demonstrates Top-Level Performance and Scalability on Very Large Datasets with Velvet

Velvet is a de novo genomic assembler designed for short reads generated by NGS sequencers.

Cray Helps Tune Ray De Novo Genomic Assembler Software

Ray is highly parallel software developed at the Université Laval that performs de novo genome assemblies with next-generation DNA sequencing data.

Cray's Next-Generation Sequencing Solution

Cray’s next-generation sequencing solution helps research and clinical institutions manage datasets throughout their life cycle, from assembling raw data to archiving analyzed data. The Cray NGS solution comprises three core elements — computing, storage and analysis.

Cray Storage Solutions for Life Sciences

Built on open systems, Cray’s scalable storage solutions address the data- and I/O-intensive workflows of the life sciences and deliver results faster.

Tuning NAMD on the Cray XK6 "Titan" Supercomputer

With Cray support, the NAMD developers are optimizing their code on each new iteration of Cray hardware, achieving scaling to hundreds of thousands of cores.

Speed Your Time to Results Using Galaxy on a Cray System

Galaxy is a widely used web-based platform for data integration and analysis in the life sciences.

Customer Solutions

PDC Center for High Performance Computing: Milner Marches On

The Cray XC30 system Milner, named in recognition of the significant work in neuropsychology done by Brenda Milner and her then-husband Peter in the 1950s, is part of a high performance computing platform for neuroinformatics research and was funded by an infrastructure grant from the Swedish Research Council (VR).

The Problem with Cellulosic Ethanol

At Oak Ridge National Laboratory, simulation provides a close-up look at the molecule that complicates next-generation biofuels.

Supercomputers Simulate the Molecular Machines That Replicate and Repair DNA

Researchers used "Jaguar," at Oak Ridge National Laboratory, to elucidate the mechanism by which accessory proteins, called sliding clamps, are loaded onto DNA strands and coordinate enzymes that enable gene repair or replication.

Early Molecular Dynamics Research Blazes through Titan’s New GPUs

A look at the transition from CPUs to GPUs when the Oak Ridge Leadership Computing Facility upgraded its Cray "Jaguar" system to the new "Titan." 

Beagle: The CI Supercomputer for Biomedical Simulations & Data Analysis at the University of Chicago

The official website for "Beagle," a Cray XE6 system used primarily for biomedical research at the University of Chicago's Computation Institute.

Computation Institute, University of Chicago, Beagle Newsletter

The Beagle Cray system is one of the fastest supercomputers in the world that is devoted to life sciences.

Videos and Webinars

Bio-IT World Breakfast: A dialogue with Dr. George Church

Join us for breakfast at the Renaissance Hotel in the Caspian Room from 8:30 to 10:00 a.m. on April 5th.

Webcast Replay: Population Genetics Examples with Apache Spark

Emerging big data analytics techniques hold the promise of accelerating scientific data processing, lowering the cost and complexity of data management and providing new capabilities for genomic interpretation.

Unprecedented Speed: Lumenogix and Cray for Next-Generation Data Sequencing

Dave Anstey, global head of life sciences at Cray Inc., and Boris Umylny, director of bioinformatics services at the National Center for Genome Resources, talk about processing NGS data with unprecedented speed.

Why Scientific Innovation Outpaces Infrastructure

Dave Anstey, global head of life sciences at Cray Inc., talks about scientific innovation and how it relates to computer infrastructure.

Accelerating NGS Workflows with Hadoop and Spark

Learn how to accelerate NGS workflows to enable scientific breakthroughs across a wide variety of fields including cancer research, clinical genomics and precision medicine.

Personalized Medicine: Technology Needs to Be Ready

Carlos Sosa, high performance computing architect at Cray Inc., says personalized medicine is on the way, but HPC technology must be more robust to answer questions quickly for patients and doctors.

Leveraging a Cray Supercomputer for Parallel De Novo Transcriptome Assembly Using Trinity

By the Broad Institute, a unique, collaborative community pioneering a new model of biomedical science.

Technical and Analyst Papers

Genomic Applications on Cray Supercomputers: Next Generation Sequencing Workflow

A team of Cray and university researchers discusses its work with Ray, a parallel short-read de novo assembler code. They also present a configuration for an NGS workflow based on a Cray supercomputing system and the Cray Sonexion storage solution.

Human Analysts at Superhuman Scales

Ray, a scalable genome assembler, addresses big data problems by using resources optimally and producing a single correct, conservative solution in a timely manner.

Blog

Cray Blog - Life Sciences

Commentary and insight from Cray’s top experts and customers