What are you looking for ?
Infinidat
Articles_top

University of Miami Deploys DDN Storage

Enabling "400%" speed up of genome sequencing

DataDirect Networks, Inc. (DDN) announced that the University of Miami’s Center for Computational Science (CCS) has deployed its GS12K scale-out file storage to speed scientific discoveries and boost collaboration with researchers around the world.

University of Miami

CCS maintains one of the largest centralized academic cyber-infrastructures in the country, which fuels vital and critical discoveries in Alzheimer’s, Parkinson’s, gastrointestinal cancer, paralysis and climate modeling as well as marine and atmospheric science research.

More than 2,000 internal researchers and a dozen expert collaborators across academic and industry sectors work together in workflow management, data management, data mining, decision support, visualization and cloud computing. To streamline workflows and keep pace with data-intensive discovery demands, CCS has integrated its HPC environment with data capture and analytics capabilities so that data can move transparently between research steps.

To simplify data capture and analysis, CCS relies on DDN’s powerful and versatile GS12K storage to handle both high I/O and low IO/s and to support full bandwidth demands in a single system. As a result, the center captures, stores and distributes massive amounts of data generated from multiple scientific models running different simulations on 15 Illumina HiSeq sequencers simultaneously on DDN storage. The center has reduced its number-crunching time for genome mapping and SNP calling from 72 to 17 hours.

DDN enabled us to analyze thousands of samples for the Cancer Genome Atlas, which amounts to nearly a PB of data,” said Dr. Nicholas Tsinoremas, director, the Center for Computational Sciences at the University of Miami. “Having a robust storage platform like DDN is essential to driving discoveries such as our recent study that revealed a link between certain viruses and gastrointestinal cancers. Previously, we couldn’t have done that level of computation.

In addition to providing storage processing power to meet both high I/O and interactive processing requirements, CCS needed a flexible file system that could support large parallel and short serial jobs. Additionally, the center had to address ‘data in flight’ challenges resulting from major data surges during analysis, which often caused a 10x spike in storage. DDN’s ability to adapt to all of CCS’ requirements enabled the center to leverage one centralized storage platform for all its needs while scaling without adding a layer of complexity.

Moreover, DDN’s performance for genomics assembly, alignment and mapping enables CCS to support all its application needs, including the use of BWA and Bowtie for initial mapping as well as SamTools and GATK for variant analysis and SNP calling.

Our arrangement is to share data or make it available to anyone asking, anywhere in the world,” added Tsinoremas. “Now we have the storage versatility to attract researchers from both within and outside the HPC community. With DDN, we’re well positioned to generate, analyze and integrate all types of research data to drive major scientific discoveries and breakthroughs.

Articles_bottom
AIC
ATTO
OPEN-E