University of Wisconsin-Madison Alum Masterminding Next Gen Storage: Solution to Datapocalypse?

The goal of the demo is to store 125GB on less than 1 cubic cm of DNA, and to do it for $7,000.

By David Tenenbaum, science writer, University of Wisconsin–Madison

Catalog Technologies, Inc., a Boston company co-founded by a recent University of Wisconsin-Madison Ph.D., is preparing to demonstrate the world’s fastest, densest DNA-based data-warehouse.

Hyunjun Park, center, talks about starting a business
at the Weinert Center’s Distinguished Entrepreneurs Lunch February 27.

(Photo by Lisa Collins, Weinert Center)

In a meeting at the Weinert Center for Entrepreneurship at the Wisconsin School of Business, Hyunjun Park, co-founder and CEO, Catalog, said the device will hold digital information in DNA – life’s evolution-perfected ‘storage’ molecule.

The goal of the demonstration, says Park, is to store 125GB, or about 200 full-length CDs, in 24 hours on less than 1cm³ of DNA. And to do it for $7,000.

That’s a lot more expensive than an HDD, but Park says it’s a million times cheaper than a DNA ‘drive’ demonstrated by Microsoft Corp. – and 100,000 times faster. Indeed, faster and cheaper are Catalog’s two competitive advantages. (The company name is in upper case because it contains C, A, T and G, shorthand for the four ‘letters’ that carry information in DNA.)

Data from business, spy satellites, telescopes, weather stations, government agencies and the countless gadgets in the IoT is becoming the currency of the era.

That picture was already emerging by the time Park earned a Ph.D. in bacteriology at UW-Madison in 2014. His next step was a post-doctoral position at MIT in synthetic biology, an emerging field dedicated to creating, not growing, organisms and biological structures.

Before leaving Madison, Park attended the week-long Morgridge Entrepreneurial Boot Camp on campus, and began considering leaving academia for the rough and tumble of entrepreneurship.

At MIT, Park met Nathaniel Roquet, a Harvard Ph.D. student in biophysics who became Catalog’s other co-founder and chief technology innovation officer.

With eight employees in Boston, and investments totaling more than $10 million, Catalog is built for speed in the race to find the next great storage system.

“We have started from zero, and hopefully are getting to one,” says Park, lapsing into dataspeak.

For a start-up, a solution is less important than a solid problem, Park told the Weinert Center’s Distinguished Entrepreneurs Lunch on Feb. 27. And Park’s problem – the glut of information sometimes called the ‘datapocalypse’ – is a result of a tsunami of data from pretty much every sphere of human activity.

With Catalog, so far, so good, says Dan Olszewski, director of the Weinert Center, who invited Park to speak at the event as a distinguished alumnus. “Hyunjun is a great example of the creativity of someone who was trained to an amazing level of science at UW-Madison. He overlays that on his entrepreneurial creativity, and comes up with an innovative idea that bridges those realms. His solution reads like science fiction, but it makes sense. He’s got the persistence and resilience that are fundamental to entrepreneurs.”

The slowing improvements in HDDs and SSDs make DNA a tantalizing alternative, and Microsoft, Intel Corp. and other techno bigs are chasing the ‘Grail of a medium’ that offers millennium-level stability.

Curiously, life’s storage molecule lasts far longer than disks and magnetic tape, which must be replaced every few years.

In terms of digital data in a given volume, DNA is also millions of times more capacious. A 2017 demonstration at Columbia University showed a data density of 215 billion MB per gram of DNA, enough to store more than 100 million movies.

On the downside, that DNA cost $3,500/MB to assemble. For comparison, today you can buy five million megabytes of storage (in a combination HDD and SSD unit) for about the same price.

We did the math: that’s about five million times cheaper.
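
The arithmetic behind that comparison can be checked in a few lines of Python. This is only a sketch built from the figures quoted in this article; the function name and the decimal convention (1GB = 1,000MB) are assumptions made for illustration.

```python
# Per-megabyte cost comparison using only figures quoted in the article.
# Assumption: decimal units, so 1 GB = 1,000 MB.
MB_PER_GB = 1_000


def cost_per_mb(total_cost_usd: float, capacity_mb: float) -> float:
    """Return the cost in US dollars of storing one megabyte."""
    return total_cost_usd / capacity_mb


columbia_dna = cost_per_mb(3_500, 1)                 # $3,500 per MB of DNA (2017 demo)
conventional = cost_per_mb(3_500, 5_000_000)         # five million MB for about $3,500
catalog_goal = cost_per_mb(7_000, 125 * MB_PER_GB)   # 125GB for $7,000 (stated goal)

ratio_conventional = columbia_dna / conventional
ratio_catalog = columbia_dna / catalog_goal

print(f"Conventional storage: ${conventional:.4f}/MB")
print(f"Catalog goal:         ${catalog_goal:.3f}/MB")
print(f"Columbia DNA vs conventional: {ratio_conventional:,.0f}x")
print(f"Columbia DNA vs Catalog goal: {ratio_catalog:,.0f}x")
```

On these numbers, Catalog's stated goal works out to roughly 5.6 cents per megabyte, about 62,500 times cheaper than the Columbia figure but still well above conventional storage. Note that this is a different comparison from Park's 'million times cheaper', which refers to a Microsoft demonstration whose cost is not given here.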

As these numbers show, the problem dogging DNA storage is the same one that hounded hard-disk technology in the 1950s: cost. Assembling the four ‘letters’ of DNA is expensive, and slow.

Nonetheless, promising technologies can have a slow start and then pick up speed. When Park and Roquet formed Catalog in 2016, they shunned the idea of assembling bases one by one to represent the digital ‘alphabet.’

“We don’t think about how nature does it,” says Park. “We scrap all that. We think of DNA as a medium, a polymer, and ask, ‘What is the best way to generate a lot of different molecules?’”

Catalog opted for prefab: it buys or makes fragments of DNA, ‘in massive quantities,’ and then assembles them with a custom-made liquid-handling robot.

“DNA molecules are like Lego blocks,” says Park. “We can string them together in virtually infinite combinations. We take advantage of that and start with a few hundred molecules to generate, in the end, trillions of different molecules.”
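
How a few hundred starting pieces can yield trillions of distinct molecules is a matter of combinatorics. The Python sketch below is a hypothetical illustration, not a description of Catalog's actual encoding, which is not detailed here: it assumes each molecule is built from a fixed number of positions, with one prefabricated component chosen per position. The layout (six positions, 100 components each) and both function names are invented for the example.

```python
from math import prod


def count_combinations(library_sizes: list[int]) -> int:
    """Number of distinct molecules when one component is chosen per position."""
    return prod(library_sizes)


def index_to_components(index: int, library_sizes: list[int]) -> list[int]:
    """Map an integer to one component choice per position (mixed-radix digits)."""
    if not 0 <= index < count_combinations(library_sizes):
        raise ValueError("index out of range for this library")
    choices = []
    # Peel off digits from the last position to the first.
    for size in reversed(library_sizes):
        index, digit = divmod(index, size)
        choices.append(digit)
    return choices[::-1]


# Hypothetical layout: 6 positions with 100 prefabricated components each,
# i.e. 600 starting molecules in total.
layout = [100] * 6
total = count_combinations(layout)

print(f"Starting molecules:  {sum(layout)}")
print(f"Possible assemblies: {total:,}")
print(index_to_components(123456, layout))
```

Under these assumed numbers, 600 starting components give one trillion possible assemblies, and each extra position multiplies that total by another 100. The same logic underlies the movable-type comparison: a small set of reusable pieces, an enormous number of arrangements.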

Movable type ushered in lower-cost printing and played a key role in science, industry and education.
(Credit Willi Heidelbach)

Park likens the approach to movable type. Instead of having to write out every letter each time you want to write something, old-style typesetters cast their letters in advance, and then slotted them into position.

The result is a warp-speed improvement on an assembly process that takes advantage of a medium that life has perfected over billions of years of evolution.

If he’s daunted by having Intel and Microsoft as competitors, Park does not show it. “We’re creating a new medium for digital storage, so the opportunity is big enough for more than one company. If large companies like Microsoft are interested, it only helps to validate the idea and to build up the ecosystem for a technology like this.”

Resources:
Video: Introduction to DNA-based data storage and CATALOG
Blog: Introducing CATALOG

Read also:
R&D: Microsoft Research Podcast With Dr. Karin Strauss on Storing Digital Data in Synthetic DNA
Explains how the properties of DNA could eventually make it possible to store very large amounts of data in small spaces for a long time.
November 5, 2018 | Press Release
Storage and Retrieval of Information With Enzymatic DNA Synthesis From Molecular Assemblies
Physical DNA was then ‘read’ by DNA sequencing and converted back to binary data and then to text message.
August 29, 2018 | Press Release
R&D: Random Access in Large-Scale DNA Data Storage
Demonstrate viable, large-scale system for DNA data storage and retrieval
March 5, 2018 | Press Release
R&D: Clustering Billions of Reads for DNA Storage
Algorithm achieves higher accuracy and 1000x speedup on three real datasets.
November 29, 2017 | Press Release
Making DNA Data Storage Reality
Few kilograms could store all of humanity’s data, but there are challenges.
October 6, 2017 | In Brief
