R&D: DNA Merge-Sort, Family of Nested Varshamov-Tenengolts Reassembly Codes for Out-of-Order Media
To merge and sort fragments in order to recover stored data
This is a Press Release edited by StorageNewsletter.com on March 13, 2024 at 2:00 pmIEEE Transactions on Communications has published an article written by Sajjad Nassirpour, Department of Electrical and Computer Engineering, San Diego State University, San Diego, USA, Ilan Shomorony, Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, Urbana, USA, and Alireza Vahid, Department of Electrical and Microelectronic Engineering, Rochester Institute of Technology, Rochester, NY, USA.
Abstract: “Motivated by the DNA storage paradigm, we consider the torn-paper channel (TPC), which models data storage in long DNA molecules and breaks the input sequence into a random number of out-of-order variable-length non-overlapped fragments. We propose a computationally-efficient code construction for this model. More specifically, we introduce a family of nested Varshamov-Tenengolts (VT) codes to merge and sort the fragments in order to recover the stored data. We numerically show that our scheme (i) obtains rates that are higher than in prior results, (ii) has a decoding complexity that is cubic in the number of codeword fragments, which is significantly lower than the complexity of the brute-force approach, and (iii) offers decreasing and negligible error rates as the codeword length increases. We also propose a new construction for VT codes, quantify the number of required parity bits, and show that our approach requires fewer parity bits compared to known results.“











