
R&D: Efficient Crash Consistency for NVMe Over PCIe and RDMA

Experimentally showing that MQFS increases IOPS of RocksDB by 36% and 28% compared to a state-of-the-art file system and Ext4 without journaling, respectively

ACM Transactions on Storage has published an article written by Xiaojian Liao, Youyou Lu, Zhe Yang, and Jiwu Shu, Tsinghua University, Haidian District, Beijing, China.

Abstract: This article presents crash-consistent Non-Volatile Memory Express (ccNVMe), a novel extension of the NVMe that defines how host software communicates with non-volatile memory (e.g., a solid-state drive) across a PCI Express bus and RDMA-capable networks with both crash consistency and performance efficiency. Existing storage systems pay a huge tax on crash consistency, and thus cannot fully exploit the multi-queue parallelism and low latency of the NVMe and RDMA interfaces. ccNVMe alleviates this major bottleneck by coupling crash consistency to data dissemination. This new idea allows the storage system to achieve crash consistency by taking a free ride on the data dissemination mechanism of NVMe, using only two lightweight memory-mapped I/Os (MMIOs), unlike traditional systems that use complex update protocols and synchronized block I/Os. ccNVMe introduces a series of techniques, including transaction-aware MMIO/doorbell and I/O command coalescing, to reduce PCIe traffic as well as to provide atomicity. We present how to build a high-performance and crash-consistent file system named MQFS atop ccNVMe. We experimentally show that MQFS increases the IOPS of RocksDB by 36% and 28% compared to a state-of-the-art file system and Ext4 without journaling, respectively.
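To make the transaction-aware doorbell idea from the abstract more concrete, here is a minimal toy model in Python. It is not the ccNVMe implementation: `ToySubmissionQueue`, `Command`, `build_tx`, and the doorbell counter are all hypothetical names invented for illustration, and the sketch models neither real MMIO nor persistence nor the second MMIO the abstract mentions. It only shows why publishing the queue tail once per transaction, rather than once per command, cuts the number of doorbell writes and lets the simulated device see either all of a transaction's commands or none of them.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Command:
    """Hypothetical I/O command tagged with the transaction it belongs to."""
    tx_id: int
    lba: int
    is_commit: bool = False  # True only for the last command of a transaction


@dataclass
class ToySubmissionQueue:
    """Toy queue: the simulated device only sees entries below visible_tail."""
    entries: List[Command] = field(default_factory=list)
    visible_tail: int = 0     # tail index last published to the device
    doorbell_writes: int = 0  # number of simulated doorbell (MMIO) writes

    def ring_doorbell(self) -> None:
        # Publish every queued entry to the device with one simulated MMIO.
        self.visible_tail = len(self.entries)
        self.doorbell_writes += 1

    def submit_per_command(self, cmds: List[Command]) -> None:
        # Baseline: one doorbell write after every single command.
        for cmd in cmds:
            self.entries.append(cmd)
            self.ring_doorbell()

    def submit_transaction(self, cmds: List[Command]) -> None:
        # Transaction-aware: queue all commands, then ring the doorbell once,
        # so the device never observes a partially submitted transaction.
        self.entries.extend(cmds)
        self.ring_doorbell()


def build_tx(tx_id: int, lbas: List[int]) -> List[Command]:
    """Build a transaction whose final command carries the commit mark."""
    last = len(lbas) - 1
    return [Command(tx_id, lba, is_commit=(i == last))
            for i, lba in enumerate(lbas)]


if __name__ == "__main__":
    baseline = ToySubmissionQueue()
    baseline.submit_per_command(build_tx(1, [10, 11, 12]))

    coalesced = ToySubmissionQueue()
    coalesced.submit_transaction(build_tx(2, [20, 21, 22]))

    print("per-command doorbell writes:", baseline.doorbell_writes)
    print("per-transaction doorbell writes:", coalesced.doorbell_writes)
```

In this toy model, a crash before `ring_doorbell()` leaves `visible_tail` unchanged, so none of the transaction's commands were ever visible to the device. How ccNVMe actually persists and recovers transaction state is described in the paper, not here.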
