R&D: Variable-Sized Stripe Level Data Layout Strategy for HDD/SSD Hybrid Parallel File Systems
Experimental results show that SLDP is feasible and can improve system performance.
This is a Press Release edited by StorageNewsletter.com on November 16, 2016 at 2:42 pmConcurrency and Computation: Practice and Experience has published an article written by Yan Liu, College of Computer Science and Electronic Engineering, Hunan University, Changsha 410082, China, Xin Huang, State Grid Information and Communication Company of Hunan Electric Power Corp., ChangSha, China,Yizi Huang, Shaofeng Geng, College of Computer Science and Electronic Engineering, Hunan University, Changsha, China, Xin Peng, College of Information and Communication Engineering, Hunan Institute of Science and Technology, Yueyang, China, and Renfa Li, College of Computer Science and Electronic Engineering, Hunan University, Changsha, China.
Abstract: “Parallel file systems commonly distribute a file across multiple file servers with a fixed-size stripe, thereby allowing data access through multiple file servers. This default data layout works well in traditional homogeneous storage systems, but when SSDs are conducted into a storage system, the data layout of hybrid parallel file systems has a chance to obtain better I/O performance. In this study, we propose a variable-sized stripe level data layout strategy for hybrid parallel file systems (SLDP). SLDP divides the file into several regions according to the data access pattern and then finds the optimal configurations for each region among the SSD file server nodes and mechanical HDD file server nodes. It uses variable stripe sizes to reorganize the data layout of file systems. Furthermore, it considers SSD space limitation, the main idea is to distribute key regions of the file to hybrid parallel file systems based on the optimal stripe configuration, which can significantly improve the system I/O throughput performance. The remaining parts of a file are then distributed according to the SSD free space threshold, which can leverage the SSD servers as much as possible. To achieve this, SLDP divides a large file into many fine-grained regions and adjusts the data layout method for each region according to the access patter. Experimental results show that the SLDP is feasible and can improve system performance.”
Copyright 2016 John Wiley & Sons, Ltd.