What are you looking for ?
Advertise with us
RAIDON

R&D: SWEP-RF, Accuracy Sliding Window-Based Ensemble Pruning Method for Latent Sector Error Prediction in Cloud Storage Computing

Method is promising solution for enhancing cloud storage reliability through proactive LSE prediction.

Journal of King Saud University – Computer and Information Sciences has published an article written by Adnan Tahir, Shenzhen University, College of Computer Science and Software Engineering, 518060 Shenzhen, China, and The Islamia University of Bahawalpur, Rahim Yar Khan Campus, Department of Computer Science & IT, 64200 Rahim Yar Khan, Pakistan, Fei Chen, Shenzhen University, College of Computer Science and Software Engineering, 518060 Shenzhen, China, Abdulwahab Ali Almazroi, and Nourah Fahad Janbi, University of Jeddah, College of Computing and Information Technology at Khulais, Department of Information Technology, Jeddah, Saudi Arabia.

Abstract: Latent sector errors (LSEs) in disk drives cause significant outages, data loss, and unreliability in large-scale cloud storage systems. Predicting LSEs can help avoid these problems and improve cloud reliability. Ensemble classifiers typically outperform individual classifiers for LSE prediction with high accuracy but can lead to underfitting and incurring additional computational cost, complexity, and time and memory consumption. This research addresses this challenge by proposing a twofold solution: optimizing the ensemble diversity of the resulting Random Forest (RF) classifier through accuracy sliding window-based ensemble pruning (SWEP-RF) and using this pruned ensemble to predict LSEs in cloud storage. SWEP-RF maximizes its lower margin distribution to adapt the RF prediction performance and produce a strong-performing and effective subensemble. Our approach also reduces ensemble size while maintaining high prediction accuracy. We evaluate our algorithm using datasets from Baidu Inc and Backblaze datacenters. Experimental results demonstrate that our approach achieves over prediction accuracy, a low false alarm rate (FAR) of , and extended meantime to data loss (MTTDL) with lead time in advance (LTA) of up to Hrs. and Hrs., respectively. SWEP-RF outperforms classical models and current state-of-the-art techniques in prediction accuracy, FAR, MTTDL, processing time, memory consumption, and cloud availability. Our method is a promising solution for enhancing cloud storage reliability through proactive LSE prediction.

Articles_bottom
ExaGrid
AIC
ATTOtarget="_blank"
OPEN-E