Home WakeSpace Scholarship › Electronic Theses and Dissertations

Predicting Hard Drive Failures in Computer Clusters

Electronic Theses and Dissertations

Item Files

Item Details

abstract
Mitigating the impact of computer failure is possible if accurate failure predictions are provided. Resources, and services can be scheduled around predicted failure and limit the impact. Such strategies are especially important for multi-computer systems, such as compute clusters, that experience a higher rate of failure due to the large number of components. However providing accurate predictions with sufficient lead time remains a challenging problem. This research uses a new spectrum-kernel Support Vector Machine (SVM) ap- proach to predict failure events based on system log files. These files contain mes- sages that represent a change of system state. While a single message in the file may not be sufficient for predicting failure, a sequence or pattern of messages may be. This approach uses a sliding window (sub-sequence) of messages to predict the likelihood of failure. Then, a frequency representation of the message sub-sequences observed are used as input to the SVM. The SVM associates the messages to a class of failed or non-failed system. Experimental results using actual system log files from a Linux-based compute cluster indicate the proposed spectrum-kernel SVM approach can predict hard disk failure with an accuracy of 80% about one day in advance.
subject
computer science
event prediction
pattern recognition
support vector machines
classification
hard drives
contributor
Featherstun, Robin Wesley (author)
John, David (committee chair)
Fulp, Errin (committee member)
Turkett, William (committee member)
date
2010-05-07T18:30:15Z (accessioned)
2010-06-18T18:59:04Z (accessioned)
2010-05-07T18:30:15Z (available)
2010-06-18T18:59:04Z (available)
2010-05-07T18:30:15Z (issued)
degree
Computer Science (discipline)
identifier
http://hdl.handle.net/10339/14824 (uri)
language
en_US (iso)
publisher
Wake Forest University
rights
Release the entire work immediately for access worldwide. (accessRights)
title
Predicting Hard Drive Failures in Computer Clusters
type
Thesis

Usage Statistics