Just a cursory glance saw one major problem with the study. They use the mere act of replacing a drive because it is said to be failing/failed as evidence of drive failure.
I've seen enough SysAdmins who don't understand the difference between a hardware and software error to question this practice. Many an administrator has tried to sweep a mistake under the rug by claiming "hardware failure." (And to be fair to sysadmins, many a field engineer has tried to blame his mistakes on the SA as well!)
A drive returned to the vendor with NTF certainly deserves some consideration. I simply don't believe treating it equal with a verified failure is the most accurate means of addressing the issue.
Originally Posted By: NJC
I found this Google research - Faluire Trends in a Large Disk Drive Population
From the Conclusion:
Quote:
One of our key findings has been the lack of a consistent pattern of higher failure rates for higher temperature drives or for those drives at higher utilization levels. Such correlations have been repeatedly highlighted by previous studies, but we are unable to confirm them by observing our population. Although our data do not allow us to conclude that there is no such correlation, it provides strong evidence to suggest that other effects may be more prominent in affecting disk drive reliability in the context of a professionally managed data center deployment.
I've seen enough SysAdmins who don't understand the difference between a hardware and software error to question this practice. Many an administrator has tried to sweep a mistake under the rug by claiming "hardware failure." (And to be fair to sysadmins, many a field engineer has tried to blame his mistakes on the SA as well!)
A drive returned to the vendor with NTF certainly deserves some consideration. I simply don't believe treating it equal with a verified failure is the most accurate means of addressing the issue.
Originally Posted By: NJC
I found this Google research - Faluire Trends in a Large Disk Drive Population
From the Conclusion:
Quote:
One of our key findings has been the lack of a consistent pattern of higher failure rates for higher temperature drives or for those drives at higher utilization levels. Such correlations have been repeatedly highlighted by previous studies, but we are unable to confirm them by observing our population. Although our data do not allow us to conclude that there is no such correlation, it provides strong evidence to suggest that other effects may be more prominent in affecting disk drive reliability in the context of a professionally managed data center deployment.