A survey of techniques for modeling and improving reliability of computing systems

Conference/Journal
IEEE
Authors
Sparsh Mittal, Jeffrey S. Vetter
Abstract
Recent trends of aggressive technology scaling have greatly exacerbated the occurrence and impact of faults in computing systems. This has made 'reliability' a first-order design constraint. To address the challenges of reliability, several techniques have been proposed. This paper provides a survey of architectural techniques for improving the resilience of computing systems. We especially focus on techniques proposed for microarchitectural components, such as processor registers, functional units, caches, and main memory. In ...