The Library
Automation of fault-tolerant graceful degradation
Tools
Lin, Yiyan, Kulkarni, Sandeep and Jhumka, Arshad (2019) Automation of fault-tolerant graceful degradation. Distributed Computing, 32 (1). pp. 1-25. doi:10.1007/s00446-017-0319-x ISSN 0178-2770.
|
PDF
WRAP-Automation-fault-tolerant-Jjumka-2017.pdf - Accepted Version - Requires a PDF viewer. Download (800Kb) | Preview |
Official URL: https://doi.org/10.1007/s00446-017-0319-x
Abstract
Traditionally, (nonmasking and masking) fault-tolerance has focused on ensuring that after the occurrence of faults, the program recovers to states from where it continues to satisfy its original specification. However, a problem with this limited notion is that, in some cases, it may be impossible to recover to states from where the entire original specification is satisfied. For this reason, one can consider a fault-tolerant graceful-degradation program that ensures that upon the occur¬rence of faults, the program recovers to states from where a (given) subset of its specification is satisfied. Typically, the subset of specification satisfied thus would be the critical/important requirements.
In this paper, we initially focus on automatically revising a given fault-intolerant program into a fault-tolerant gracefully degrading program. Specifically, we propose a two-step approach: In the first step, we transform the fault-intolerant program into a graceful program. This program is guaranteed to satisfy only the given subset of specification (e.g., critical requirements). In particular, this step involves adding new behaviors that will satisfy the given subset of the specification. The second step involves utilizing the original program and the graceful program to obtain a fault-tolerant gracefully degrading program. We also develop an algorithm to transform the gracefully degrading program into a distributed gracefully degrading program. Afterwards, the second phase of our transformation can be applied to generate a distributed fault-tolerant gracefully degrading program. We showcase the algorithm with three different non-trivial case studies. Finally, we formalize the problem of multi-graceful degradation and propose an algorithm that solves it and we use a complex case study to showcase the viability of the approach. All the algorithms have polynomial time complexity in the size of the state space of the original program.
Item Type: | Journal Article | |||||||||
---|---|---|---|---|---|---|---|---|---|---|
Subjects: | Q Science > QA Mathematics > QA76 Electronic computers. Computer science. Computer software | |||||||||
Divisions: | Faculty of Science, Engineering and Medicine > Science > Computer Science | |||||||||
Library of Congress Subject Headings (LCSH): | Fault-tolerant computing., Computer software, Software engineering., Distributed databases -- Reliability. | |||||||||
Journal or Publication Title: | Distributed Computing | |||||||||
Publisher: | Springer Verlag | |||||||||
ISSN: | 0178-2770 | |||||||||
Official Date: | February 2019 | |||||||||
Dates: |
|
|||||||||
Volume: | 32 | |||||||||
Number: | 1 | |||||||||
Page Range: | pp. 1-25 | |||||||||
DOI: | 10.1007/s00446-017-0319-x | |||||||||
Status: | Peer Reviewed | |||||||||
Publication Status: | Published | |||||||||
Access rights to Published version: | Restricted or Subscription Access | |||||||||
Date of first compliant deposit: | 6 November 2017 | |||||||||
Date of first compliant Open Access: | 16 December 2018 | |||||||||
RIOXX Funder/Project Grant: |
|
Request changes or add full text files to a record
Repository staff actions (login required)
View Item |
Downloads
Downloads per month over past year