Journal
2003 | Vol. 28, No. 2 | 65-81
Article title
Authors
Title variants
Publication languages
Abstracts
Failure detection is an important building block of distributed computation. The concept of a failure detector, introduced by Chandra and Toueg, simplifies the design and verification of fault-tolerant applications. This theoretical mechanism makes the presentation of such applications and their proofs of correctness more modular. However, an obvious question is how to implement such a useful extension in an asynchronous system. This paper summarises the current knowledge about software failure detector protocols. By comparing their design goals and specific assumptions, we aim to familiarize the reader with the problem of implementing an unreliable failure detector.
Keywords
Year
Volume
Pages
65-81
Physical description
Bibliography: 16 items.
Contributors
author
- Institute of Computing Sci., Poznań University of Technology, Piotrowo 3A, 60-965 Poznań, Poland, jerzy.brzezinski@put.poznan.pl
author
- Institute of Computing Sci., Poznań University of Technology, Piotrowo 3A, 60-965 Poznań, Poland, jacek.kobusinski@put.poznan.pl
Bibliography
- [1] Attiya H., Bar-Noy A., Dolev D., Koller D., Peleg D., Reischuk R., Achievable cases in an asynchronous environment. In Proceedings of the 28th Symposium on Foundations of Computer Science, IEEE Computer Society Press, 1987, 337-346.
- [2] Chandra T.D., Toueg S., Unreliable Failure Detectors for Reliable Distributed Systems, Journal of the ACM, 43, 2, 1996, 225-267.
- [3] Chor B., Dwork C., Randomization in Byzantine agreement. In Advances in Computing Research 5: Randomness and Computation, JAI Press, 1989, 443-497.
- [4] Defago X., Sergent N., Schiper A., Impact of a failure detector on the performance of Consensus. In Proc. of the 8th IEEE Pacific Rim Symposium on Dependable Computing, PRDC-8, Seoul, Korea, 2001, 137-145.
- [5] Demers A., Greene D., Hauser C., Irish W., Larson J., Epidemic algorithms for replicated database maintenance, Proceedings of the sixth annual ACM Symposium on Principles of distributed computing, Vancouver, British Columbia, Canada, 1987, 1-12.
- [6] Dolev D., Dwork C., Stockmeyer L., On the minimal synchronism needed for distributed consensus. Journal of the ACM, 34, 1, 1987, 77-97.
- [7] Dolev D., Lynch N.A., Pinter S.S., Stark E.W., Weihl W.E., Reaching approximate agreement in the presence of faults. Journal of the ACM, 33, 3, 1986, 499-516.
- [8] Dwork C., Lynch N.A., Stockmeyer L., Consensus in the presence of partial synchrony. Journal of the ACM, 35, 2, 1988, 288-323.
- [9] Fetzer C., Enforcing Perfect Failure Detection, The 21st International Conference on Distributed Computing Systems, 2001, 350-360.
- [10] Fischer M.J., Lynch N.A., Paterson M.S., Impossibility of distributed Consensus with one faulty process, Journal of the ACM, 32, 2, 1985, 374-382.
- [11] Guerraoui R., Schiper A., "Γ-Accurate" failure detectors. International Workshop on Distributed Algorithms, Springer Verlag, Bologna, 1996.
- [12] Gupta I., Chandra T.D., Goldszmidt G.S., On Scalable and Efficient Distributed Failure Detectors, In Proceedings of the 20th Annual ACM Symposium on Principles of Distributed Computing, Newport, Rhode Island, USA, 2001.
- [13] Larrea M., Fernandez A., Arevalo S., Efficient Algorithms to Implement Unreliable Failure Detectors in Partially Synchronous Systems. Proceedings of the 13th International Symposium on Distributed Computing, Bratislava, Slovak Rep., 34-48.
- [14] Larrea M., Fernandez A., Arevalo S., Eventually Consistent Failure Detectors. In Proceedings of the 10th Euromicro Workshop on Parallel, Distributed and Network-based Processing, Grand Canaries Island, Spain, 2002, 91-98.
- [15] Ranganathan S., George A.D., Todd R.W., Chidester M.C., Gossip-Style Failure Detection and Distributed Consensus for Scalable Heterogeneous Clusters, Cluster Computing, 4, 3, 2001, 197-209.
- [16] Van Renesse R., Minsky Y., Hayden M., A Gossip-Style Failure Detection Service, IFIP International Conference on Distributed Systems Platforms and Open Distributed Processing, Springer Verlag, The Lake District, England, 1998, 55-70.
Document type
Bibliography
Identifiers
YADDA identifier
bwmeta1.element.baztech-article-BPP1-0035-0079