PUMA
Istituto di Scienza e Tecnologie dell'Informazione     
Porcarelli S., Castaldi M., Di Giandomenico F., Bondavalli A., Inverardi P. A framework for reconfiguration-based fault-tolerance in distributed systems. Rogério de Lemos, Cristina Gacek, Alexander Romanovsky (eds.). (Lecture Notes in Computer Science, vol. 3069). Germany: Springer-Verlag Heidelberg, 2004.
 
 
Abstract
(English)
Nowadays, many critical services are provided by complex distributed systems which are the result of the reuse and integration of a large number of components. Given their multi-context nature, these components are, in general, not designed to achieve high dependability by themselves, thus their behavior with respect to faults can be the most disparate. Nevertheless, it is paramount for these kinds of systems to beable to survive failures of individual components, as well as attacks and intrusions, although with degraded functionalities. To provide control capabilities over unanticipated events, we focus on fault handling strategies, particularly on system's reconfiguration. The paper describes a framework which provides fault tolerance of components based applications by detecting failures through monitoring and by recovering through system reconfiguration. The framework is based on Lira, an agent distributed infrastructure for remote control and reconfiguration, and a decision maker for selecting suitable new configurations. Lira allows for monitoring and reconfiguration at components and applications level, while decisions are taken following the feedbacks provided by the evaluation of statistical Petri net models.
Subject System Reconfiguration
Model-based analysis
Petri Nets
D.2.11 Software Architectures
C.4 PERFORMANCE OF SYSTEMS


Icona documento 1) Download Document PDF


Icona documento Open access Icona documento Restricted Icona documento Private

 


Per ulteriori informazioni, contattare: Librarian http://puma.isti.cnr.it

Valid HTML 4.0 Transitional