Replica determinism and flexible scheduling in hard real-time dependable systems

S Poledna, A Burns, A Wellings, P Barrett

Research output: Contribution to journal, Article, peer-review

Abstract

Fault-tolerant real-time systems are typically based on active replication where replicated entities are required to deliver their outputs in an identical order within a given time interval. Distributed scheduling of replicated tasks, however, violates this requirement if on-line scheduling, preemptive scheduling, or scheduling of dissimilar replicated task sets is employed. This problem of inconsistent task outputs has been solved previously by coordinating the decisions of the local schedulers such that replicated tasks are executed in an identical order. Global coordination results either in an extremely high communication effort to agree on each schedule decision or in an overly restrictive execution model where on-line scheduling, arbitrary preemptions, and nonidentically replicated task sets are not allowed. To overcome these restrictions, a new method, called timed messages, is introduced. Timed messages guarantee deterministic operation by presenting consistent message versions to the replicated tasks. This approach is based on simulated common knowledge and a sparse time base. Timed messages are very effective since they neither require communication between the local schedulers nor restrict the use of on-line flexible scheduling, preemptions, and nonidentically replicated task sets.
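The idea of presenting consistent message versions can be illustrated with a minimal sketch. The abstract does not give the mechanism in detail, so the following is an assumption made for illustration only: each message version carries a validity instant on a shared tick scale, and a reader selects the newest version that is valid at a fixed read instant (for example its release time) rather than whatever happens to be in the buffer. The names `TimedMessage`, `send`, `receive`, and `replica` are hypothetical and do not come from the paper.

```python
from bisect import bisect_right


class TimedMessage:
    """Hypothetical message variable that keeps timestamped versions."""

    def __init__(self):
        self._valid_from = []  # validity instants in ticks, kept sorted
        self._values = []      # value stored for each validity instant

    def send(self, valid_from, value):
        # Insert in timestamp order, so the real arrival order is irrelevant.
        i = bisect_right(self._valid_from, valid_from)
        self._valid_from.insert(i, valid_from)
        self._values.insert(i, value)

    def receive(self, read_time):
        # Return the newest version whose validity instant is <= read_time.
        i = bisect_right(self._valid_from, read_time)
        if i == 0:
            raise LookupError("no version is valid at this instant")
        return self._values[i - 1]


def replica(sender_runs_first):
    """Simulate one replica; only the local execution order differs."""
    msg = TimedMessage()
    msg.send(0, "old")
    if sender_runs_first:
        msg.send(10, "new")       # assumed: valid from the sender's deadline
        seen = msg.receive(8)     # assumed: reader uses its release time
    else:
        seen = msg.receive(8)
        msg.send(10, "new")
    return seen, msg.receive(12)


print(replica(True), replica(False))
```

In this sketch both replicas read the same version at tick 8 and the same newer version at tick 12, although their local schedulers ran the sender and the reader in different orders and exchanged no messages to agree on a schedule.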

Original language: English
Pages (from-to): 100-111
Number of pages: 12
Journal: IEEE Transactions on Computers
Volume: 49
Issue number: 2
Publication status: Published - Feb 2000

Bibliographical note

Copyright © 2000 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

Keywords

  • distributed real-time systems
  • fault tolerance
  • distributed operating systems
  • replica determinism
  • distributed scheduling
  • flexible scheduling
  • common knowledge
  • clocks
