Evolutionary-Guided Synthesis of Verified Pareto-Optimal MDP Policies

Research output: Chapter in Book/Report/Conference proceedingConference contribution


We present a new approach for synthesising Pareto- optimal Markov decision process (MDP) policies that satisfy complex combinations of quality-of-service (QoS) software requirements. These policies correspond to optimal designs or configurations of software systems, and are obtained by translating MDP models of these systems into parametric Markov chains, and using multi-objective genetic algorithms to synthesise Pareto-optimal parameter values that define the required MDP policies. We use case studies from the service-based systems and robotic control software domains to show that our MDP policy synthesis approach can handle a wide range of QoS requirement combinations unsupported by current probabilistic model checkers. Moreover, for requirement combinations supported by these model checkers, our approach generates better Pareto-optimal policy sets according to established quality metrics.
Original languageEnglish
Title of host publication36th IEEE/ACM International Conference on Automated Software Engineering
Publication statusPublished - 20 Jan 2022

Cite this