Synthesis and Verification in Markov Game Structures

Lead Research Organisation: University of Liverpool

Department Name: Computer Science

Abstract

To meet the objectives of our project, we will divide our research into five work packages.The first work package is devoted to representing the control problems that we want to approach and to identify benchmarks and case studies as guidelines for relevant demands and measures of success for the applied aspects of our project. The starting point for our models will be- a generalisation of interactive Markov chains to 2.5 player games, a model in which the decisions of the different players are physically separated by assigning them to different states, and- a generalisation of Markov decision processes to Markov games, a model in which the decisions of both (or, more generally, of all) players are entangled and represented in the same node.We will extend these models by representations of the observational and computational power of the controllers under consideration, and formalisations of the--simple--objectives we want to meet.Additionally, we will develop benchmarks and case studies to guide the applied aspects of our project, and to root it in different communities--in particular in engineering and IT--by reflecting their respective demands.Work packages two, three, and four from the theoretic core of our work. Our second work package will address the simple question of constructing controllers with complete information, while a third work package will address the generalisation of these techniques to controllers with incomplete information but, for distributed controllers, equivalent information about the system state.Different to discrete systems, the abstraction (or restricted observability) of time plays a paramount role when considering incomplete information of these systems. This particular type of abstraction has proven to often simplify the construction of optimal strategies: The construction of optimal time-abstract strategies (and the proof of their existence) is much simpler than the construction of time dependent ones.The fourth work package refers to the extension of these results to distributed schedulers with different observational power.For work packages two, three, and four, we will study the decidability of quantitative and qualitative safety and reachability properties.In a fifth work package we will focus on algorithmic aspects like the development and selection of appropriate data structures of the model checking and optimisation problems, and develop prototype implementations that solve as a proof-of-concept for their applicability for a selection of the developed approaches. These proof-of-concept implementations will also play an important role in determining the applicability and potential of the techniques developed in the project on the target implementations defined in the first work package, and as means to communicate our results for dissemination and exploitation purposes.

Planned Impact

The long term aim of our research is to develop feasible techniques for the automatic development of high quality control strategies. We consider this research effort to be an important step towards this goal: The expected results will provide us with a thorough understanding of the theoretical as well as of the practical challenges involved in the construction and approximation of optimal control. During the conduct of the research effort, we expect to gain insights and experience in the automated design of good scheduling policies that will lead to efficient algorithmic solutions and proof-of-concept implementations to prove their feasibility. These results can serve as a foundation for follow-up research that can lead to - the construction of first and second generation tools that tap the potential of the developed algorithmic concepts; - an extension of the conceptual coverage of research questions, for example by including cost functions or by integrating a symbolic representation of data; and - the integration of technical restrictions of industrial controllers. After completion of the project, we expect our results to gradually mature from an academic proofs-of-concept to wider applicability in these directions. Starting with the development of efficient tools for the automated analysis of the specifications in the languages developed during this research effort, we envisage the development of tools with gradually increasing expressive power and efficiency, guided by the demands of the engineering and IT industry and scientific curiosity alike. It is our belief that such tools could quickly become powerful enough to attack small applications of industrial interest, and finally extend to applicability in the development of complex control scenarios, whose analysis defies traditional manual approaches. Our results will therefore benefit the regional and national engineering industry on the long run, because our results will lead to improved control strategies, and hence to higher time and energy efficiency. The general public will benefit from this through reduced costs.

Funded Value:

£335,486

Funded Period:

Oct 10 - Nov 13

Funder:

EPSRC

Project Status:

Closed

Project Category:

Research Grant

Project Reference:

EP/H046623/1

Principal Investigator:

Sven Schewe

Research Subject:

Info. & commun. Technol. (100%)

Research Topic:

Fundamentals of Computing (100%)

Organisations

University of Liverpool (Lead Research Organisation)

People	ORCID iD
Sven Schewe (Principal Investigator)
Andrew Shenton (Co-Investigator)
Dominik Wojtczak (Co-Investigator)

Publications

Author Name

Title Publication Date Published

|< < 1 2 3 4 5 > >|

10 25 50

Fearnley J (2012) Automata, Languages, and Programming

Fearnley J (2012) Automated Technology for Verification and Analysis

Fearnley J (2015) Reachability in two-clock timed automata is PSPACE-complete in Information and Computation

Fearnley J (2011) Mathematical Foundations of Computer Science 2011

Fearnley J (2012) Algorithmic Game Theory

Fearnley J (2014) Finding approximate nash equilibria of bimatrix games via payoff queries

Fearnley J (2015) Synthesis of succinct systems in Journal of Computer and System Sciences

Hahn E (2017) Verification, Model Checking, and Abstract Interpretation

Hainy M (2022) Optimal Bayesian design for model discrimination via classification. in Statistics and computing

Huang C (2016) A Game-Theoretic Foundation for the Maximum Software Resilience against Dense Errors in IEEE Transactions on Software Engineering

Key Findings
Further Funding


Description	We have developed 1. techniques for finding small and concise solutions to optimisation and verification problems, 2. techniques for the fast approximation of optimal solutions for Markov games and decision processes, 3. techniques that capitalise on the scarceness of faults without referring to probabilities, and 4. specialised solutions for related games with special structures. We have also established the existence of finite optimal control for Markov games and decision processes, of optimal (albeit not finite) control for Markov games played on timed automata, and the non-existence of optimal control for Markov games with one player without access to time.
Exploitation Route	The route of making the probabilities abstract in an approach that optimises the fast recovery from rather than the number of tolerable faults is a very simple approach with the potential of being applied. The techniques for finding small and succinct models has potential, but currently lacks a good guidance for finding the solutions. The ad-hoc solutions use SAT and SMT solvers, but it seems that their search heuristics are inappropriate for the problem class. Genetic algorithms have been used with slightly better success, but there are more techniques -- like learning based ones -- that have not been tried yet. A more immediate route to exploitation is given through the development of an efficient and easy-to-use model checker, IscasMC. This web-based model checker facilitates automata based techniques developed in this project and will be a platform for follow-up work.
Sectors	Digital/Communication/Information Technologies (including Software)


Description	Analysis of Probabilistic Systems
Amount	£11,950 (GBP)
Organisation	The Leverhulme Trust
Sector	Charity/Non Profit
Country	United Kingdom
Start	11/2011
End	07/2012


Description	Liverpool India Fellowships
Amount	£8,200 (GBP)
Organisation	University of Liverpool
Sector	Academic/University
Country	United Kingdom
Start	05/2014
End	10/2014


Description	Visiting Fellowship
Amount	£3,000 (GBP)
Organisation	University of Liverpool
Sector	Academic/University
Country	United Kingdom
Start	04/2014
End	05/2014

Abstract

Planned Impact

Organisations

People

ORCID iD

Publications