DIverse DatabasE ReplicatiOn - Performance Comparison (DIDERO-PC)

Lead Research Organisation: City, University of London
Department Name: Centre for Software Reliability

Abstract

This proposal is about experimental evaluation of diverse database replication protocols. It aims at comparing the performance of several database replication protocols, which protect against non-crash failures and are based on use of diverse, i.e. different by design, database management systems (DBMSs). These DBMSs are developed by different vendors, and can be either commercial (e.g. Microsoft SQL Server, Oracle), or open-source products (e.g. MySQL, PostgreSQL). The experimental evaluation will be conducted using two approaches: a model-based approach and a direct experimentation on a testbed with "real" systems.
Diverse database replication protocols have a potential of providing higher dependability assurance than traditional approaches that are based on the use of the DBMSs from the same vendor (research from the Centre for Software Reliability (CSR) at City University London has contributed significantly to that finding). However, besides achieving appropriate dependability level, system designers and other DBMS stakeholders are interested in the performance characteristics offered by alternative solutions. For many applications performance is, in fact, the dominant criterion in the selection process.
A fair and thorough performance comparison between database replication protocols, including the ones studied in this proposal, is of great benefit to system designers when making decisions about which one is most suitable to integrate in their system. This is the main motivation of the proposal.
The project aims at comparing the performance of three database replication protocols which guard against non-crash failures and are based on use of diverse DBMSs: our DivRep (the reference point), SES and Byzantium. There are two main objectives of the particular performance evaluation to be achieved:
- Objective 1 - evaluation based on using "real" systems on an experimental testbed, i.e. using respective implementations of the replication protocols together with the real DBMSs.
- Objective 2 - model-based evaluation, whereby each replication protocol, as well as DBMS resources, are represented with a suitable model (implemented in a particular tool of choice) and the protocol performance measured by simulation of the respective model.
Performance comparison of DBMSs and related replication protocols is too often based only on the comparison that uses (synthetic) benchmarks, which in most cases do not reflect the operational profile of the system under test. In addition, there are limits to what one can achieve with performance comparison based only on experimental evaluation with "real" systems: one can feasibly explore only a limited range of parameter values (such as number of replicas, ratio between read and write operations, etc). In contrast, model-based approach offers greater flexibility by providing means for evaluation with a wide range of configuration parameters in a practical manner. Therefore, the proposal aims at satisfying the following objective:
- Objective 3 - provision of a general-purpose methodology for model-based performance evaluation of database replication protocols.

Planned Impact

A database replication protocol based on the use of diverse Database Management Systems (DBMSs) has been developed by the researchers at the Centre for Software Reliability at City University London. This protocol is referred to as DivRep. The findings of the project could confirm our intuition that DivRep is characterized with good performance. That result will be a significant justification towards building a fully functional diverse database replication solution (referred colloquially in CSR as DivSQL), which requires several other components apart from the replication protocol DivRep. Instigating the development of DivSQL and turning it into a hi-tech software product could, in turn, have far-reaching effects: it will open new employment opportunities for highly skilled personnel (particularly in software development field) and, more generally, will improve the competitiveness of the UK economy. As a result, the society as a whole will benefit financially from the success of the project and so will the institution (City University London).
CSR has invested considerably, mainly through funding from the City University London, in setting an Innovation vision and corresponding strategy. The main pilot project considered as part of the Innovation initiative at CSR is commercialisation of DivSQL. CSR team has been negotiating different commercialisation alternatives with the University's Technology Transfer team; the most viable of which is a set-up of a spin-off company. Our DivRep replication protocol is at the heart of DivSQL, and the project results would feed back directly to the plans for DivSQL development; e.g., they would help us assess DivSQL performance and identify necessary changes to be made. Nevertheless, the results obtained in the project are not meant to be self-serving: any company interested in developing solutions for dependable database replication will benefit from the measurement results and model-based methodology for database replication assessment provided by this project.
The increasing reliance on IT systems by various businesses, of different sizes and from virtually all industrial sectors, is exemplified by the pervasive use of database software. Despite contributing to improved business productivity, these complex software packages do fail, occasionally with serious consequences, e.g. a corruption in Oracle database has left JPMorgan Chase online and mobile banking non-operational for more than 24 hours (http://www.banktech.com/articles/227700032). A part of the failures will manifest themselves as crashes and will be acted upon by system operators, but many will not be self-evident and will remain unnoticed, possibly for long periods of time, causing unknown damage to the system. It is the latter, more subtle, type of failures that diverse database protocols, including our DivRep, protect against and as such aid business continuity. Early adoption of this disruptive technology can be a valuable asset of UK businesses in a long run. If the adoption materializes, the results from this project will help system designers in making informed decision if, and when, to deploy diversity: the detailed analysis of the cost of diverse redundancy for database replication protocols (in terms of performance overhead) will be a valuable contribution in that respect.

Publications

10 25 50
 
Description We have developed an approach to, and associated model and software for, probabilistic, model-based evaluation of database replication protocols.

The approach focuses on the evaluation of performance, the main non-functional characteristic of database replication protocols. It can be extended to cover evaluation of other non-functional properties, such as reliability.

In addition, we have performed extensive performance testing of our replication protocol - DivRep: both using the "real" systems (e.g. database management servers PostgreSQL and Firebird), as well as the newly developed model-based approach.
Exploitation Route Extending the approach and the tool support to cover the evaluation of dependability characteristics, such as reliability, is a promising future work.

Further evaluation and validation of the methodology and the tool support would be beneficial (e.g. using different replication protocols).

We have used the tool to mainly evaluate our own database replication protocol (DivRep). Thus using the approach and the software to evaluate different protocols would strengthen our confidence in their quality, and improve on the technology readiness level of the software.
Sectors Digital/Communication/Information Technologies (including Software),Education

 
Title A conceptual model for probabilistic, model-based performance evaluation of database replication protocols 
Description A stochastic model for probabilistic, model-based performance evaluation of database replication protocols. The model has been validated using the data obtained from the measurements with real systems (e.g. open-source database servers: PostgreSQL and Firebird, and the implementation of our database replication protocol DivRep). Performance evaluation via stochastic modelling has a number of advantages in comparison with the empirical performance measurements, among them: - Saving on ICT infrastructure costs and time on measurement. In our own experience, performance measurement takes a lot of time especially if performance needs to be measured in a wide range of conditions (varying workload, load, size of the data processed by the tested system, hardware constraints, etc.); - Evaluating performance variability, i.e. performance is evaluated not as a single number (e.g. throughput), but as a distribution of the measure of interest. This will take very extensive effort in order to get high confidence in the results. Stochastic models may offer dramatic changes in these circumstances; - Extrapolating from observed measurements to different operational environments (e.g. higher load) is difficult; performance does not scale linearly. This task is relatively easy to solve with stochastic models although the models should be sufficiently accurate in order to preserve the ranking between competing products in the new environment. Making sure that models provide this property is an important part of validation. - Replication protocols - which one to choose. Being able to compare the replication protocols at higher level of abstraction (eliminating the limitations of a particular implementation) is useful before one commits to a particular replication protocol. Typically, new protocols are compared with the performance of a single product, but rarely two competing replication protocols can be measured without the comparison being affected by the quality of the particular implementations. The ranking of the products may reflect the quality of the implementations, not the quality, lack thereof, of the available replication solutions. N.B. We have developed software that implements this conceptual model. The software is based on the Mobius tool distributed by the University of Illinois at Urbana-Champaign in the USA. (please see the associated entry in the 'Software and Technical Products' category). 
Type Of Material Computer model/algorithm 
Provided To Others? No  
Impact N/A 
 
Title Results from real systems, and simulation, experiments with DivRep database replication protocol 
Description We have performed extensive performance evaluation of our replication protocol - DivRep. This has been done in two ways: - using the "real" systems, e.g. performance of the individual database management servers (DBMSs): PostgreSQL and Firebird, and the performance of the replicated system comprising these two DBMSs and our replication protocol - DivRep - using the newly developed model-based, stochastic approach The results are stored in several Microsoft SQL databases. The databases are rather detailed, e.g. they include response times of individual database transactions as well as SQL operations. The size of this extensive data is in order of several hundreds of Gigabytes and thus is available on request. Please contact the PI directly if you are interested in obtaining this data (see below the personal webpage address of the PI for his contact details). 
Type Of Material Database/Collection of data 
Provided To Others? No  
Impact This data can be used by other researchers interested in i) performance of our replication protocol, and possibly comparison to other database replication protocols, and even ii) the performance of the individual database servers on a given SW and HW configuration. 
URL https://www.city.ac.uk/people/academics/vladimir-stankovic
 
Description As a result of DIDERO-PC First Grant project, and its publication in QEST '15 conference, a collaboration between our research centre and the research group from Imperial College London led by Prof Peter Harrison and Prof William Knottenbelt was established. We have submitted a joint EPSRC proposal (EP/P025994/1) with the tile "ACORNS: Autonomous Control and On-the-fly Reconfiguration of NoSQL Data Stores" on 24th Oct. 2016 to EPSRC. The proposal was unfortunately unsuccessful - it was a parti 
Organisation Imperial College London
Department Department of Computing
Country United Kingdom 
Sector Academic/University 
PI Contribution We have worked jointly on the proposal.
Collaborator Contribution We have worked jointly on the proposal.
Impact EPSRC proposal (EP/P025994/1) with the tile "ACORNS: Autonomous Control and On-the-fly Reconfiguration of NoSQL Data Stores"
Start Year 2015
 
Title A conceptual model and the associated software for probabilistic, model-based performance evaluation of database replication protocols 
Description The software we developed implements our conceptual, probabilistic model for performance evaluation of database replication protocols (please see the associated entry in the category Research Databases and Models). We have used the software to measure the performance of DivRep - a database replication protocol developed in the Centre for Software Reliability (CSR) at the City University London. The evaluation is done using TPC-C - an industry standard benchmark from the Transaction Processing Council consortium. The software is implemented in the Mobius tool, which is distributed by the University of Illinois at Urbana-Champaign in the USA (https://www.mobius.illinois.edu/). The team at CSR have significantly extended the Mobius model with our own C++ code to implement the model's logic. NB. The latest version of the software is available from http://openaccess.city.ac.uk/13069/ . Alternatively, please contact Dr Vladimir Stankovic (please see contact details at https://www.city.ac.uk/people/academics/vladimir-stankovic). Please note the following regarding the use and licencing of the software: - The software is available without any warranty. - No liability for any possible damages applies. - Although we will welcome the reports of software faults, we do not guarantee any actions as a result to such reports. - The use of the Mobius tool is necessary for running the software. Anyone wanting to run the software would need a valid Mobius license. The use of the Mobius tool has been traditionally granted free of charge for the academic purposes. Any enquiries about Mobius tool should be directed to the Mobius team at staff@mobius.illinois.edu or mobius-users@crhc.illinois.edu . 
Type Of Technology New/Improved Technique/Technology 
Year Produced 2014 
Impact The conceptual model and the software that implements it provide for a new approach in evaluation of database replication protocols. The model and the associated software are recent outcomes; we nevertheless envisage that they will be used for the following: - saving on ICT infrastructure costs and time for measurement. In our own experience, performance measurement takes a lot of time especially if performance needs to be measured under a wide range of conditions (varying workload, load, size of the data processed by the tested system, hardware constraints, etc. Having a model-based approach helps alleviate this. - evaluating performance variability, i.e. performance is evaluated not as a single number (e.g. throughput), but as a distribution of the measure of interest. This takes very extensive effort in order to get high confidence in the results; our stochastic model may offer dramatic benefits in these circumstances. 
URL http://openaccess.city.ac.uk/4744/
 
Title TPC-C Client - Java-based implementation of the industry standard benchmark for database performance evaluation 
Description An open-source, Java-based implementation of TPC-C, a benchmark for performance evaluation of (Relational) Database Management Systems ((R)DBMSs). TPC-C specifies an On-Line Transaction Processing (OLTP) application. The implementation closely resembles the TPC-C specification. The tested RDBMSs include Firebird, Microsoft SQL and PostgreSQL. 
Type Of Technology Software 
Year Produced 2016 
Open Source License? Yes  
Impact The software facilitates perfromance evaluation of individual Database Management Systems (DBMSs). 
URL http://openaccess.city.ac.uk/16037/
 
Description Visit by Prof Ricardo Jimenez-Peris, Technical University of Madrid, Spain. 
Form Of Engagement Activity Participation in an open day or visit at my research institution
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Other academic audiences (collaborators, peers etc.)
Results and Impact Our research team presented the project goals and the findings available at the time to Prof Jimenez-Peris.

Prof Jimenez-Peris gave a seminar on the topic of scalable database replication to the whole of the School during his visit.

N.B.: The presentations given by the members of our team are available on request.

Our research team shared the expertise on the research directions, and discussed the intermediate project decisions with the internationally leading expert in the field of database replication - Prof Ricardo Jimenez-Peris.

This event contributed to our continued effort in dissemination of the research findings.
Year(s) Of Engagement Activity 2013
 
Description Visit to the research group of Prof Fernando Pedone, University of Lugano, Switzerland. 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Other academic audiences (collaborators, peers etc.)
Results and Impact The presentations given by our team, as well as the ones given by the hosts, were mutually well-received and activated interesting technical discussions and opened up possibilities for future collaboration.

NB. The presentations given by our team are available for perusal to interested parties on request.


This visit to the research group of the internationally renowned expert in distributed systems and database replication, Prof Fernando Pedone, helped us in focusing our research and instigated paths to possible future collaboration.
Year(s) Of Engagement Activity 2014