InfoTestSS: Information theory and Test Suite Selection

Lead Research Organisation: University College London

Department Name: Computer Science

Abstract

Software testing is an important part of the software development process but typically is manual, expensive, and error prone. This has led to significant interest in automated test generation (and execution) algorithms, with these having the potential to lead to cheaper, higher-quality software. Despite the interest in automating parts of testing, there are still significant challenges, with auto-testing being mentioned as an EPSRC priority within Software Engineering.

This project will build on initial work by the PIs that has demonstrated that an important aspect of testing can be represented in terms of Quantified Information Flow. Specifically, the PIs previously looked at Failed Error Propagation (FEP), which is sometimes called coincidental correctness. In FEP, a test execution goes through a faulty part of the software, this leads to what would be regarded as a corrupted program state (i.e. the fault has an effect) but ultimately the output is correct. Although studies have shown that FEP can significantly reduce test effectiveness, there is a lack of practical techniques that address FEP. The observation made by the PIs is that FEP corresponds to a failure for information to flow from the fault in the software to output: information is lost through different values for the program state (correct and faulty values) being mapped to the same output.

The PIs have shown how FEP can be represented in terms of an information theoretic notion: Quantified Information Flow (QIF). The results of experiments were highly promising, with there being a rank correlation of over 0.95 between the frequency with which FEP was observed in software and a QIF-based metric. This remarkably strong result opens up the possibility of devising techniques that generate test cases that are less likely to suffer from FEP. In addition, we believe that it is possible to represent other important testing concepts using information theory, specifically: the 'feasibility' of a path (we do not want test automation to waste effort in trying to trigger infeasible paths), the diversity of a test suite (evidence suggests that diverse test suites are effective), and also the effectiveness of probes/oracles added to the code.

This project will develop new methods, based on information theory, for reasoning about the above factors (FEP, feasibility, diversity, and oracles). In doing so it will develop information theoretic measures that can help test automation to overcome the associated issues. It will also develop methods for estimating these measures, integrate these estimates into automated test generation, and evaluate the results on open source software and software provided by our industrial partners. The outcome will be a new theory for software testing, based on information theory, and a set of techniques that use this theory to make software testing more efficient and effective.

Planned Impact

Software increasingly underpins our contemporary world. This trend will continue for the foreseeable future. Software relies on testing to ensure that it behaves as required. Testing is perhaps less science than art and there are a number of known, persistent problems in software testing. The proposed research will address four classic problems in selecting a suitable set of test inputs; will show how solutions to these can improve two state-of-the-art approaches to selection; then will produce a prototype tool that automates the solutions. Furthermore, the research plans to do this by modelling the four classic problems using information theory, producing an underpinning theory that has the potential to develop into a general theory of software testing.

short-term impact
- Develop human capital by deepening and extending the understanding of software testing of the researchers and collaborators directly collaborating in the project, both from academia and industry.
- Within the academic software testing community, increase the understanding of software testing and the knowledge of novel techniques to solve problems within this domain; build a new research community in applications of information theory to software testing and to software engineering more generally; provide new lines of research and new research problems for both the QIF and the software testing communities.
- Improve the UK and European quality of life through cheaper, better quality software. This will, particularly and more immediately, benefit the sectors represented by our industrial partners, namely the banking and automotive sectors.
- Add to the number of options in choosing tools for automating testing.

medium-term impact
- Reduce software cost and improve time to market for all forms of software development.
- Improve the sophistication of software testing and the level of skill and knowledge of its practitioners.
- Lay foundations for an information theory based, coherent theory of software testing; make connections to a more general theory of software engineering based on the theory of information.
- Expand the connections between software testing and formal methods via information theory.
- Create a new venue for discussion, publication and dissemination of ideas about these topics.

long-term impact
- Contribute to providing laws, predictions, and an integrated theory for software testing in particular and software engineering in general.
- Improve society's confidence in software affecting many aspects of life.

Funded Value:

£448,001

Funded Period:

Jan 17 - Dec 20

Funder:

EPSRC

Project Status:

Closed

Project Category:

Research Grant

Project Reference:

EP/P005888/1

Principal Investigator:

David Clark

Research Subject:

Info. & commun. Technol. (100%)

Research Topic:

Software Engineering (100%)

Organisations

People	ORCID iD
David Clark (Principal Investigator)
Mark Harman (Co-Investigator)

Publications

Author Name Title Publication Date Published

|< < 1 2 > >|

10 25 50

Jahangirova G (2017) Oracle problem in software testing

Jahangirova G (2018) OASIs: oracle assessment and improvement tool

Bruce D (2019) Search-Based Software Engineering - 11th International Symposium, SSBSE 2019, Tallinn, Estonia, August 31 - September 1, 2019, Proceedings

Clark D (2019) Normalised Squeeziness and Failed Error Propagation in Information Processing Letters

Menendez H (2019) VARMOG: A Co-Evolutionary Algorithm to Identify Manifolds on Large Data

Langdon W (2020) Evolving sqrt into 1/ x via software data maintenance

G. Jahangirova (2020) An Empirical Study of Failed Error Publication in Java Programs with Real Faults

Langdon W (2021) Genetic programming convergence in Genetic Programming and Evolvable Machines

Jahangirova G (2021) An Empirical Validation of Oracle Improvement in IEEE Transactions on Software Engineering

Langdon W (2021) Dissipative polynomials

Key Findings
Further Funding
Collaboration
Software and Technical Products
Engagement Activities


Description	Have developed an open source tool that enables programmers to test assertion oracles embedded in code to improve testing. Have developed a highly novel open source tool that uses output diversity to select test sets and has better error finding capabilities than conventional coverage or input diversity approaches. Have developed a tool that covers a selected point in a program with an input diverse set of tests. This latter is mostly a research tool but could be useful in regression testing as part of a continuous development cycle. Have developed a program transformation that is semantically invariant and when used with a feedback directed mutational fuzzer measurably improves the diversity of the generated tests and improves bug finding. Towards the end of the project, employment of a new research associate has opened up a new research direction with regard to the effects of entropy loss on software robustness. This has now become an important theme in my research and has led to funding from Meta.
Exploitation Route	Programmers in Java and C can download and use these tools to assist them in testing their programs. The new program transformation is not easily packaged into a tool but we are researching ways to automate it. The entropy loss work has potential for impact through liaison with Meta and a further EPSRC proposal.
Sectors	Aerospace, Defence and Marine,Agriculture, Food and Drink,Chemicals,Communities and Social Services/Policy,Construction,Creative Economy,Digital/Communication/Information Technologies (including Software),Education,Electronics,Energy,Environment,Financial Services, and Management Consultancy,Healthcare,Leisure Activities, including Sports, Recreation and Tourism,Government, Democracy and Justice,Manufacturing, including Industrial Biotechology,Culture, Heritage, Museums and Collections,Pharmaceu


Description	Academic Outreach scheme
Amount	$140,000 (USD)
Organisation	Facebook
Sector	Private
Country	United States
Start	08/2021
End	08/2022


Description	ELVEN: Entropy Loss Via ExecutioN
Amount	$100,000 (USD)
Organisation	Facebook
Sector	Private
Country	United States
Start


Description	Collaboration with Fondazione Bruno Keller, Trento, Italy
Organisation	Fondazione Bruno Kessler
Country	Italy
Sector	Private
PI Contribution	I have co-authored a paper with Professor Paolo Tonella and we jointly supervise a PhD student. The student is supported by the grant funding from FBK.
Collaborator Contribution	FBK have supplied the funding for a PhD student, Gunel Jahangirova, whom Professor Tonella and I jointly supervise. We have collaborated extensively, producing one paper so far.
Impact	Gunel Jahangirova, David Clark, Mark Harman, Paolo Tonella: Test oracle assessment and improvement. ISSTA 2016: 247-258.
Start Year	2015


Title	DFT: Diversified Focused Testing
Description	DFT supports focused testing, that is, it generates a set of test inputs sampled uniformly that cover a selected point in the software.
Type Of Technology	Software
Year Produced	2019
Open Source License?	Yes
Impact	none as yet
URL	https://github.com/hdg7/DFT


Title	OaSiS
Description	This software allows a programmer to improve assertions placed in the code via tests for false negatives and false positives with respect to the intended behaviour of the program. The result of the testing process is that the placed assertion will be sound and complete with respect to the software's intended behaviour.
Type Of Technology	Software
Year Produced	2018
Impact	This has been used by other researchers in attempts to eliminate the necessity for a human in the loop and so to fully automate the method
URL	https://github.com/guneljahan/OASIs


Title	OutGen: A white box method of automated test set generation using output diversity
Description	This is an automated test set generator for C language functions. It embodies a novel principle, namely optimising the channel capacity of the testing channel, allowing discovery of inputs that exercise rarer software behaviours
Type Of Technology	Software
Year Produced	2019
Open Source License?	Yes
Impact	None as yet
URL	https://github.com/hdg7/OutGen


Description	IEEE Software Engineering Blog post
Form Of Engagement Activity	A broadcast e.g. TV/radio/film/podcast (other than news/press)
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Professional Practitioners
Results and Impact	A technical blog post explaining the relationship between information loss and software robustness including a highly visual example of an abnormally large arithmetic expression and demonstrating how and where information loss causes a tolerance to errors in the expression.
Year(s) Of Engagement Activity	2021
URL	http://blog.ieeesoftware.org/2021/09


Description	Presentation of the paper, HyperGI: Automated Detection and Repair of Information Flow Leakage
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Professional Practitioners
Results and Impact	Presentation of the paper, HyperGI: Automated Detection and Repair of Information Flow Leakage at the New Ideas and Emerging Results track of the international conference on Automated Software Engineering.
Year(s) Of Engagement Activity	2021
URL	https://conf.researchr.org/details/ase-2021/ase-2021-nier-track/3/HyperGI-Automated-Detection-and-Re...


Description	Robustness survey presentation
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Professional Practitioners
Results and Impact	Presentation of the short paper, Software Robustness: A Survey, A Theory and Prospects, at the Ideas, Visions and Reflections track at ESEC-FSE 2021. The presentation was by recorded video.
Year(s) Of Engagement Activity	2021
URL	https://2021.esec-fse.org/details/fse-2021-ideas-visions-and-reflections/6/Software-Robustness-A-Sur...


Description	TAROT Summer School 2018
Form Of Engagement Activity	Participation in an activity, workshop or similar
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Professional Practitioners
Results and Impact	Summer School on research in software testing held at UCL. Organised by the InfoTestSS team at UCL and Brunel. David Clark presented a talk on InfoTestSS research. Took place over five days an involved many international experts.
Year(s) Of Engagement Activity	2018
URL	https://wp.cs.ucl.ac.uk/tarot2018/


Description	Talk at Leicester Computer Science Department External Seminar series, 7 December
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	Regional
Primary Audience	Professional Practitioners
Results and Impact	Talk on information theory and testing.
Year(s) Of Engagement Activity	2018
URL	https://www2.le.ac.uk/departments/informatics/news/researchevents/external/external-seminars-201819


Description	Talk in Journal First track at ESEC/FSE 2021
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Professional Practitioners
Results and Impact	Talk on our journal paper, Output Sampling for Output Diversity in Automatic Unit Test Generation, at the Journal First track of the ESEC-FSE conference in 2021. The talk was given as a live video presentation.
Year(s) Of Engagement Activity	2021
URL	https://2021.esec-fse.org/details/fse-2021-journal-first/5/Output-Sampling-for-Output-Diversity-in-A...


Description	Teaching course in Testing Software: University College MSc in Software Systems Engineering
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Postgraduate students
Results and Impact	I taught a five week course to MSc students and final year undergraduates. In the course of this I discussed results from InfoTestSS
Year(s) Of Engagement Activity	2021
URL	https://rl.talis.com/3/ucl/lists/CE9E51CA-5518-8200-4888-81B08F02C216.html?lang=en-gb


Description	The 57th CREST Open Workshop - Information Theory and Software Testing
Form Of Engagement Activity	Participation in an activity, workshop or similar
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Professional Practitioners
Results and Impact	This was a workshop designed to introduce the Information Theory for Test Set Selection project to academics and industry. Almost 50 people attended with 40 people formally registered. The audience was drawn from different countries with the majority of the participants being from the UK.
Year(s) Of Engagement Activity	2018
URL	http://crest.cs.ucl.ac.uk/cow/57/


Description	Workshop on Statistics, Learning, Genetic Improvement and Testing for Programs
Form Of Engagement Activity	Participation in an activity, workshop or similar
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Professional Practitioners
Results and Impact	The 61st CREST Open Workshop - Statistics, Learning, Genetic Improvement and Testing for Programs. Date: 21st and 22nd October 2019 Venue: Friends House, 173-177 Euston Road, London, NW1 2BJ Overview: Software testing is expanding, growing, and finding new connections and uses like some strange mutant changeling. This workshop aims to reflect and celebrate this. To this end we want to attract software engineering researchers with interest and expertise in some of the following areas: applications of statistics to problems in SE, applications of machine learning to problems in SE, Genetic Improvement of Programs and, last but not least, Software Testing. We are interested in both the intersections and the unions of these areas and hope to find new research synergies beyond those that exist already. Examples of illustrative developments in this union from within CREST include Software Transplantation, novel approaches to testing machine learning systems, information theoretic models of failed error propagation and oracle improvement in software testing, the use of deep neural nets to create and reconfigure search landscapes in testing, the use of search in adversarial machine learning, and the use of testing in the application of GI to a long list of types of software improvement. To grow and deepen these connections, this workshop will present a list of talks in the union (and sometimes the intersections) of these areas.
Year(s) Of Engagement Activity	2019
URL	http://crest.cs.ucl.ac.uk/cow/61/

Abstract

Planned Impact

Organisations

People

ORCID iD

Publications