BabyTalk: Generating Textual Summaries of Clinical Temporal Data

Lead Research Organisation: University of Aberdeen
Department Name: Computing Science

Abstract

Intensive care units (ICUs) exist to provide specialised care to patients (in our case premature babies) who are very ill. Modern electronic technology has made it possible to make large numbers of measurements on the baby. Heart rate, blood pressures, temperatures and levels of oxygen and carbon dioxide in the blood can all be measured continuously as often as once per second. In addition, the introduction of the 'paperless' ICU means that test results, drug prescriptions, equipment settings, etc. are all entered routinely at the cotside by the medical staff. However, it is not at all clear that people are capable of making full use of all this information. The conventional approach is to display the data as graphs, but we have run experiments which showed that under some circumstances ICU doctors and nurses make better decisions when they are given written summaries in English. It would be beneficial if we could provide these summaries on a routine basis, but in our experiments, the summaries had to be specially written by expert clinicians. Our proposed solution is to develop computer software which would generate a summary automatically whenever anyone wanted. We hope that this would lead to better decisions and therefore better care for the babies. Automatically written summaries could be used for other purposes. For example, at the end of a long 12 hour shift, the nurse has to write a report of what has happened on her shift to give to the nurse who is taking over. If this could be done automatically, it would save time and avoid the possibility of significant events being forgotten. Another possibility which is being explored, is the possibility of giving parents a daily summary of how their baby is doing. When a baby is in intensive care, it is clearly a very anxious and emotional time for the family, and any automatic summary would have to be written in such a way that was sensitive to this. From a computing perspective, the main challenge in generating English summaries of medical data is to choose a small number of words which effectively communicate the important information from megabytes of sensor data. We have successfully developed software to generate textual summaries of weather data (that is, to generate textual weather forecasts), but generating summaries of medical data is harder because there is much more data (megabytes instead of kilobytes), and also because the summaries can be used for many different purposes, including helping doctors make decisions, helping nurses write reports, and informing and reassuring the families of patients.

Publications

10 25 50

publication icon
Hunter J (2011) BT-Nurse: computer generation of natural language shift summaries from complex heterogeneous medical data. in Journal of the American Medical Informatics Association : JAMIA

publication icon
Mishra S (2023) 'Cyclic syndrome' of arrears and efficiency of Indian judiciary. in SN business & economics

publication icon
Moncur W (2010) Modeling the socially intelligent communication of health information to a patient's personal social network. in IEEE transactions on information technology in biomedicine : a publication of the IEEE Engineering in Medicine and Biology Society

publication icon
Moncur W (2014) Providing Adaptive Health Updates Across the Personal Social Network in Human-Computer Interaction

 
Description Babytalk showed that a computer system could automatically generate useful summaries of clinical data for a baby in a neonatal intensive care unit, for doctors (decision support), nurses (shift handover), and parents (stress reduction). There were many detailed technical advances, but perhaps the biggest finding was simple "it can be done"
Exploitation Route Babytalk ideas can be applied to many other medical contexts, they can also be applied to engineering contexts (where what is being monitored is not a baby but a complex piece of machinery). Aberdeen University establishe a spinout company, Data2text, in 2009, to commercialise the data-to-text technology developed in Babytalk and other projects. This company is exploring opportunities to commercialise some of the Babytalk ideas. Unfortunately commercial confidentiality issues do not allow me to give details about Data2text's clients or projects
Sectors Digital/Communication/Information Technologies (including Software),Healthcare

URL http://inf.abdn.ac.uk/research/babytalk/
 
Description Many of the ideas developed in Babytalk (although not specific IP) have been taken up by Data2text and Arria NLG. For example, see https://www.arria.com/case-study-neonatal-A233.php.
First Year Of Impact 2010
Sector Digital/Communication/Information Technologies (including Software)
Impact Types Economic

 
Description CleverMed Ltd 
Organisation CleverMed
Country United Kingdom 
Sector Private 
Start Year 2006
 
Title simplenlg 
Description a package for doing linguistic realisation in natural language generation systems 
Type Of Technology Software 
Year Produced 2010 
Open Source License? Yes  
Impact used by many academic groups, and also several companies 
URL https://code.google.com/p/simplenlg/
 
Company Name Datatext (now part of Arria NLG) 
Description Data2text is a software company which sells data-to-text systems which use natural language generation systems. In 2013, Data2text was bought by another company, Arria NLG 
Year Established 2009 
Impact I'm not sure what you're looking for here. EPSRC did a short writeup, see http://www.epsrc.ac.uk/files/newsevents/publications/case-studies/2014/growth-stories-big-data-is-big-business/
Website http://www.arria.com/