Deep Learning for Classification of Astronomical Archives

Lead Research Organisation: University of Manchester
Department Name: Physics and Astronomy

Abstract

Astronomy is known for producing highly appealing images which often get published and re-published world-wide. HST images, for instance, have appeared on web sites, news papers, and even stamps. Less well known is that these images are the tip of the ice berg. The large majority of images, even from HST, do not contain such pretty structures. Astronomers locate the news-worthy ones by manual inspection of all the data.

Modern astronomical instruments provide far more data than can even be manually inspected. ALMA, for instance, can produce over 10,000 images from a single observation. Each of these measures a slightly different wavelength of radiation, and many of the images will be empty or near-empty. But without looking at each one, how does the astronomer know which images to select? This project will look at techniques to let computers do the work. They can investigate all images much faster than people can, and can learn which ones are of interest by looking for characteristic patterns. The problem is that they need to be taught what is 'characteristic' something even astronomers may find difficult to put into an algorithm. Malaysian scientists have unparalleled expertise in image classification using deep learning techniques, and in semantic descriptions. This expertise will be brought to the astronomical archives, to develop techniques of computer learning to aid astronomers. Manchester is one of the access points to the ALMA archive. The combination of the data and astrophysical expertise in Manchester and image classification by deep learning in Malaysia will provide new and powerful tools for science.

Planned Impact

Malaysia has set itself the goal to become a high income country. For this purpose, it is preparing for the so-called 'fourth industrial revolution', especially in the areas of computing, big data deep learning. STEM skills have been identified as a main priority. The proposed project brings together academics from Malaysia and the UK, develop new techniques and applications in deep learning, and to bring Malaysian academics into contact with the current developments in astrophysical data.

Astrophysical data is currently stored in archives which can reach pentabyte size. Identification of data of interest is based on external information: which target was observed, and which spectral line. This project will explore using internal information derived by deep learning algorithm from the data itself and from the text of original proposal. it builds on deep-learning expertise already present in Malaysia, related to image content and text mining, and combines it with the astronomical data to develop new techniques. This strongly supports the unique skills of the Malaysian academics, and will lead to project for graduate students. It will also provide a tool to gain access to astronomical archives, which can be used to bring astronomy into the class room. Astronomy is very much suited to hard-to-reach audiences and build interest in studying STEM topics.

Publications

10 25 50
 
Description (EXPLORE) - Innovative Scientific Data Exploration and Exploitation Applications for Space Sciences
Amount € 1,991,274 (EUR)
Funding ID 101004214 
Organisation European Commission 
Sector Public
Country European Union (EU)
Start 11/2020 
End 10/2023
 
Description UoM GCRF/Newton Consolidator Funding
Amount £19,000 (GBP)
Organisation University of Manchester 
Sector Academic/University
Country United Kingdom
Start 10/2022 
End 03/2023
 
Description Astronomy in Southeast Asia 
Organisation University Malaysia Sarawak (UNIMAS)
Country Malaysia 
Sector Academic/University 
PI Contribution We are providing UNIMAS and the Malaysian community with access to astronomical archives and with contacts in other countries. This has already lead to the invited presence of the Malaysian collaboration to a meeting in Thailand on development for astronomy in Southeast Asia. They have also made contacts with a medical physics group in Manchester
Collaborator Contribution The Malaysian partners bring knowledge on deep learning and classification of images. They have previously worked on medical images and now develop a system for classifying archival images.
Impact Multi-disciplinary, bringing together physics and computer science
Start Year 2019
 
Description Public outreach talk, Kuching, Malaysia 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Public/other audiences
Results and Impact Two talks were given at the Planetarium in Kutching, Borneo, Malaysia, 27 April 2019. The event was extensively covered in the Borneo Post on 25 April. The talk was open to the public and attracted a variety of people of all ages and backgrounds. A representative of the government also was present.
Year(s) Of Engagement Activity 2019
 
Description Training session on astronomical data and analysis 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact A 2-hour tutorial was given to 35 Malaysian students and academics in computer science, to show the computing systems developed to analyze the big data in astronomy. This was part of a meeting 'CAMP21' on 15-15 June 2021. The invitation wqs directly related to the current grant, and the interest it generated in the computer science community in Malaysia. Related to this, I was asked to evaluate student posters for the award of a prize.
Year(s) Of Engagement Activity 2021
URL https://camp21.pecamp.org
 
Description Workshop with young Malaysian academics 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Postgraduate students
Results and Impact Talk and discussion session with young academics and students at the Foundation studies of UNIMAS in Kutching, Malaysia, on 25 April 2019. Topic was on interdisciplinarity. Present: around 15 people, equally balanced in gender, from mathematics, engineering and education.
Year(s) Of Engagement Activity 2019