Deep Learning for Classification of Astronomical Archives
Lead Research Organisation:
University of Manchester
Department Name: Physics and Astronomy
Abstract
Astronomy is known for producing highly appealing images which often get published and re-published world-wide. HST images, for instance, have appeared on web sites, news papers, and even stamps. Less well known is that these images are the tip of the ice berg. The large majority of images, even from HST, do not contain such pretty structures. Astronomers locate the news-worthy ones by manual inspection of all the data.
Modern astronomical instruments provide far more data than can even be manually inspected. ALMA, for instance, can produce over 10,000 images from a single observation. Each of these measures a slightly different wavelength of radiation, and many of the images will be empty or near-empty. But without looking at each one, how does the astronomer know which images to select? This project will look at techniques to let computers do the work. They can investigate all images much faster than people can, and can learn which ones are of interest by looking for characteristic patterns. The problem is that they need to be taught what is 'characteristic' something even astronomers may find difficult to put into an algorithm. Malaysian scientists have unparalleled expertise in image classification using deep learning techniques, and in semantic descriptions. This expertise will be brought to the astronomical archives, to develop techniques of computer learning to aid astronomers. Manchester is one of the access points to the ALMA archive. The combination of the data and astrophysical expertise in Manchester and image classification by deep learning in Malaysia will provide new and powerful tools for science.
Modern astronomical instruments provide far more data than can even be manually inspected. ALMA, for instance, can produce over 10,000 images from a single observation. Each of these measures a slightly different wavelength of radiation, and many of the images will be empty or near-empty. But without looking at each one, how does the astronomer know which images to select? This project will look at techniques to let computers do the work. They can investigate all images much faster than people can, and can learn which ones are of interest by looking for characteristic patterns. The problem is that they need to be taught what is 'characteristic' something even astronomers may find difficult to put into an algorithm. Malaysian scientists have unparalleled expertise in image classification using deep learning techniques, and in semantic descriptions. This expertise will be brought to the astronomical archives, to develop techniques of computer learning to aid astronomers. Manchester is one of the access points to the ALMA archive. The combination of the data and astrophysical expertise in Manchester and image classification by deep learning in Malaysia will provide new and powerful tools for science.
Planned Impact
Malaysia has set itself the goal to become a high income country. For this purpose, it is preparing for the so-called 'fourth industrial revolution', especially in the areas of computing, big data deep learning. STEM skills have been identified as a main priority. The proposed project brings together academics from Malaysia and the UK, develop new techniques and applications in deep learning, and to bring Malaysian academics into contact with the current developments in astrophysical data.
Astrophysical data is currently stored in archives which can reach pentabyte size. Identification of data of interest is based on external information: which target was observed, and which spectral line. This project will explore using internal information derived by deep learning algorithm from the data itself and from the text of original proposal. it builds on deep-learning expertise already present in Malaysia, related to image content and text mining, and combines it with the astronomical data to develop new techniques. This strongly supports the unique skills of the Malaysian academics, and will lead to project for graduate students. It will also provide a tool to gain access to astronomical archives, which can be used to bring astronomy into the class room. Astronomy is very much suited to hard-to-reach audiences and build interest in studying STEM topics.
Astrophysical data is currently stored in archives which can reach pentabyte size. Identification of data of interest is based on external information: which target was observed, and which spectral line. This project will explore using internal information derived by deep learning algorithm from the data itself and from the text of original proposal. it builds on deep-learning expertise already present in Malaysia, related to image content and text mining, and combines it with the astronomical data to develop new techniques. This strongly supports the unique skills of the Malaysian academics, and will lead to project for graduate students. It will also provide a tool to gain access to astronomical archives, which can be used to bring astronomy into the class room. Astronomy is very much suited to hard-to-reach audiences and build interest in studying STEM topics.
Publications
Awang Iskandar D
(2020)
Classification of Planetary Nebulae through Deep Transfer Learning
in Galaxies
Iskandar D
(2021)
Classification of Planetary Nebulae through Deep Transfer Learning
Sabin L
(2021)
First deep images catalogue of extended IPHAS PNe
Sabin L
(2021)
First deep images catalogue of extended IPHAS PNe
in Monthly Notices of the Royal Astronomical Society
Description | Machine learning was applied to astronomical images. The results (published as two academic papers) show that it was possible to confidently classify archival images containing planetary nebulae |
Exploitation Route | The goal was new ways to find and access data from astronomical archives, usable by groups with little previous experience with such data |
Sectors | Digital/Communication/Information Technologies (including Software) Education |
Description | (EXPLORE) - Innovative Scientific Data Exploration and Exploitation Applications for Space Sciences |
Amount | € 1,991,274 (EUR) |
Funding ID | 101004214 |
Organisation | European Commission |
Sector | Public |
Country | European Union (EU) |
Start | 11/2020 |
End | 10/2023 |
Description | UoM GCRF/Newton Consolidator Funding |
Amount | £19,000 (GBP) |
Organisation | University of Manchester |
Sector | Academic/University |
Country | United Kingdom |
Start | 09/2022 |
End | 03/2023 |
Description | Astronomy in Southeast Asia |
Organisation | University Malaysia Sarawak (UNIMAS) |
Country | Malaysia |
Sector | Academic/University |
PI Contribution | We are providing UNIMAS and the Malaysian community with access to astronomical archives and with contacts in other countries. This has already lead to the invited presence of the Malaysian collaboration to a meeting in Thailand on development for astronomy in Southeast Asia. They have also made contacts with a medical physics group in Manchester |
Collaborator Contribution | The Malaysian partners bring knowledge on deep learning and classification of images. They have previously worked on medical images and now develop a system for classifying archival images. |
Impact | Multi-disciplinary, bringing together physics and computer science |
Start Year | 2019 |
Description | Public outreach talk, Kuching, Malaysia |
Form Of Engagement Activity | A talk or presentation |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Public/other audiences |
Results and Impact | Two talks were given at the Planetarium in Kutching, Borneo, Malaysia, 27 April 2019. The event was extensively covered in the Borneo Post on 25 April. The talk was open to the public and attracted a variety of people of all ages and backgrounds. A representative of the government also was present. |
Year(s) Of Engagement Activity | 2019 |
Description | Training session on astronomical data and analysis |
Form Of Engagement Activity | Participation in an activity, workshop or similar |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Professional Practitioners |
Results and Impact | A 2-hour tutorial was given to 35 Malaysian students and academics in computer science, to show the computing systems developed to analyze the big data in astronomy. This was part of a meeting 'CAMP21' on 15-15 June 2021. The invitation wqs directly related to the current grant, and the interest it generated in the computer science community in Malaysia. Related to this, I was asked to evaluate student posters for the award of a prize. |
Year(s) Of Engagement Activity | 2021 |
URL | https://camp21.pecamp.org |
Description | Workshop with young Malaysian academics |
Form Of Engagement Activity | A talk or presentation |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Postgraduate students |
Results and Impact | Talk and discussion session with young academics and students at the Foundation studies of UNIMAS in Kutching, Malaysia, on 25 April 2019. Topic was on interdisciplinarity. Present: around 15 people, equally balanced in gender, from mathematics, engineering and education. |
Year(s) Of Engagement Activity | 2019 |