<?xml version="1.0" encoding="UTF-8"?><ns2:project xmlns:ns1="http://gtr.rcuk.ac.uk/gtr/api" xmlns:ns2="http://gtr.rcuk.ac.uk/gtr/api/project" xmlns:ns3="http://gtr.rcuk.ac.uk/gtr/api/fund" xmlns:ns4="http://gtr.rcuk.ac.uk/gtr/api/person" xmlns:ns5="http://gtr.rcuk.ac.uk/gtr/api/project/outcome" xmlns:ns6="http://gtr.rcuk.ac.uk/gtr/api/organisation" ns1:created="2026-06-03T15:52:43Z" ns1:href="http://gtr.ukri.org/gtr/api/projects/FC922306-2DBB-4DA4-9558-C84DC365DF14" ns1:id="FC922306-2DBB-4DA4-9558-C84DC365DF14"><ns1:links><ns1:link ns1:href="http://gtr.ukri.org/gtr/api/persons/47F607AB-D506-45BE-838F-1C2885C50DC9" ns1:rel="PM_PER"/><ns1:link ns1:href="http://gtr.ukri.org/gtr/api/organisations/1957B636-B7EF-439E-BF5B-44D19352EBA4" ns1:rel="LEAD_ORG"/><ns1:link ns1:href="http://gtr.ukri.org/gtr/api/organisations/1957B636-B7EF-439E-BF5B-44D19352EBA4" ns1:rel="PARTICIPANT_ORG"/><ns1:link ns1:end="2023-11-30T00:00:00Z" ns1:href="http://gtr.ukri.org/gtr/api/funds/B77BD5FB-FF1C-4FBE-8CAB-9D3C861503A1" ns1:rel="FUND" ns1:start="2023-05-31T23:00:00Z"/></ns1:links><ns2:identifiers><ns2:identifier ns2:type="RCUK">10075813</ns2:identifier></ns2:identifiers><ns2:title>Hybrid Quantum Transformer Architectures in Genomics</ns2:title><ns2:status>Closed</ns2:status><ns2:grantCategory>Grant for R&amp;D</ns2:grantCategory><ns2:leadFunder>Innovate UK</ns2:leadFunder><ns2:abstractText>Transformers and large language models have shown great potential in processing language-based information. Their strenth in connecting large and complex date sets makes them superior to conventional neural networks. Recently it has been shown that transformers (Google's BERT model) can also be applied to genomics datasets (DNA BERT). The majority of DNA (about 98%) is non-coding (ncDNA) and merely provides instructions on how to read the 2% of coding DNA that is first transcribed into mRNA and then translated into proteins. However, many complex medical conditions and also mental health conditions are associated with mutations within the ncDNA.

The non-coding DNA (ncDNA) is statistically very similiar to human languages. This has been exploited with the DNA BERT. It has already been shown that the final transformer layer of the convential BERT model can be replaced with a variational quantum circuit to undertake classification tasks.

The aim of our project is to explore various quantum transformer hybrids models and benchmark them against their classical counterparts. We will also investigate more efficient ways of encoding genomic data into the quantum state vector and may consider geometric principles, different state vector encoding and compression methods in this endeavour. The goal is to make predictions on functional regions within the ncDNA that have relevance for medical conditions.

Genomics is a rapidly growing market segment and is essential in future drug discovery. Whilst ncDNA is a more challenging target for novel therapeutics, research has shown that selective molecules exist that can bind to these regions and block these regions. Our intention is to create a quantum computational platform that supports the identification of disease-relevant functional regions within ncDNA sequences.</ns2:abstractText></ns2:project>