Beware of scams: Government officials will never ask you to transfer money or disclose bank log-in details over a phone call.

You can also contact A*STAR via our online form or call the 24/7 ScamShield Helpline at 1799 if you are unsure if something is a scam.

Banner

A*STAR Genome Institute of Singapore (A*STAR GIS)

A*STAR Scientists Develop Method to Decode New DNA ‘Letters’ that Could Transform Medicine and Biotechnology

(L to R) A*STAR researchers Ichiro Hirao, Hui Pen Tan, Michiko Kimoto, Mauricio Perez, Niranjan Nagarajan and Rafael Peres da Silva developed a new way to quickly and accurately read DNA with extra “letters,” unlocking new possibilities in medicine and biotechnology.

  • A*STAR scientists have developed a new method to directly and efficiently read DNA containing non-standard “letters” (bases) using AI and nanopore sequencing.
  • Until now, DNA sequencers (reading machines) could only recognise the four natural bases (A, T, C, G). This breakthrough removes that limit, enabling scientists to explore an expanded genetic code.
  • This advancement could accelerate drug discovery, improve disease treatment, and enable new approaches to data and information storage.

A research team led by the A*STAR Genome Institute of Singapore (A*STAR GIS) have developed a method to accurately and efficiently read DNA containing non-standard bases — a task once thought too complex for conventional DNA sequencers. Their work, published in Nature Communications, combines nanopore sequencing with artificial intelligence (AI) to decode these extra “letters” at high speed and accuracy.|

The Challenge: Decoding DNA, Nature’s Hidden Language

DNA is nature’s instruction manual, built from four standard “letters” or bases A, T, C, and G. Scientists have long imagined expanding this genetic alphabet by adding new “letters”, known as non-canonical bases (NCBs). These NCBs can occur naturally in some viruses or be created in the lab, and they have the potential to unlock new ways of designing molecules, materials, and biological systems.

However, DNA sequencing machines were built to recognise only the standard four bases. Because they struggle to detect or decode new ones, scientists have been unable to fully harness their potential to develop more precise medicines, engineer artificial genomes for sustainable chemical production, and design programmable materials and nanoscale devices for future technologies.

“Our ability to quickly read a piece of text relies largely on how familiar we are with the vocabulary used,” said Dr Mauricio Lisboa Perez, Scientist at A*STAR GIS and first author of the study. “Similarly, for an AI model to ‘speed-read’ DNA, it must have seen enough examples of every base. Non-canonical bases are rare and harder to produce, so we had to design creative ways to generate sufficient examples for our AI model to learn from.”

The Solution: Using AI to Translate DNA with Non-Standard Bases

The team created a large library of artificial DNA containing both standard and non-standard bases in different combinations, then used nanopore sequencing to record the unique electrical signals produced as each base passed through microscopic pores. Because the data were often noisy and incomplete, the researchers developed an AI-driven approach that could learn and improve iteratively, refining its predictions over time. They also enhanced the AI model’s learning by creatively rearranging existing signal data to represent more combinations. This adaptive method enabled the AI to accurately recognise each base’s distinct pattern, allowing the sequencer to read new DNA “letters” directly.

While other research groups have explored similar challenges, this study is the first to demonstrate a DNA sequencer that can directly and reliably identify non-standard bases at scale using AI, overcoming key bottlenecks such as limited training data.

“Being able to accurately identify these new bases at scale gives us a much richer vocabulary for writing and reading biological information,” said Dr Niranjan Nagarajan, Associate Director, AI & Compute at A*STAR GIS and senior author of the study. “It’s like learning to recognise new letters, which allows us to understand many more words and meanings in the language of life.”

Transformative Potential of the Method

This breakthrough could drive innovation across multiple fields:

Healthcare and Therapeutics: Accurately reading and analysing non-standard bases removes a major bottleneck in developing DNA- and RNA-based treatments, paving the way for new drugs and diagnostics.

Advanced Materials and Biotechnology: Non-standard bases could serve as new building blocks for nanostructures and nanorobots, leading to breakthroughs in medicine, manufacturing, and sustainable chemical production.

Data and Information Storage: Encoding information using expanded DNA alphabets could make data storage more affordable and energy-efficient, potentially reducing the environmental footprint of data centres.

The researchers plan to extend their work to discover more non-standard bases in viruses and enhance the AI model’s ability to detect them.

“We are excited about this new DNA sequencing method and the possibilities it brings,” said Dr Wan Yue, Executive Director at A*STAR GIS. “Working with an expanded DNA alphabet will create more opportunities for scientists to develop new therapeutics, novel organisms that produce chemicals environmentally, and new programmable materials for nanostructures and nanorobots. These innovations can advance scientific discovery, create economic value, and ultimately improve lives.

 

###

About A*STAR Genome Institute of Singapore (A*STAR GIS)

The A*STAR Genome Institute of Singapore (GIS) is an institute of the Agency for Science, Technology and Research (A*STAR). It has a global vision that seeks to use genomic sciences to achieve extraordinary improvements in human health and public prosperity. Established in 2000 as a centre for genomic discovery, A*STAR GIS pursues the integration of technology, genetics, and biology towards academic, economic and societal impact, with a mission to harness genomic technologies for a better Singapore and world.

Key research areas at the GIS include RNA & DNA Technologies, Single Cell & Spatial Technologies), Precision Medicine, Population Genomics, and AI & Compute. The genomics infrastructure at the GIS is also utilised to train new scientific talent, to function as a bridge for academic and industrial research, and to explore scientific questions of high impact.

For more information about GIS, please visit www.a-star.edu.sg/gis.

Follow us on
LinkedIn
| BlueSky | X | YouTube

 

About the Agency for Science, Technology and Research (A*STAR)

The Agency for Science, Technology and Research (A*STAR) is Singapore's lead public sector R&D agency. Through open innovation, we collaborate with our partners in both the public and private sectors to benefit the economy and society. As a Science and Technology Organisation, A*STAR bridges the gap between academia and industry. Our research creates economic growth and jobs for Singapore, and enhances lives by improving societal outcomes in healthcare, urban living, and sustainability. A*STAR plays a key role in nurturing scientific talent and leaders for the wider research community and industry. A*STAR’s R&D activities span biomedical sciences to physical sciences and engineering, with research entities primarily located in Biopolis and Fusionopolis. For ongoing news, visit www.a-star.edu.sg.

Follow us on
Facebook
| LinkedIn | Instagram | YouTube | TikTok


Upcoming Events

18

Mar

GIS Seminar by Dr. Angli Xue and Dr. Zhen Qiao

GIS Seminar Room (Level 2)


Follow GIS at:

bluesky