Trend AnalysisOther Sciences

Xenobiology and Expanded Genetic Alphabets: Six-Letter DNA and Beyond

Natural DNA uses four nucleotide letters (A, T, G, C). Xenobiologists have expanded this alphabet to six or eight letters, creating DNA with increased information density and novel three-dimensional structures. These expanded genetic systems enable new biotechnologies and test the universality of life's chemistry.

By Sean K.S. Shin
This blog summarizes research trends based on published paper abstracts. Specific numbers or findings may contain inaccuracies. For scholarly rigor, always consult the original papers cited in each post.

All known life on Earth uses the same four-letter genetic alphabet: adenine (A), thymine (T), guanine (G), and cytosine (C), paired as A-T and G-C. This universality suggests either a frozen accident of life's origin or an optimal solution constrained by chemistry. Xenobiology---the study and creation of alternative biological systems---tests these boundaries by designing synthetic nucleotides that pair orthogonally to natural DNA.

The Artificially Expanded Genetic Information System (AEGIS) adds new base pairs to DNA, creating six-letter (and even eight-letter) genetic alphabets. These expanded systems are not merely curiosities: they increase DNA's information density, enable novel diagnostic tools, and create semi-synthetic organisms that incorporate unnatural bases into their genome.

Why It Matters

Expanding the genetic alphabet has immediate practical applications in diagnostics (AEGIS-based tests for SARS-CoV-2 were deployed during COVID-19), therapeutics (semi-synthetic organisms producing proteins with unnatural amino acids), nanotechnology (DNA structures with enhanced complexity), and fundamental biology (testing whether alternative genetic systems can support Darwinian evolution).

The Research Landscape

Novel 3D Folding Motifs

Wang and Hoshika (2024), with 8 citations in Nature Chemistry, report that adding a synthetic nucleotide (dZ, with a nitro-aminopyridone base) to DNA enables three-dimensional folding motifs impossible with four-letter DNA. This demonstrates that expanding the genetic alphabet does not merely increase linear information density but qualitatively expands the diversity of 3D structures DNA can form---with implications for aptamer selection and molecular engineering.

Six-Letter DNA Nanotechnology

Vecchioni and Hernandez (2024), with 6 citations in Nano Letters, incorporate Z-P base pairs into self-assembling 3D DNA crystals. DNA nanotechnology---building nanoscale structures from programmable DNA tiles---has been limited to four-letter design rules. Six-letter DNA provides additional orthogonal base-pairing interactions, enabling more complex and precisely controlled nanostructures.

Monitoring Expanded Bases in Complex DNA

Wang and Wang (2025) develop methods for monitoring expanded genetic letters (TPT3-NaM pair) in complex DNA contexts. As expanded genetic systems move from controlled laboratory settings to semi-synthetic organisms, tools for tracking unnatural bases within the background of natural DNA are essential for quality control and safety monitoring.

Selectivity Without Polymerase

Noori and Bofill (2025) use quantum chemical calculations to show that pi-pi stacking determines the selectivity of unnatural base pairs even without polymerase enzymes. This finding is significant: it means that the selectivity of expanded base pairs is an intrinsic chemical property, not solely dependent on polymerase recognition---making these systems more robust for applications outside the cell.

Expanded Genetic Alphabet Systems

<
SystemLettersNew PairsKey ApplicationDeveloper
AEGIS (Hachimoji)8Z-P, S-BDiagnostics, aptamersBenner lab
UBP (NaM-TPT3)6NaM-TPT3Semi-synthetic organismsRomesberg lab
ds-Pa6Ds-PaIn vitro evolutionHirao lab
isoG-isoC6isoG-isoCProof of conceptEarly system

What To Watch

The creation of fully autonomous semi-synthetic organisms that replicate expanded genetic alphabets without supplementation of unnatural nucleotides from external sources would be a landmark achievement. Current semi-synthetic organisms require feeding of synthetic nucleotides. An organism that biosynthesizes its own unnatural bases would represent a truly orthogonal genetic system---with profound implications for biosafety, biotechnology, and our understanding of life's chemical boundaries.

References (8)

[1] Wang, B., Rocca, J., & Hoshika, S. (2024). A folding motif with an expanded genetic alphabet. Nature Chemistry.
[2] Vecchioni, S., Ohayon, Y. P., & Hernandez, C. (2024). Six-Letter DNA Self-Assembling 3D Crystals. Nano Letters.
[3] Wang, H., Li, S., & Wang, C. (2025). Monitoring Expanded Genetic Letters in Complex DNA. Current Protocols.
[4] Noori, Z., Bermejo, A., & Bofill, J. (2025). Pi-Pi Stacking and Unnatural Base Pair Selectivity. ACS Physical Chemistry Au.
Wang, B., Rocca, J. R., Hoshika, S., Chen, C., Yang, Z., Esmaeeli, R., et al. (2024). A folding motif formed with an expanded genetic alphabet. Nature Chemistry, 16(10), 1715-1722.
Vecchioni, S., Ohayon, Y. P., Hernandez, C., Hoshika, S., Mao, C., Benner, S. A., et al. (2024). Six-Letter DNA Nanotechnology: Incorporation of Z-P Base Pairs into Self-Assembling 3D Crystals. Nano Letters, 24(45), 14302-14306.
Wang, H., Li, S., Wang, C., Zhu, A., & Li, L. (2025). Advancing Applications of the Expanded Genetic Alphabet: Monitoring Expanded Genetic Letters in Complex DNA Context Via a Bridge‐Base Approach. Current Protocols, 5(11).
Noori, Z., Bermejo, A., Bofill, J. M., & Poater, J. (2026). π–π Stacking Determines the Selectivity of Unnatural DNA Base Pairs Even without Polymerase. ACS Physical Chemistry Au, 6(1), 153-162.

Explore this topic deeper

Search 290M+ papers, detect research gaps, and find what hasn't been studied yet.

Click to remove unwanted keywords

Search 7 keywords β†’