Google DeepMind has launched a transformative AI system known as AlphaGenome, a unified sequence-to-function model published in the journal Nature that aims to decipher the "source code of life". The team, led by Ziga Avsec, is moving past the limitations of earlier models like AlphaMissense, which only analyzed the 2% of the genome that codes for proteins, to explore the vast "dark matter" of the remaining 98% of non-coding DNA. This region is critical because it contains the complex instructions for how DNA is read and regulated; understanding it could provide answers for the large number of rare genetic diseases that currently go undiagnosed. DeepMind’s primary contribution lies in bridging a massive technical gap: the historical trade-off between analyzing long sequences of DNA and maintaining high-resolution detail. By developing a novel architecture that splices DNA into subsequences and processes them across multiple communicating TPUs (Tensor Processing Units), the team enabled the model to process megabase-length sequences at single-base resolution, effectively breaking previous computational limits.
Related article - Uphorial Shopify

The model’s versatility is its hallmark, as it integrates multiple biological modalities—including gene expression, splicing, and three-dimensional contact maps—into a single framework. Splicing, the process of joining non-contiguous genetic information to create functional proteins, is a frequent site of disease-causing mutations, making its inclusion essential for accurate variant prediction. Similarly, the model accounts for how DNA folds within a nucleus, a 3D structure vital for gene regulation. To handle the immense computational load and sparse data sets associated with these processes, DeepMind researchers implemented advanced data compression and parallelized evaluation strategies. This engineering rigor allowed the model to successfully recapitulate experimental "wet lab" findings, such as identifying cancer driver mutations with significantly higher accuracy than random controls.
Beyond the technical breakthrough, Google DeepMind has prioritized the democratization of this technology by releasing the model weights and a specialized API. This allows scientists to run predictions and visualize complex biological data in web-based notebooks without requiring localized high-performance hardware or complex software installations. These tools are designed to help the global scientific community prioritize harmful mutations and maximize the impact of limited research funding. Looking ahead, the team plans to incorporate single-cell atlases to observe how genetic defects manifest in specific cell types, further refining the ability to treat complex diseases at their molecular source. Through AlphaGenome, the team has not only provided a more powerful lens for viewing the genome but has established a unified platform for the next generation of genetic discovery.