Part V: Cellular Context

Part V at a Glance

Central question: How do foundation model principles extend beyond one-dimensional sequence to embrace the full complexity of cellular and systems-level biology?

Prerequisites: Part IV (foundation model families). Parts II-III (7  Transformers and Attention, 8  Pretraining Strategies) for architectural background.

Chapter Topic Key Concepts
19  RNA Structure and Function RNA Models Secondary structure, splicing, RNA foundation models
20  Single-Cell Models Single-Cell Models Sparsity, cell type annotation, perturbation prediction
21  3D Genome Organization 3D Genome Chromatin loops, TADs, Hi-C prediction, spatial transcriptomics
22  Graph and Network Models Network Models GNNs, PPI networks, regulatory networks, pathway integration
23  Multi-Omics Integration Multi-Omics Integration Cross-modal learning, genotype-to-phenotype paths

After completing Part V, you will understand:

  • How RNA structure adds a second dimension to sequence modeling
  • Why single-cell data requires different architectural adaptations
  • How 3D genome organization connects distal regulatory elements
  • When graph neural networks complement sequence models
  • How to integrate multiple data modalities toward phenotype prediction

Biology operates across scales that sequence alone cannot capture. Cells of different types read the same genome differently, activating distinct gene programs that produce neurons, hepatocytes, and immune cells from identical DNA. Genes function not in isolation but within networks of regulation and interaction, where perturbing one node propagates effects throughout the system. The three-dimensional folding of chromatin brings distal elements into contact, creating regulatory logic invisible to models that treat genomes as one-dimensional strings. Sequence foundation models ask what a genome encodes; the models in this part ask what that sequence becomes in particular cellular contexts, interaction networks, and spatial architectures.

This transition from sequence-centric to systems-scale modeling demands new data modalities and new computational approaches. Single-cell transcriptomics reveals the cellular heterogeneity that bulk measurements average over. Hi-C and related methods expose the spatial organization that determines which enhancers contact which promoters. Protein interaction networks and gene regulatory graphs capture relational structure that no amount of sequence analysis can infer. Foundation model principles extend naturally to these modalities: learn representations from large-scale data, then transfer to specific prediction tasks.

RNA structure and function (19  RNA Structure and Function) extend sequence modeling beyond DNA, covering secondary structure prediction, splicing regulation, and the emerging frontier of RNA foundation models. Single-cell transcriptomics and epigenomics (20  Single-Cell Models) present distinct challenges of sparsity, noise, and scale that transformer architectures must adapt to capture. Three-dimensional genome organization (21  3D Genome Organization) adds spatial context, with models predicting chromatin contacts from sequence and connecting spatial structure to gene regulation. Graph neural networks (22  Graph and Network Models) represent biological entities and their interactions as structured graphs, integrating foundation model embeddings with relational reasoning. Multi-omics integration (23  Multi-Omics Integration) broadens the view further, jointly representing genomic, transcriptomic, proteomic, and clinical information to trace the path from genotype to phenotype.

Connections to Other Parts
  • Part IV provides the sequence-level foundation models that feed into systems-scale approaches
  • Parts II-III (7  Transformers and Attention) introduce the attention mechanisms adapted for these new modalities
  • Part VI evaluation principles apply to multi-modal models with additional complexity
  • Part VII clinical applications increasingly depend on systems-level integration