SimpleFold: Folding Proteins is Simpler than You Think
Authors: Yuyang Wang, Jiarui Lu, Navdeep Jaitly, Josh Susskind, Miguel Angel Bautista
Protein folding models have achieved groundbreaking results since the introduction of AlphaFold2, and are typically built by integrating domain expertise into their architectural designs and training pipelines. Nonetheless, given the success of generative models across different but related problems, it is natural to question whether these architectural designs are necessary to build performant models. In this paper, we introduce SimpleFold, the first flow-matching based protein folding model that solely uses general-purpose transformer layers. Instead of relying on expensive modules like triangle attention or pair-representation biases, or on carefully crafted training objectives, SimpleFold employs standard transformer blocks with adaptive layers and is trained via a generative flow-matching objective. We scale SimpleFold to 3B parameters and train it on more than 8.6M distilled protein structures together with experimental PDB data. To the best of our knowledge, SimpleFold is the largest-scale folding model developed to date. On standard folding benchmarks, the SimpleFold-3B model achieves competitive performance compared to state-of-the-art baselines. Thanks to its generative training objective, SimpleFold also demonstrates strong performance in ensemble prediction. SimpleFold challenges the reliance on complex domain-specific architectural designs in folding, highlighting an alternative yet important avenue of progress in protein structure prediction.
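To make the training recipe concrete, below is a minimal sketch of a conditional flow-matching objective paired with standard transformer blocks that use adaptive (time-conditioned) layer norms, in the spirit of the approach described above. All module names, dimensions, and the toy data are hypothetical illustrations for exposition, not the actual SimpleFold implementation.

```python
# Hypothetical sketch: standard transformer blocks with adaptive layer norms,
# trained via conditional flow matching. Not the SimpleFold codebase.
import torch
import torch.nn as nn

class AdaptiveBlock(nn.Module):
    """Vanilla transformer block; adaLN scale/shift come from the time embedding."""
    def __init__(self, dim: int, heads: int = 8):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.mlp = nn.Sequential(nn.Linear(dim, 4 * dim), nn.GELU(),
                                 nn.Linear(4 * dim, dim))
        self.norm1 = nn.LayerNorm(dim, elementwise_affine=False)
        self.norm2 = nn.LayerNorm(dim, elementwise_affine=False)
        self.ada = nn.Linear(dim, 4 * dim)  # two (scale, shift) pairs from t

    def forward(self, x, t_emb):
        s1, b1, s2, b2 = self.ada(t_emb).unsqueeze(1).chunk(4, dim=-1)
        h = self.norm1(x) * (1 + s1) + b1
        x = x + self.attn(h, h, h, need_weights=False)[0]
        h = self.norm2(x) * (1 + s2) + b2
        return x + self.mlp(h)

class VelocityNet(nn.Module):
    """Predicts the flow velocity for per-residue 3D coordinates."""
    def __init__(self, dim: int = 256, depth: int = 4):
        super().__init__()
        self.in_proj = nn.Linear(3, dim)
        self.t_embed = nn.Sequential(nn.Linear(1, dim), nn.SiLU(),
                                     nn.Linear(dim, dim))
        self.blocks = nn.ModuleList(AdaptiveBlock(dim) for _ in range(depth))
        self.out_proj = nn.Linear(dim, 3)

    def forward(self, xt, t):
        h = self.in_proj(xt)
        t_emb = self.t_embed(t.unsqueeze(-1))
        for blk in self.blocks:
            h = blk(h, t_emb)
        return self.out_proj(h)

def flow_matching_loss(model, x1):
    """Regress the velocity of the straight path from noise x0 to structure x1."""
    x0 = torch.randn_like(x1)                      # noise endpoint
    t = torch.rand(x1.shape[0], device=x1.device)  # per-sample flow time in [0, 1]
    xt = torch.lerp(x0, x1, t.view(-1, 1, 1))      # x_t = (1 - t) x0 + t x1
    v_target = x1 - x0                             # constant velocity of that path
    return ((model(xt, t) - v_target) ** 2).mean()

# Toy usage: a batch of 2 "proteins", each 64 residues of 3D coordinates.
model = VelocityNet()
loss = flow_matching_loss(model, torch.randn(2, 64, 3))
loss.backward()
```

At sampling time, a velocity field of this form would be integrated from Gaussian noise toward a structure with an ODE solver, which is also what makes ensemble prediction natural: different noise draws yield different conformations. The point the abstract emphasizes is that nothing in such a stack requires triangle attention or pair-representation biases.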