RFdiffusion3

License: RFdiffusion3 is open source and free for academic and commercial use under a BSD-3-Clause license. Please refer to the license for full terms.

Proto is not affiliated with Institute for Protein Design. This toolkit is open source and builds on the implementation produced by this organization. Product names, logos, and trademarks are the property of their respective owners.

GitHub 757 GitHub 757 Preprint Preprint Cite Cite Tool Source Tool Source Open as Notebook Open as Notebook Open on Proto Open on Proto

RosettaCommons/foundry

Central repository for biomolecular foundation models with shared trainers and pipeline components

757 stars

View repo

De novo Design of All-atom Biomolecular Interactions with RFdiffusion3

Jasper Butcher, Rohith Krishna, … David Baker

bioRxiv (2025)

Read preprint

@article{butcher2025rfdiffusion3,
  title={De novo Design of All-atom Biomolecular Interactions with RFdiffusion3},
  author={Butcher, Jasper and Krishna, Rohith and Mitra, Raktim and Brent, Rafael Isaac and Li, Yanjing and Corley, Nathaniel and Kim, Paul T and Funk, Jonathan and Mathis, Simon Valentin and Salike, Saman and Muraishi, Aiko and Eisenach, Helen and Thompson, Tuscan Rock and Chen, Jie and Politanska, Yuliya and Sehgal, Enisha and Coventry, Brian and Zhang, Odin and Qiang, Bo and Didi, Kieran and Kazman, Maxwell and DiMaio, Frank and Baker, David},
  journal={bioRxiv},
  year={2025},
  doi={10.1101/2025.09.18.676967},
  url={https://www.biorxiv.org/content/10.1101/2025.09.18.676967},
  publisher={Cold Spring Harbor Laboratory}
}

Copy citation

proto-bio/proto-tools/proto_tools/tools/structure_design/rfdiffusion3

View source

Open Notebook

Open notebook

Coming soon!

Run this tool directly in Proto with no setup required.

Function	Description
`run_rfdiffusion3()`	De novo protein structure design using RFdiffusion3 (GPU)	Docs Source

Background

RFdiffusion3 (Butcher et al., 2025) is a denoising-diffusion generative model trained to design protein structures at all-atom resolution under arbitrary spatial constraints. Starting from random noise, it iteratively denoises atomic coordinates toward a plausible protein while jointly refining the underlying amino-acid sequence. Training combines structures from the Protein Data Bank with multi-task conditioning, in which each training example is presented with a randomly generated design problem that constrains a sampled combination of motif tokens, atom subsets, residue identities, or sequence-index labels. The model is therefore trained jointly on binder design, motif scaffolding, inverse folding, sidechain placement, and prediction-style tasks under a single objective, and a single trained checkpoint supports every conditioning style at inference. RFdiffusion3 is the successor to RFdiffusion (Watson et al., 2023), which diffused only over the backbone N, Cα, C, and O atoms and required ProteinMPNN as a separate sequence-design step. By denoising every atom and co-designing the sequence, RFdiffusion3 incorporates small-molecule pockets, hydrogen-bond donor and acceptor patterning, and explicit nucleotide and ligand context directly into the generative process. It is the structure-design model within the Foundry framework, which distributes it alongside RoseTTAFold3 for structure prediction and ProteinMPNN for inverse folding.

Tools

RFdiffusion3 Structure Design (`rfdiffusion3-design`)

Generates new protein structures and sequences subject to specified constraints. Each design task is described by an RFdiffusion3DesignSpec containing an optional input structure, a contig string, and per-residue selectors that fix atomic coordinates, constrain sequence positions, or designate hotspot residues. The diffusion sampler returns N candidate structures per specification, each accompanied by its designed amino-acid sequence and the sampled contig.

API Reference

Source

Input: RFdiffusion3Input

design_specs

List[RFdiffusion3DesignSpec]

List of design specifications. Each spec represents an independent design task with its own constraints. Multiple specs will be processed in a single run.

Show RFdiffusion3DesignSpec

input_structure

Structure

Input structure (Structure or PDB/CIF file path) for motif/binder design; written to a file before rfd3 (reads a path). Omit for de novo.

contig

string

Contig string specifying the design topology.

length

string

Per-asymmetric-unit length (int or ‘min-max’); per-protomer under symmetry

ligand

string

Ligand selection by residue name (e.g., ‘HAX,OAA’)

unindex

string | Dict[string, string]

Unindexed motif components for flexible positioning (e.g., ‘A244,A274,A320’)

select_fixed_atoms