Proto is not affiliated with Google DeepMind. This toolkit is open source and builds on the implementation produced by this organization. Product names, logos, and trademarks are the property of their respective owners.
Background
AlphaFold3 (Abramson et al., 2024) predicts the joint 3D structure of a biomolecular assembly from the sequences and chemical components it contains. It extends AlphaFold2 beyond single proteins: one model folds complexes that mix proteins, DNA, RNA, and small-molecule ligands, and predicts how those parts are arranged relative to one another. As in AlphaFold2, each protein chain is paired with a multiple-sequence alignment (MSA) of related sequences, whose covariation patterns give the model an evolutionary signal for placing residues. Internally, AlphaFold3 represents the assembly as a set of tokens: one per amino-acid residue or nucleotide, and one per atom for ligands and modified residues. It then learns a representation of every token and of every token pair. Where AlphaFold2 leaned on the large MSA-centric Evoformer, AlphaFold3 de-emphasizes the MSA, handling it in a separate preliminary module rather than iterating it through the deep trunk, and does most of its work in the ‘Pairformer’, which iteratively refines the token and pair representations through geometry-inspired “triangle attention” updates. The final representations are then fed into a diffusion module that iteratively denoises all-atom coordinates starting from random noise. Run from several random seeds, it produces multiple candidate structures, and the highest-confidence candidate is returned as the final prediction. In addition, AlphaFold3 reports calibrated confidence metrics such as the per-atom predicted local distance difference test (pLDDT) for local reliability, a predicted aligned error (PAE) for how well any two tokens are placed relative to each other, and predicted template-modeling (pTM) and interface predicted template-modeling (ipTM) scores for overall and interface accuracy.Learning Resources
- The Illustrated AlphaFold (by Elana Simon and Jake Silberg) - a visual, diagram-driven walkthrough of the AlphaFold3 architecture, from input preparation through representation learning to structure prediction.
- AlphaFold 3 predicts the structure and interactions of all of life’s molecules (Google DeepMind and Isomorphic Labs) - the official announcement, with an accessible overview of what AlphaFold3 predicts and how it extends earlier models.
Tools
AlphaFold3 Structure Prediction (alphafold3-prediction)
Predicts the 3D structure of a biomolecular complex. Each input complex can combine protein, DNA, RNA, and ligand chains; the assembly is folded by AlphaFold3 and returned as a predicted Structure per complex with confidence metrics: per-residue pLDDT, pTM, interface pTM for multi-chain complexes, and predicted aligned error.API Reference
Input: AlphaFold3Input
Input: AlphaFold3Input
StructurePredictionInput. Each complex can contain one or more sequences of proteins, DNA, RNA, or ligands.ComplexMSAs (per-chain MSAs keyed by chain index); paired=True marks rows taxonomy-aligned across chains. Populated by preprocess() or supplied directly.Config: AlphaFold3Config
Config: AlphaFold3Config
"af3_job".BaseConfig.seed field is unset. Default: [0]. Note: AlphaFold3 will do five diffusion samples per seed, so this often can be set to a single seed. More seeds are required for complex docking tasks, such as antibody-antigen docking._af3_results to the provided string. If None (default), uses a temporary directory that is automatically cleaned up after inference. If specified, creates a persistent directory at the given path that will NOT be automatically deleted. Default: None..bin or .bin.zst file per DeepMind’s release layout). If None (default), weights are resolved from PROTO_ALPHAFOLD3_WEIGHTS_DIR, then PROTO_MODEL_CACHE, then PROTO_HOME/proto_model_cache/alphafold3/ (see notes/storage.md)..sif). When set, the tool runs apptainer run against this image (which dispatches via the sif’s %runscript) instead of the in-env Python install. When None (default), inference.py looks for $VENV_PATH/alphafold3.sif (provisioned by setup.sh) and falls back to the env-based install if absent.StructurePredictionConfig. Default: False."cuda", "cpu"). Inherited from StructurePredictionConfig. Default: "cuda".None waits indefinitely.BaseToolOutput.approx_equal), and the seed participates in cache keys. When None, cacheable seed-sensitive tools skip cache until seeded.False.MSAStructurePredictionConfig. Default: True.use_msa=True. Inherited from MSAStructurePredictionConfig. Default: None.MSAStructurePredictionConfig. Default: True.Output: AlphaFold3Output
Output: AlphaFold3Output
AlphaFold3Metrics instance on .metrics.structures item)| Metric | Type | Range | Availability |
|---|---|---|---|
avg_plddt | float | 0.0 to 100.0 | always |
avg_pae | float | ≥ 0.0 | always |
pae | list[list[float]] | ≥ 0.0 | when include_pae_matrix=True |
ptm | float | 0.0 to 1.0 | depends on model output |
iptm | float | 0.0 to 1.0 | depends on model output |
ranking_score | float | unbounded | depends on model output |
Applications
This tool predicts the structure of multi-component assemblies such as protein-DNA and protein-RNA complexes or protein-ligand binding poses. Running it on a multi-chain complex also estimates how confidently the components are placed relative to each other through interface pTM and PAE, which is informative for assessing predicted interfaces.Usage Tips
use_msadefaults toTrue. An MSA is then generated by a ColabFold search for protein chains; set itFalseto skip the search, or attach precomputed MSAs to the input.- Diffusion sampling is controlled by
seedsandnum_diffusion_samples. AlphaFold3 drawsnum_diffusion_samples(default5) structures per seed and keeps the best by ranking score, so a single seed is often enough; the total number of candidates islen(seeds)timesnum_diffusion_samples. num_recycles(default10) trades accuracy for time. More recycling iterations refine the prediction but increase runtime.- Confidence is reported as pLDDT, pTM, ipTM, and PAE. Average pLDDT (0 to 1) is the primary per-structure quality metric; ipTM is populated only for multi-chain complexes.
Toolkit Notes
These apply to every AlphaFold3 tool in this toolkit (alphafold3-prediction).
- Requires a GPU. AlphaFold3 needs an NVIDIA GPU; CPU execution is not practical.
- Model weights are gated. AlphaFold3 weights are not publicly distributed; access is restricted to non-commercial research and must be requested from Google DeepMind through their form, then made available to the tool before it can run.

Google DeepMind