
License: Evo2 is open source and free for academic and commercial use under an Apache-2.0 license. Please refer to the license for full terms.
This generator is open source. Any third-party models, product names, or trademarks referenced are the property of their respective owners, and Proto is not affiliated with them.

"autoregressive", indicating sequences
are generated token-by-token from left to right.
The number of tokens to generate is automatically calculated based on the
assigned segment’s sequence_length, prompt length, and prepend_prompt setting.
API Reference
Configuration object for Evo2Generator.This class defines configuration parameters for the Evo2 generator, which uses
a 7B parameter genomic language model to generate DNA sequences autoregressively
from prompt sequences.
All prompts must have identical lengths for batched generation. For detailed
information on Evo2 parameters, see: https://github.com/arcinstitute/evo2
Prompt sequences for DNA sequence generation (single prompt or multiple)
Evo2 model variant to load (currently only evo2_7b).Options:
evo2_7b, evo2_20b, evo2_40b, evo2_7b_base, evo2_40b_base, evo2_1b_base, evo2_7b_262k, evo2_7b_microviridaePath to local checkpoint weights for custom or finetuned models
GPU device to run Evo2 on (e.g. ‘cuda’ or ‘cuda:0’).
Limits sampling to the top-k most probable tokens at each generation step.
Nucleus sampling cutoff. Restricts to the smallest token set with cumulative prob ≥ top-p.
Sharpness of sampling. Below 1 favors high-probability tokens; above 1 increases diversity.
Optional number of tokens to prefill in parallel before switching to prompt forcing.
Optional maximum sequence length to generate. Determines the max size of the cache if larger.
Whether to stop at end-of-sequence token
Generate all prompts together in a single batched forward pass. Required for multiple prompts.
Number of sequences to process simultaneously on GPU
Whether to reuse KV-cache state across decoding steps to avoid recomputation.
Retain and expose the per-sequence KV-cache after generation so downstream callers can continue.
Whether to prepend prompt to generation
Whether to print verbose output
Usage
python
Metadata
| Property | Value |
|---|---|
| Key | evo2 |
| Class | Evo2Generator |
| Category | autoregressive |
| Input Type | prompt |
| Uses GPU | True |
| Supported Sequence Types | dna |
| Allows Empty Start | False |