
This constraint is open source. Any third-party models, product names, or trademarks referenced are the property of their respective owners, and Proto is not affiliated with them.
- Frequency mode: Raw k-mer frequency (count / total k-mer positions).
- Usage deviation mode: Observed/expected ratio using a zero-order Markov model (product of individual character frequencies).
Metadata varies by mode:Frequency mode:
{kmer}_frequency: Float frequency value
{kmer}_usage_deviation: Float observed/expected ratio{kmer}_count: Integer observed count{kmer}_expected: Float expected count
API Reference
Configuration for evaluating a single specific k-mer.For evaluating all k-mers of a given length, use KmerFrequencyConstraint.
The specific k-mer to evaluate (e.g., ‘CG’, ‘GATC’, ‘ATG’)
Scoring mode: ‘frequency’ for raw counts, ‘usage_deviation’ for observed/expected ratioOptions:
frequency, usage_deviationMinimum acceptable frequency/deviation based on scoring_mode
Maximum acceptable frequency/deviation based on scoring_mode
ReturnsConstraintOutput
One result per input sequence. The metadata field
carries per-mode k-mer data (see Note).Usage
python
Metadata
| Property | Value |
|---|---|
| Key | specific-kmer-frequency |
| Function | specific_kmer_constraint |
| Category | sequence_composition |
| Mode | discrete |
| Uses GPU | False |
| Supported Types | dna, rna, protein |