Configuration System#

The TACTICS package uses Pydantic v2 for robust configuration management with automatic validation and type checking. The configuration system uses SynthesisPipeline as the single source of truth for reactions and reagents.

Configuration Hierarchy#

Quick Start#

Modern approach (recommended):

Using nested Pydantic configs with SynthesisPipeline#

from TACTICS.library_enumeration import SynthesisPipeline
from TACTICS.library_enumeration.smarts_toolkit import ReactionConfig, ReactionDef
from TACTICS.thompson_sampling.config import ThompsonSamplingConfig
from TACTICS.thompson_sampling.strategies.config import RouletteWheelConfig
from TACTICS.thompson_sampling.warmup.config import BalancedWarmupConfig
from TACTICS.thompson_sampling.core.evaluator_config import LookupEvaluatorConfig

# Create synthesis pipeline (single source of truth)
rxn_config = ReactionConfig(
    reactions=[ReactionDef(
        reaction_smarts="[C:1](=O)[OH].[NH2:2]>>[C:1](=O)[NH:2]",
        step_index=0
    )],
    reagent_file_list=["acids.smi", "amines.smi"]
)
pipeline = SynthesisPipeline(rxn_config)

# Create Thompson Sampling config
config = ThompsonSamplingConfig(
    synthesis_pipeline=pipeline,
    num_ts_iterations=1000,
    strategy_config=RouletteWheelConfig(mode="maximize", alpha=0.1),
    warmup_config=BalancedWarmupConfig(observations_per_reagent=3),
    evaluator_config=LookupEvaluatorConfig(ref_filename="scores.csv"),
    batch_size=10
)

Using presets:

Simplified preset-based configuration#

from TACTICS.library_enumeration import SynthesisPipeline
from TACTICS.library_enumeration.smarts_toolkit import ReactionConfig, ReactionDef
from TACTICS.thompson_sampling import get_preset
from TACTICS.thompson_sampling.core.evaluator_config import LookupEvaluatorConfig

# Create synthesis pipeline
rxn_config = ReactionConfig(
    reactions=[ReactionDef(
        reaction_smarts="[C:1](=O)[OH].[NH2:2]>>[C:1](=O)[NH:2]",
        step_index=0
    )],
    reagent_file_list=["acids.smi", "amines.smi"]
)
pipeline = SynthesisPipeline(rxn_config)

# Get preset configuration
config = get_preset(
    "fast_exploration",
    synthesis_pipeline=pipeline,
    evaluator_config=LookupEvaluatorConfig(ref_filename="scores.csv"),
    num_iterations=1000
)

ThompsonSamplingConfig#

The main configuration class for Thompson Sampling optimization.

Dependencies

Accepts these nested config objects:

SynthesisPipeline - via synthesis_pipeline (required)
Strategy Configs - via strategy_config
Warmup Configs - via warmup_config
Evaluator Configs - via evaluator_config

Core Parameters#
Parameter	Type	Required	Description
`synthesis_pipeline`	`SynthesisPipeline`	Yes	Single source of truth for reactions and reagents.
`num_ts_iterations`	`int`	Yes	Maximum sampling cycles.
`num_warmup_trials`	`int`	No	Warmup trials per reagent. Default: 3.

Component Configs (Modern)#
Parameter	Type	Required	Description
`strategy_config`	`StrategyConfig`	Yes*	Selection strategy configuration.
`warmup_config`	`WarmupConfig`	No	Warmup strategy. Default: BalancedWarmupConfig.
`evaluator_config`	`EvaluatorConfig`	Yes*	Evaluator configuration.

Batch & Performance#
Parameter	Type	Required	Description
`batch_size`	`int`	No	Compounds to sample per cycle. Default: 1.
`processes`	`int`	No	CPU cores for parallel evaluation. Default: 1.
`min_cpds_per_core`	`int`	No	Min compounds per core before batch evaluation. Default: 10.
`max_resamples`	`int`	No	Stop after this many consecutive duplicates.

Output & Advanced#
Parameter	Type	Required	Description
`results_filename`	`str`	No	Output CSV path. Default: `"results.csv"`.
`log_filename`	`str`	No	Log file path.
`hide_progress`	`bool`	No	Hide progress bars. Default: False.
`use_boltzmann_weighting`	`bool`	No	Legacy RWS Boltzmann updates. Default: False.
`auto_detect_smarts_compatibility`	`bool`	No	Auto-detect reagent SMARTS compatibility. Default: False.
`deprotect_for_compatibility`	`bool`	No	Apply deprotection during detection. Default: False.
`desalt_for_compatibility`	`bool`	No	Apply desalting during detection. Default: False.

Strategy Configurations#

All strategy configs are used with ThompsonSamplingConfig.strategy_config.

GreedyConfig#

Configuration for greedy (argmax/argmin) selection.

Creates: GreedySelection

Parameters#
Parameter	Type	Required	Description
`strategy_type`	`Literal["greedy"]`	Auto	Set automatically.
`mode`	`str`	No	`"maximize"` or `"minimize"`. Default: `"maximize"`.

RouletteWheelConfig#

Configuration for roulette wheel selection with thermal cycling and CATS.

Creates: RouletteWheelSelection

Parameters#
Parameter	Type	Required	Description
`strategy_type`	`Literal`	Auto	Set automatically.
`mode`	`str`	No	`"maximize"`, `"minimize"`, or Boltzmann variants.
`alpha`	`float`	No	Base temperature for heated component. Default: 0.1.
`beta`	`float`	No	Base temperature for cooled components. Default: 0.05.
`exploration_phase_end`	`float`	No	Fraction before CATS starts [0, 1]. Default: 0.20.
`transition_phase_end`	`float`	No	Fraction when CATS fully applied [0, 1]. Default: 0.60.
`min_observations`	`int`	No	Min observations for criticality trust. Default: 5.

UCBConfig#

Configuration for Upper Confidence Bound selection.

Creates: UCBSelection

Parameters#
Parameter	Type	Required	Description
`strategy_type`	`Literal["ucb"]`	Auto	Set automatically.
`mode`	`str`	No	`"maximize"` or `"minimize"`. Default: `"maximize"`.
`c`	`float`	No	Exploration parameter. Higher = more exploration. Default: 2.0.

EpsilonGreedyConfig#

Configuration for epsilon-greedy selection with decay.

Creates: EpsilonGreedySelection

Parameters#
Parameter	Type	Required	Description
`strategy_type`	`Literal`	Auto	Set automatically.
`mode`	`str`	No	`"maximize"` or `"minimize"`. Default: `"maximize"`.
`epsilon`	`float`	No	Initial exploration probability [0, 1]. Default: 0.1.
`decay`	`float`	No	Decay rate per iteration (0, 1]. Default: 0.995.

BayesUCBConfig#

Configuration for Bayesian UCB with CATS integration.

Creates: BayesUCBSelection

Parameters#
Parameter	Type	Required	Description
`strategy_type`	`Literal`	Auto	Set automatically.
`mode`	`str`	No	`"maximize"` or `"minimize"`. Default: `"maximize"`.
`initial_p_high`	`float`	No	Percentile for heated component [0.5, 0.999]. Default: 0.90.
`initial_p_low`	`float`	No	Percentile for cooled components [0.5, 0.999]. Default: 0.60.
`exploration_phase_end`	`float`	No	Fraction before CATS starts. Default: 0.20.
`transition_phase_end`	`float`	No	Fraction when CATS fully applied. Default: 0.60.
`min_observations`	`int`	No	Min observations for criticality. Default: 5.

Warmup Configurations#

All warmup configs are used with ThompsonSamplingConfig.warmup_config.

BalancedWarmupConfig (Recommended)#

Configuration for balanced warmup with per-reagent variance estimation.

Creates: BalancedWarmup

Parameters#
Parameter	Type	Required	Description
`warmup_type`	`Literal`	Auto	Set automatically.
`observations_per_reagent`	`int`	No	Observations per reagent. Default: 3.
`use_per_reagent_variance`	`bool`	No	Use per-reagent variance estimation. Default: True.
`shrinkage_strength`	`float`	No	James-Stein shrinkage strength. Default: 3.0.
`seed`	`int`	No	Random seed for reproducibility.

StandardWarmupConfig#

Configuration for standard warmup with random partners.

Creates: StandardWarmup

Parameters#
Parameter	Type	Required	Description
`warmup_type`	`Literal`	Auto	Set automatically.
`seed`	`int`	No	Random seed for reproducibility.

EnhancedWarmupConfig#

Configuration for legacy enhanced warmup (stochastic parallel pairing).

Creates: EnhancedWarmup

Parameters#
Parameter	Type	Required	Description
`warmup_type`	`Literal`	Auto	Set automatically.
`seed`	`int`	No	Random seed for reproducibility.

Evaluator Configurations#

All evaluator configs are used with ThompsonSamplingConfig.evaluator_config.

LookupEvaluatorConfig#

Configuration for CSV-based score lookup.

Creates: LookupEvaluator

Parameters#
Parameter	Type	Required	Description
`evaluator_type`	`Literal`	Auto	Set automatically.
`ref_filename`	`str`	Yes	Path to CSV file with scores.
`score_col`	`str`	No	Score column name. Default: `"Scores"`.
`compound_col`	`str`	No	Compound ID column. Default: `"Product_Code"`.

DBEvaluatorConfig#

Configuration for SQLite database lookup.

Creates: DBEvaluator

Parameters#
Parameter	Type	Required	Description
`evaluator_type`	`Literal`	Auto	Set automatically.
`db_filename`	`str`	Yes	Path to SQLite database.
`db_prefix`	`str`	No	Key prefix for lookups. Default: `""`.

FPEvaluatorConfig#

Configuration for fingerprint similarity evaluation.

Creates: FPEvaluator

Parameters#
Parameter	Type	Required	Description
`evaluator_type`	`Literal`	Auto	Set automatically.
`query_smiles`	`str`	Yes	Reference molecule SMILES.
`radius`	`int`	No	Morgan fingerprint radius. Default: 2.
`n_bits`	`int`	No	Fingerprint bit length. Default: 2048.

ROCSEvaluatorConfig#

Configuration for 3D shape similarity (requires OpenEye).

Creates: ROCSEvaluator

Parameters#
Parameter	Type	Required	Description
`evaluator_type`	`Literal`	Auto	Set automatically.
`query_molfile`	`str`	Yes	Path to reference structure (.sdf).
`max_confs`	`int`	No	Max conformers to generate. Default: 50.

FredEvaluatorConfig#

Configuration for molecular docking (requires OpenEye).

Creates: FredEvaluator

Parameters#
Parameter	Type	Required	Description
`evaluator_type`	`Literal`	Auto	Set automatically.
`design_unit_file`	`str`	Yes	Path to receptor file (.oedu).
`max_confs`	`int`	No	Max conformers to generate. Default: 100.

MLClassifierEvaluatorConfig#

Configuration for ML model-based evaluation.

Creates: MLClassifierEvaluator

Parameters#
Parameter	Type	Required	Description
`evaluator_type`	`Literal`	Auto	Set automatically.
`model_filename`	`str`	Yes	Path to pickled sklearn model.

Configuration Presets#

TACTICS provides pre-configured setups for common use cases via get_preset().

Dependencies

Returns fully configured ThompsonSamplingConfig objects with preset strategy, warmup, and evaluator configurations.

Available Presets:

Preset Name	Description
`fast_exploration`	Quick screening with epsilon-greedy (epsilon=0.2, decay=0.995).
`parallel_batch`	Batch processing for slow evaluators with RouletteWheel.
`conservative_exploit`	Hit optimization with greedy strategy.
`balanced_sampling`	General-purpose with UCB (c=2.0).
`diverse_coverage`	Maximum diversity with high-temperature RouletteWheel.
`legacy_rws_maximize`	Original RWS algorithm for maximize mode.
`legacy_rws_minimize`	Original RWS algorithm for minimize mode (docking).

Example usage:

from TACTICS.library_enumeration import SynthesisPipeline
from TACTICS.library_enumeration.smarts_toolkit import ReactionConfig, ReactionDef
from TACTICS.thompson_sampling import get_preset
from TACTICS.thompson_sampling.core.evaluator_config import LookupEvaluatorConfig

# Create synthesis pipeline
rxn_config = ReactionConfig(
    reactions=[ReactionDef(
        reaction_smarts="[C:1](=O)[OH].[NH2:2]>>[C:1](=O)[NH:2]",
        step_index=0
    )],
    reagent_file_list=["acids.smi", "amines.smi"]
)
pipeline = SynthesisPipeline(rxn_config)

# Fast exploration for initial screening
config = get_preset(
    "fast_exploration",
    synthesis_pipeline=pipeline,
    evaluator_config=LookupEvaluatorConfig(ref_filename="scores.csv"),
    num_iterations=1000
)

# Parallel batch for docking
from TACTICS.thompson_sampling.core.evaluator_config import FredEvaluatorConfig

config = get_preset(
    "parallel_batch",
    synthesis_pipeline=pipeline,
    evaluator_config=FredEvaluatorConfig(design_unit_file="receptor.oedu"),
    mode="minimize",
    batch_size=100
)

Factory Functions#

Factory functions create component instances from configurations.

Dependencies

create_strategy() - takes Strategy Configs
create_warmup() - takes Warmup Configs
create_evaluator() - takes Evaluator Configs

Function	Description
`create_strategy(config)`	Create SelectionStrategy from config.
`create_warmup(config)`	Create WarmupStrategy from config.
`create_evaluator(config)`	Create Evaluator from config.

Example:

from TACTICS.thompson_sampling.factories import (
    create_strategy,
    create_warmup,
    create_evaluator
)
from TACTICS.thompson_sampling.strategies.config import RouletteWheelConfig
from TACTICS.thompson_sampling.warmup.config import BalancedWarmupConfig
from TACTICS.thompson_sampling.core.evaluator_config import LookupEvaluatorConfig

strategy = create_strategy(RouletteWheelConfig(mode="maximize", alpha=0.1))
warmup = create_warmup(BalancedWarmupConfig(observations_per_reagent=5))
evaluator = create_evaluator(LookupEvaluatorConfig(ref_filename="scores.csv"))

JSON/YAML Configuration#

Note

ThompsonSamplingConfig requires a SynthesisPipeline object which cannot be serialized directly to JSON. For reproducibility, serialize the ReactionConfig and evaluator/strategy configs separately.

Save ReactionConfig to JSON:

import json
from TACTICS.library_enumeration.smarts_toolkit import ReactionConfig, ReactionDef

rxn_config = ReactionConfig(
    reactions=[ReactionDef(
        reaction_smarts="[C:1](=O)[OH].[NH2:2]>>[C:1](=O)[NH:2]",
        step_index=0
    )],
    reagent_file_list=["acids.smi", "amines.smi"]
)

with open("reaction_config.json", "w") as f:
    json.dump(rxn_config.model_dump(), f, indent=2)

Load and create pipeline:

import json
from TACTICS.library_enumeration import SynthesisPipeline
from TACTICS.library_enumeration.smarts_toolkit import ReactionConfig

with open("reaction_config.json", "r") as f:
    data = json.load(f)

rxn_config = ReactionConfig.model_validate(data)
pipeline = SynthesisPipeline(rxn_config)

Example ReactionConfig JSON:

{
    "reactions": [
        {
            "reaction_smarts": "[C:1](=O)[OH].[NH2:2]>>[C:1](=O)[NH:2]",
            "step_index": 0,
            "pattern_id": null,
            "description": null,
            "deprotections": []
        }
    ],
    "reagent_file_list": ["acids.smi", "amines.smi"],
    "step_inputs": null,
    "step_modes": null,
    "protecting_groups": null
}

Validation Examples#

Type validation:

from TACTICS.library_enumeration import SynthesisPipeline
from TACTICS.library_enumeration.smarts_toolkit import ReactionConfig, ReactionDef
from TACTICS.thompson_sampling.config import ThompsonSamplingConfig
from TACTICS.thompson_sampling.strategies.config import GreedyConfig
from TACTICS.thompson_sampling.core.evaluator_config import LookupEvaluatorConfig
from pydantic import ValidationError

# Create pipeline
rxn_config = ReactionConfig(
    reactions=[ReactionDef(reaction_smarts="[C:1](=O)[OH].[NH2:2]>>[C:1](=O)[NH:2]", step_index=0)],
    reagent_file_list=["acids.smi", "amines.smi"]
)
pipeline = SynthesisPipeline(rxn_config)

try:
    config = ThompsonSamplingConfig(
        synthesis_pipeline=pipeline,
        num_ts_iterations="not_an_integer",  # Invalid!
        strategy_config=GreedyConfig(),
        evaluator_config=LookupEvaluatorConfig(ref_filename="scores.csv")
    )
except ValidationError as e:
    print(f"Validation error: {e}")

Constraint validation:

from TACTICS.thompson_sampling.strategies.config import RouletteWheelConfig
from pydantic import ValidationError

try:
    config = RouletteWheelConfig(
        mode="maximize",
        alpha=-0.1,  # Invalid: must be > 0
    )
except ValidationError as e:
    print(f"Constraint error: {e}")

Best Practices#

Use SynthesisPipeline - Single source of truth for reactions and reagents
Use presets - Start with presets and customize as needed
Validate early - Pydantic catches errors before expensive computations
Save ReactionConfig - JSON export for reproducibility
Use BalancedWarmup - Recommended for most use cases
Choose evaluator wisely - Use processes=1 for fast evaluators (Lookup, DB)