Design Module

The xpyrment.design module contains submodules and components for design.

design

Experimental design, user routing, traffic splits, randomization, and Design of Experiments (DoE).

This package provides utilities for configuring study designs, routing experimental traffic, setting up randomization schemes, and classical Design of Experiments (DoE) matrices: - hash_assign: Cryptographic deterministic hashing engine to route units state-lessly. - TrafficSplitter: Coordinates multi-variant allocations, long-term holdouts, and progressive ramps. - stratified_randomization: Ensures structural balance on pre-experiment covariates. - doe: Comprehensive classical design matrices (factorial, fractional, Taguchi, DSD, CCD, mixture, etc.).

MODULE	DESCRIPTION
`doe`	Classical Design of Experiments (DoE) generators and optimization engines.
`power`	Analytical Power Analysis & Sample Size Estimator (Block 44).
`randomization`	Deterministic, hash-based randomization engines for user and unit assignments.
`splits`	Traffic split coordinators, holdout groups, and exposure ramp schedules.
`stratification`	Stratified and clustered randomization engines for balanced covariate distributions.

CLASS	DESCRIPTION
`TrafficSplitter`	Manages traffic split fractions, global holdout configurations, and progressive exposure ramp schedules.
`AnalyticalPowerCalculator`	Computes sample sizes, statistical power, and Minimum Detectable Effects (MDE).

FUNCTION	DESCRIPTION
`hash_assign`	Assigns a unit to a variant deterministically using MD5 hashing and modulo arithmetic.
`stratified_randomization`	Performs stratified randomization to ensure balance on continuous/categorical covariates.

TrafficSplitter

TrafficSplitter(
    allocations: Dict[str, float],
    holdout_percentage: float = 0.0,
    ramp_schedule: List[float] = None,
)

Manages traffic split fractions, global holdout configurations, and progressive exposure ramp schedules.

A TrafficSplitter divides incoming traffic into discrete experimental buckets. It supports fractional allocations, standard control and treatment arms, and long-term holdout groups.

Business and Operational Context

Fractional Allocations: Allows unequal variants (e.g., 90% control, 10% treatment) to limit exposure during early launch states.
Long-Term Holdouts: A portion of users can be entirely withheld from all experiments within a product area (the "holdout group") to measure the long-term cumulative effects and potential interaction effects of independent product changes over months.
Exposure Ramp Schedules: Step-wise exposure schedules (e.g., exposing 1% of users, then 10%, then 50%, then 100%) act as standard release gates. This mitigates operational risk by validating telemetry and checking for guardrail breaches before exposing the entire user base.

ATTRIBUTE	DESCRIPTION
`allocations`	A dictionary mapping variant names to their allocation weight fractions (values bounded in $[0, 1]$). TYPE: `Dict[str, float]`
`holdout_percentage`	Percentage of global traffic completely excluded from selection and routed to a static holdout arm. Bounded in $[0, 1]$. TYPE: `float`

Examples:

Example

>>> allocations = {"control": 0.45, "treatment": 0.45}
>>> splitter = TrafficSplitter(allocations=allocations, holdout_percentage=0.10)
>>> splitter.holdout_percentage
0.1

Validates that the sum of variant allocations and holdout percentages totals exactly 1.0.

PARAMETER	DESCRIPTION
`allocations`	Mapping of variant labels to active traffic split weights. TYPE: `Dict[str, float]`
`holdout_percentage`	Fractional traffic diverted into a holdout group. Defaults to 0.0. TYPE: `float` DEFAULT: `0.0`
`ramp_schedule`	Custom progressive exposure percentages. Defaults to None. TYPE: `List[float]` DEFAULT: `None`

RAISES	DESCRIPTION
`ValueError`	If any individual allocation weight is negative.
`ValueError`	If the total allocation weight including the holdout percentage does not sum to 1.0.
`ValueError`	If ramp_schedule has values outside [0.0, 1.0], is non-monotonic, or does not end in 1.0.

METHOD	DESCRIPTION
`get_ramp_schedule`	Generates progressive exposure ramp-up schedule coordinates.

Source code in src\xpyrment\design\splits.py

def __init__(
    self,
    allocations: Dict[str, float],
    holdout_percentage: float = 0.0,
    ramp_schedule: List[float] = None,
):
    """Initializes a new TrafficSplitter container.

    Validates that the sum of variant allocations and holdout percentages totals exactly 1.0.

    Args:
        allocations (Dict[str, float]): Mapping of variant labels to active traffic split weights.
        holdout_percentage (float): Fractional traffic diverted into a holdout group. Defaults to 0.0.
        ramp_schedule (List[float], optional): Custom progressive exposure percentages. Defaults to None.

    Raises:
        ValueError: If any individual allocation weight is negative.
        ValueError: If the total allocation weight including the holdout percentage does not sum to 1.0.
        ValueError: If ramp_schedule has values outside [0.0, 1.0], is non-monotonic, or does not end in 1.0.
    """
    self.allocations = allocations
    self.holdout_percentage = holdout_percentage
    self.ramp_schedule = ramp_schedule

    # Validate bounds
    if holdout_percentage < 0.0 or holdout_percentage > 1.0:
        raise ValueError("holdout_percentage must be between 0.0 and 1.0.")
    for variant, weight in allocations.items():
        if weight < 0.0 or weight > 1.0:
            raise ValueError(f"Allocation for variant '{variant}' must be between 0.0 and 1.0.")

    total_alloc = sum(allocations.values()) + holdout_percentage
    if abs(total_alloc - 1.0) > 1e-5:
        raise ValueError("Total allocations including holdout must equal 1.0.")

    # Validate custom ramp-up schedule if provided
    if ramp_schedule is not None:
        if not ramp_schedule:
            raise ValueError("ramp_schedule list cannot be empty.")
        for val in ramp_schedule:
            if val < 0.0 or val > 1.0:
                raise ValueError(f"Ramp schedule value '{val}' must be between 0.0 and 1.0.")
        # Check monotonicity
        for i in range(len(ramp_schedule) - 1):
            if ramp_schedule[i] > ramp_schedule[i + 1]:
                raise ValueError("Ramp schedule values must be monotonically non-decreasing.")
        # Must terminate in 1.0
        if abs(ramp_schedule[-1] - 1.0) > 1e-5:
            raise ValueError("Ramp schedule must terminate at 1.0 (full exposure).")

get_ramp_schedule

get_ramp_schedule() -> List[float]

Generates progressive exposure ramp-up schedule coordinates.

Ramping exposure represents a crucial risk-management strategy. This method returns a list of active exposure fractions defining sequential release gating stages.

Mathematical Representation

Let $R$ be the ramp-up schedule array: $$ R = [r_1, r_2, \dots, r_m] $$ where $r_j \in [0, 1]$ represents the fraction of total traffic exposed to the experiment during step $j$, such that $r_j \le r_{j+1}$.

RETURNS	DESCRIPTION
`List[float]`	List[float]: Ordered list of target traffic fractions representing progressive exposure stages.

Source code in src\xpyrment\design\splits.py

def get_ramp_schedule(self) -> List[float]:
    r"""Generates progressive exposure ramp-up schedule coordinates.

    Ramping exposure represents a crucial risk-management strategy. This method returns a list of active exposure
    fractions defining sequential release gating stages.

    Mathematical Representation:
        Let $R$ be the ramp-up schedule array:
        $$
        R = [r_1, r_2, \dots, r_m]
        $$
        where $r_j \in [0, 1]$ represents the fraction of total traffic exposed to the experiment during step $j$,
        such that $r_j \le r_{j+1}$.

    Returns:
        List[float]: Ordered list of target traffic fractions representing progressive exposure stages.
    """
    if self.ramp_schedule is not None:
        return self.ramp_schedule
    return [0.01, 0.10, 0.50, 1.0]

AnalyticalPowerCalculator

AnalyticalPowerCalculator(
    alpha: float = 0.05, power: float = 0.8
)

Computes sample sizes, statistical power, and Minimum Detectable Effects (MDE).

TODO: Extend power calculations to unequal allocation ratios with multiple treatment arms using Dunnett's adjustment. TODO: Add exact simulation-based power curves utilizing empirical bootstrap baseline variance matrices.

PARAMETER	DESCRIPTION
`alpha`	Targeted significance/Type I error rate. Defaults to 0.05. TYPE: `float` DEFAULT: `0.05`
`power`	Targeted power/1 - Type II error rate. Defaults to 0.80. TYPE: `float` DEFAULT: `0.8`

METHOD	DESCRIPTION
`compute_sample_size`	Computes sample size per group for a two-sample t-test.
`compute_mde`	Computes Minimum Detectable Effect (MDE) under configured constraints.
`compute_power`	Computes statistical power given sample size and target treatment effect size.
`adjust_for_clusters`	Adjusts sample size requirements for clustered designs using the VIF model.