Python is a strong language for research ergonomics, but it is also easy to leak future information through shared globals, cached features, or "helpful" preprocessing across the full dataset.

The fix is architectural: treat walk-forward as an outer orchestration loop, not a flag inside one backtest.

Pattern A: outer loop owns time

Your inner backtester should accept:

candle slice for IS only
parameters

Then the outer loop:

optimizes on IS
freezes parameters
runs OOS on the next slice only
stores metrics + parameters + dataset hash

Never let the inner backtester read the full series when you are claiming OOS.

Pattern B: preprocessing must be window-local

If you normalize, detrend, or label regimes using global statistics, you have created a subtle leak.

Rule: any transform must be fit on IS data only, then applied to OOS using only IS-derived state (or recomputed in a strictly causal way).

Pattern C: persist everything as JSONL

Each window should append one JSON record:

window id
IS range, OOS range
best params
metrics
runtime seconds
versions (requirements.txt hash)

This is what makes results reproducible when you return in six months.

Practical integration checklist

single source of truth for timestamps (UTC)
deterministic sorting of candles
explicit warm-up bars excluded from optimization objective
costs applied identically on IS and OOS

Parallel workers: keep window order deterministic

If you parallelize windows, aggregate results by window id before you compute summaries.

Otherwise you can accidentally mix partial outputs or create non-deterministic tie-breaking in optimizers.

Library boundary note

Whether you use vectorized frameworks or event-driven simulators, the walk-forward contract is the same: no future data in OOS evaluation.

If a library encourages "fit scaler on full dataframe" defaults, override them or wrap preprocessing in your outer loop.

When to stop coding and export for review

If your team debates metrics in Jupyter chaos, export a stable artifact bundle and run a second-pass validation workflow (Data snooping).