Traders search OOS retention trading, out of sample retention metric, and walk forward efficiency metric in the same research session because modern robustness reports show both. They are related, but they are not duplicates.

Out-of-sample (OOS) retention and Walk-Forward Efficiency (WFE) both ask whether your strategy survives contact with unseen data. One stresses how much of an IS result shows up in OOS in relative terms; the other stresses efficiency of that transfer in a rolling walk-forward process and strategy edge OOS carry-through across steps.

Exact definitions depend on implementation. In Kiploks you will often see both next to robustness and data-quality context so you do not optimize a single number in isolation.

OOS retention in plain language

Retention usually measures how returns, ratios, or PnL carry from an in-sample window into the paired out-of-sample window. High retention means the OOS segment still looks like the IS segment in the way the metric defines "carry-through."

What retention is good at

Retention is intuitive for single-window thinking: "Did the next unseen segment still behave like the training segment?"

Where retention misleads

Retention can look good when one OOS window is strong, even if other windows are weak. That is why retention should be read with:

How many windows you have (Minimum windows)
Whether costs are realistic (Cost drag)
Whether trades are independent enough for the implied statistics (How many trades)

Walk-Forward Efficiency in plain language

WFE summarizes how efficiently your process transfers IS behavior into OOS across walk-forward steps. It is tied to the sequence of train-then-test rolls, not only a single split.

If walk-forward is new to you, start with What is Walk-Forward Analysis?. For a focused read on WFE, see Walk-Forward Efficiency (WFE) explained.

What WFE is good at

WFE is a compact answer to: "Does my process keep working when repeated through time?" That is closer to walk-forward test significance than a single split is.

When retention and WFE disagree

Disagreement is a feature, not a bug.

Retention strong, WFE weak can mean one heroic OOS window hides instability across rolls, or costs differ by segment.
WFE strong, retention weak can mean the efficiency framing rewards a pattern that is still thin in absolute dollars, or the retention metric is scaled differently than you expect.

When they diverge, inspect which window broke, fees and slippage, and parameter stability (PSI) before you trade the headline.

How this connects to IS/OOS language

If you are still clarifying in-sample out-of-sample trading, read In-sample vs out-of-sample: the only explanation you need. Retention and WFE are next-level summaries after you already believe your IS/OOS protocol is honest.

Sample size and honesty

Thin samples make both metrics noisy. Window-level instability is common when walk-forward analysis window size is too small for your trade frequency (How to choose IS/OOS window sizes).

Where to go deeper

Guide: walk-forward for how the UI ties windows to metrics.
Methodology for how Kiploks frames evidence.