Classical training treats every example the same. But "easy" examples (the model predicts a 90% win probability and the trade wins 90% of the time) carry little new information. The real learning signal comes from examples the model was uncertain about.
This module computes a sample-weight multiplier in [1.0, 3.0] from three uncertainty signals captured at prediction time:
- Boundary distance: how close to 0.5 was the prediction?
- MC dropout std: how much did random feature masks shift the prediction?
- Bootstrap divergence: how much did the K=5 bagged models disagree?
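The three signals above can be combined into the [1.0, 3.0] multiplier in several ways; a minimal sketch of one possibility, assuming a simple weighted average with the MC-dropout and bootstrap signals already normalized to [0, 1] (the function name, weights, and normalization are illustrative, not the module's actual implementation):

```python
def al_multiplier(prob, mc_std, boot_std,
                  w_boundary=1.0, w_mc=1.0, w_boot=1.0):
    """Hypothetical combiner: three uncertainty signals -> multiplier in [1.0, 3.0].

    prob     : predicted win probability at prediction time
    mc_std   : MC-dropout std of the prediction, assumed scaled to [0, 1]
    boot_std : disagreement (std) across the K=5 bagged models, assumed in [0, 1]
    """
    # Boundary distance signal: 1.0 at prob == 0.5, falling to 0.0 at 0 or 1.
    boundary = 1.0 - 2.0 * abs(prob - 0.5)
    # Weighted average of the three signals, clipped to [0, 1].
    total = w_boundary + w_mc + w_boot
    score = (w_boundary * boundary + w_mc * mc_std + w_boot * boot_std) / total
    score = min(max(score, 0.0), 1.0)
    # Map [0, 1] linearly onto the documented [1.0, 3.0] range.
    return 1.0 + 2.0 * score
```

A maximally uncertain example (prob = 0.5 with both ensemble signals saturated) hits the 3.0 ceiling, while a confident, stable prediction stays near 1.0.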
When a trade resolves, the standard sample weight (derived from R-multiple) is multiplied by this active-learning multiplier before being passed to Model.train(). The model gradient gets a larger push on uncertain examples, biasing learning toward the boundary where it matters most.
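The weighting step at trade resolution can be sketched as follows. The base-weight formula here is a hypothetical stand-in (the module derives it from R-multiple, but the exact mapping is not specified in this text); the clamp on the multiplier reflects the documented [1.0, 3.0] range:

```python
def final_sample_weight(r_multiple, al_mult):
    """Hypothetical final weight: base R-multiple weight * AL multiplier."""
    # Illustrative base weight: bigger wins/losses count more, capped at 3.0.
    base = min(abs(r_multiple), 3.0)
    # Defensive clamp to the documented active-learning range [1.0, 3.0].
    al_mult = min(max(al_mult, 1.0), 3.0)
    return base * al_mult

# At resolution time the result would be passed as the sample weight to
# Model.train(), so uncertain examples pull harder on the gradient.
```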
Reference: Settles (2009), "Active Learning Literature Survey".