Evidence-based verification the brain is actually learning. Every number on this page is recomputed live from raw journal data โ nothing cached.
Q1. Are predictions being captured?
โ
โ
โ
Q2. Are outcomes being resolved?
โ
โ
โ
Q3. Is the model actually learning?
โ
โ
โ
Q4. Is accuracy real or cached?
โ
โ
โ
Q5. Are weights converging?
โ
โ
โ
Q6. Are real prices feeding it?
โ
โ
โ
๐ฏ Worker โ held-out test results (real signal vs noise)
The worker bootstrap trains on 80% of historical data and tests on the remaining 20% the model NEVER saw.
Two splits: random shuffle (stationary upper bound) and walk-forward (honest "trained on past, predicts future" test).
Random 80/20 split โ stationary estimate
Test set size
โ
Accuracy
โ
Brier Skill Score
โ
vs constant 0.5 baseline
p-value vs random
โ
Walk-forward split โ honest trading test
Test set size
โ
Accuracy
โ
Brier Skill Score
โ
vs constant 0.5 baseline
p-value vs random
โ
BSS reading guide: >+0.05 = real edge ยท 0 to +0.05 = weak signal ยท โค0 = below random. p <0.05 = statistically significant. Walk-forward is the truth-test.
If random says REAL SIGNAL but walk-forward says BELOW BASELINE, the model is overfitting / leaking future info.
๐ Weight Ledger โ proves weights are changing over time
Each row is a SHA-style hash of the model's weight vector at that moment. If two hashes are different, the brain trained between them.
If they're all identical, the brain isn't learning โ flag a bug.
Last 20 page-loads. New row added every time you visit this page.
โ Live Verification โ inject a test prediction
Click below to inject a known prediction (AAPL LONG, prob=0.85) into the journal. Then watch
/brain-debug tick โ within 30s the new entry shows up.
After the short-horizon (24h) elapses it resolves automatically; OR click "Force resolve" to immediately simulate the resolution against current AAPL price.
๐งฎ Live Accuracy โ recomputed from raw journal
Recomputes from bpleone_pred_journal_v1 directly. If this disagrees with the
number shown on brain-truth.html, there's a stale cache somewhere โ flag it.
Total resolved
โ
Correct (recomputed)
โ
Accuracy (recomputed)
โ
vs baseline 50%
โ
โ
๐ก Recent Activity Stream โ last 20 events
Live feed showing the brain capturing predictions and resolving them. If this freezes, the brain isn't ticking.
โ What this page does NOT prove
This page proves the mechanics work. It does NOT prove the brain is making GOOD predictions โ
only that it's capturing, resolving, and training. Whether the predictions actually beat random is a separate question (see
/brier-skill and
/sharpe-ratio).
It also does not prove the input features are correct โ only that whatever-was-captured trains the model.
If you suspect bad features, run /self-test which checks
the FeatureExtractor's output shape and bounds.
For a paper-trading audit (did open/stop/target prices match real moves?), run
/daily-replay for any past day.