● Public track record · honest measurement

Public track record

What we measure, how we measure it, and the current state of the measurement — without the "73% accuracy" slogans that typically come without sources. Logged-in users see the per-signal live track inside the dashboard.

Honest disclosure. The full math stack went live April 2026. We've been recording every score every day since 2026-04-09. Short-horizon accuracy (7-day, 14-day, 19-day) is tracked live below — early but real. The 30-day horizon now has at least one complete forward-return window — its IC publishes alongside the shorter horizons as the weekly cron rolls. Every number is either a live diagnostic or a live accuracy measurement — never an in-sample backtest dressed up as out-of-sample performance.

Honest disclosure. Framler's math stack (multi-factor ensemble, BOCPD regime posterior, copula-blended Markowitz, bin-conditional conformal intervals, Kalman dynamic exposures) deployed to production April 2026. signal_history snapshots have accumulated daily since 2026-04-09. Live IC for 7/14/19-day horizons is published below from the weekly accuracy-check cron. The 30-day horizon now has its first complete window of post-snapshot price data and joins the published horizons via the weekly accuracy-check cron. Sharpe / CVaR need ≥3 months of windows for stable estimates. Sector + regime breakdowns shown when sample sizes are sufficient (≥30 paired observations per bucket).

Current state

Universe

1001/1001

1001 of 1001 tickers updated in the rolling 24-hour window. Full universe scoring runs Mon-Fri 06:00 UTC; weekend coverage is partial (only the news-sentiment and macro crons fire). The number drops over weekends and rises again Monday.

Regime detection

risk_on

BOCPD posterior as of 2026-07-03. Fires continuously on SPY returns.

Effective breadth

50%

Grinold-Kahn effective count of independent factors out of 13 raw. Low ratio = factors are redundant; high ratio = genuinely different signals.

Implied 30-day move (VIX)

±4.6%

VIX 16.1 as of 2026-07-02. Forward-looking implied σ for the S&P over the next 30 calendar days. Complements the historical SPY drift used by the engine.

Tail-dependence

234 pairs

Max upper-tail alignment: moderate. Non-parametric co-crash probability per factor pair, per regime.

Prediction intervals

Calibrated

25 Mondrian bins calibrated from accumulating residuals.

Factor weights

Literature prior

Prior weights are the Asness-Moskowitz-Pedersen + Novy-Marx + Sloan literature defaults. Activates after forward returns accumulate.

Calibration IC by horizon

Walk-forward cross-validated Information Coefficient produced by the weekly calibrate-weights cron. Different from the live-accuracy block above — that measures the end-to-end score-to-return relationship; this measures the per-horizon IC of the inverse-covariance-shrunk factor stack the engine uses to form the composite. OOS IC is the headline number; train-IC is shown as the second line for shrinkage-leakage sanity-check (large gap = overfitting risk).

30-day OOS IC

+0.000

Train IC +0.040 · 2072 samples · inv_cov.

Live accuracy — Information Coefficient (IC)

Spearman rank correlation between Framler score on day T and realised price return from T to T+N, averaged across all overlapping windows. Refreshed weekly. 58 days of accumulation as of 2026-06-29. Industry context: a strong multi-factor signal is typically IC 0.03-0.06 out-of-sample.

7-day IC

-0.012

8 windows · hit rate 25% (windows where IC > 0).

14-day IC

0.082

4 windows · hit rate 100% (windows where IC > 0).

19-day IC

0.108

3 windows · hit rate 66.7% (windows where IC > 0).

Rolling IC — last 12 weekly readings

Each tile shows the rolling mean ± 1σ across the last 12 weekly readings. Sparkline traces the actual readings chronologically (oldest left → newest right). Values clip at ±0.20 for the line; the y-axis is centred at zero. Persistent IC ≥ 0 means the engine is empirically predictive over the rolling window, not just on the most-recent snapshot.

7-day rolling IC

+0.023± 0.060

n = 12 weekly readings · 6/12 positive

14-day rolling IC

+0.100± 0.022

n = 11 weekly readings · 11/11 positive

19-day rolling IC

+0.139± 0.050

n = 11 weekly readings · 11/11 positive

Where the engine works — and where it doesn't

Each sector is placed into one of four calibration tiers based on the rank correlation between the composite score and realised forward returns over recent weekly windows. We publish the tier; the magnitude stays internal because raw per-cohort IC is part of the engine moat. Sectors with fewer than 30 paired observations surface as Pending — the weekly cron promotes them once the sample threshold clears.

Communication Services

POSITIVE