Validation

24 articles in this topic

24 Mar 2026

Does insurance-gam actually work for insurance pricing?

Benchmark results on a known-DGP synthetic UK motor book. EBM beats the GLM by 35 Gini points. But the deviance number is misleading. We explain why, and when you should care.
24 Mar 2026

Does HMM telematics scoring actually work for insurance pricing?

Benchmark results on a known-DGP synthetic UK motor fleet. HMM state fractions deliver 5–10pp Gini lift over simple aggregates. State classification recovers >50% of true high-r...
24 Mar 2026

Does GBM-to-GLM Distillation Actually Work for Insurance Pricing?

Honest benchmark: does fitting a surrogate GLM on CatBoost pseudo-predictions recover more discriminatory power than a direct GLM? We test it on 30,000 synthetic UK motor policies.
24 Mar 2026

Does constrained rate optimisation actually work for insurance pricing?

Benchmark results on synthetic UK motor renewal books. The constrained optimiser outperforms flat rate changes on profit and retention simultaneously. What it does not do: fix a...
23 Mar 2026

Exposure-Weighted Gini Coefficient in Python

Exposure-weighted Gini for insurance pricing: correct formula, Python implementation, and why ignoring exposure distorts motor model governance.
23 Mar 2026

Does Whittaker-Henderson smoothing actually work for insurance pricing?

Benchmark results on a known-DGP synthetic UK motor age curve. REML recovers the true frequency well in the data-rich middle. The tails are a different story. Numbers, not claims.
23 Mar 2026

Does Sarmanov copula frequency-severity modelling actually work for insurance pricing?

We read the source, ran the benchmark, and checked the claim: the independence assumption in standard two-part GLMs is wrong for UK motor, and this library corrects it analytica...
23 Mar 2026

Does proxy discrimination testing actually work?

We ran the insurance-fairness proxy detection library against a synthetic motor book with planted proxy effects and compared it against the manual correlation check most teams a...
23 Mar 2026

Does automated model monitoring actually work for insurance pricing?

Aggregate A/E at 0.94 looks fine. The model has been mispricing under-25s for eight months. Benchmark results on a synthetic UK motor book with three planted failure modes.
23 Mar 2026

Does DML causal inference actually work for insurance pricing?

We ran the benchmarks. On a synthetic UK motor book with nonlinear confounding, naive logistic GLM overestimates the telematics treatment effect by 50–90%. DML recovers the grou...
23 Mar 2026

Does conformal prediction actually work for insurance pricing?

Benchmark results on a known-DGP synthetic motor book. Conformal hits 90% across all deciles. Parametric Tweedie under-covers the top decile by 10–15pp. Numbers, not theory.
23 Mar 2026

Does Bühlmann-Straub credibility actually work for insurance pricing?

Benchmark results on 100 synthetic schemes with known true loss rates. Credibility blending reduces MSE by 25–35% vs the best naive alternative. Numbers, not theory.
21 Mar 2026

Why k-Fold CV Is Wrong for Insurance and What to Do Instead

Insurance walk-forward cross-validation prevents the look-ahead bias that makes standard k-fold results useless for prospective evaluation. Complete Python example with insuranc...
13 Mar 2026

Foundation Models for Thin Segments: TabPFN and TabICLv2 in Insurance Pricing

TabPFN and TabICLv2 for thin-segment UK insurance pricing. In-context learning at inference, no gradient descent. insurance-thin-data wraps both for actuaries.
13 Mar 2026

Correcting for Covariate Shift When You Acquire an MGA Book

Correct covariate shift when acquiring an MGA book for UK motor pricing. Importance weighting, density ratio estimation, and segment-level diagnostics in Python.
12 Mar 2026

GARCH for Claims Inflation: Modelling Volatility That Clusters

GARCH for UK insurance claims inflation: time-varying variance in trend analysis. insurance-garch applies Engle (1982) to actuarial trend and pricing models.
11 Mar 2026

Quantitative Model Validation Under PRA SS1/23: Pass/Fail Tests with Reproducible Audit Trails

PRA SS1/23 requires quantitative pass/fail tests, not narrative. insurance-governance automates the full validation suite and generates auditable HTML reports.
09 Mar 2026

DML for Insurance: Practical Benchmarks and Pitfalls

Where double machine learning beats naive regression in UK motor pricing, and where it costs more than it gains. Benchmarks on synthetic data.
08 Mar 2026

How Do You Know Your Sigma Model Is Working?

Three diagnostics prove a GAMLSS sigma submodel is real: quantile residuals, worm plots, split-sample calibration. From insurance-distributional-glm.
06 Mar 2026

Density Ratio Detection for Channel Mix Drift: Correcting Predictions Before the Loss Ratio Reacts

When a new aggregator partnership or competitor exit changes your new business mix, models trained on the old distribution misprice silently.
23 Feb 2026

Why Your Cross-Validation is Lying to You

Standard k-fold CV is wrong for insurance pricing. Temporal leakage and IBNR contamination inflate scores. Walk-forward validation in Python fixes both.
24 Nov 2025

Your Model Validation Is a Checklist, Not a Test

PRA SS1/23 requires quantitative pass/fail tests, not narrative. insurance-governance automates the full validation suite and generates auditable HTML reports.
28 Oct 2025

How Do You Know Your Sigma Model Is Working?

Three diagnostics prove a GAMLSS sigma submodel is real: quantile residuals, worm plots, split-sample calibration. From insurance-distributional-glm.
23 Sep 2025

Your New Business Mix Changed. Your Model Didn't Notice.

When a new aggregator partnership or competitor exit changes your new business mix, models trained on the old distribution misprice silently.