How-To

151 articles in this topic

04 Apr 2026

Whittaker-Henderson Smoothing in Python: A Step-by-Step Tutorial for Insurance Rating Tables

A complete Python tutorial for Whittaker-Henderson smoothing of insurance rating tables. Replace your Excel moving average or SAS graduation with automatic REML lambda selection...
04 Apr 2026

Telematics Pricing in Python: From Raw Trip Data to a GLM in 30 Minutes

A practical Python tutorial for telematics pricing: load raw GPS trip data, classify driving regimes with a Hidden Markov Model, and produce GLM-ready risk features using insura...
04 Apr 2026

Survival Analysis for Insurance Lapse Modelling in Python: A Complete Tutorial

A hands-on Python tutorial for insurance pricing analysts on survival analysis and lapse modelling. Covers Kaplan-Meier, Weibull AFT, mixture cure models, customer lifetime valu...
04 Apr 2026

Insurance Model Monitoring in Python: A Practitioner's Guide

How to set up insurance model monitoring in Python from scratch: PSI, Gini drift, and A/E with the insurance-monitoring library. Know when to redeploy, recalibrate, or refit.
04 Apr 2026

Insurance Fairness Audit in Python: A Complete FCA Consumer Duty Walkthrough

Step-by-step Python tutorial for running an insurance fairness audit. Covers proxy discrimination detection, exposure-weighted bias metrics, FCA Consumer Duty mapping, and gener...
04 Apr 2026

Building a GLM Frequency Model in Python: A Step-by-Step Guide for Insurance Pricing

End-to-end GLM frequency model in Python using freMTPL2 from OpenML. Data prep, exposure handling, glum fitting, deviance residuals, actual vs expected, and factor relativity ex...
04 Apr 2026

GAM Insurance Pricing in Python: An EBM Tariff Tutorial with insurance-gam

A hands-on tutorial on GAM insurance pricing in Python using the insurance-gam library. Covers EBM tariff construction, shape function extraction, GLM comparison, Shapley values...
04 Apr 2026

Credibility Theory in Python: A Complete Buhlmann-Straub Tutorial for Insurance Pricing

A practical Python tutorial on credibility theory for insurance pricing analysts. Covers Buhlmann-Straub model, the insurance-credibility library, UK motor example, GLM integrat...
04 Apr 2026

Conformal Prediction for Insurance Python: A Frequency-Severity Tutorial

A step-by-step tutorial on conformal prediction for insurance Python models, specifically the frequency-severity decomposition. Covers the calibration subtlety that breaks naive...
04 Apr 2026

Causal Inference for Insurance Pricing in Python: A Complete DML Tutorial

A hands-on tutorial on causal inference for insurance pricing in Python using the insurance-causal library. Covers double machine learning (DML), CatBoost nuisance models, ATE/C...
04 Apr 2026

Actuarial Model Validation in Python: Automated Reports for UK Insurance Pricing

How to run actuarial model validation in Python for UK insurance pricing models. Covers Solvency II Article 120 and Consumer Duty requirements, the five-test validation suite, a...
03 Apr 2026

Your Gini Is Stable. Your Model Isn't.

A stable Gini coefficient is not evidence that a model is performing well. It is evidence that the model is still ranking risks in the same order. Score decomposition separates ...
03 Apr 2026

NeuralGaussianMixture vs the insurance-distributional Stack: When to Use What

insurance-distributional now has five distributional model classes. NeuralGaussianMixture is the newest and the most demanding. A routing guide: which model for which problem, a...
03 Apr 2026

NCD Underreporting Has a Second Problem You Are Probably Missing

The hunger-for-bonus effect biases your NCD frequency relativities. It also biases your severity model. The two errors partially offset each other — but the combined underpricin...
03 Apr 2026

KL Fairness Corrections and Multiplicative GLMs: The Production Deployment Problem

Miao & Pesenti's KL discrimination-insensitive result is theoretically clean. Deploying it in a production GLM-based pricing system is not. The paper is silent on how to extract...
03 Apr 2026

The Interest Rate Effect Nobody Put in Their NCD Model

Holtan (2001) showed that the NCD reporting threshold falls when interest rates rise — the NPV of future premium penalties shrinks, so policyholders become more willing to claim...
03 Apr 2026

Your Hurdle-Poisson Model Might Say Accidents Make Drivers Safer

Lee et al. (arXiv:2602.02398) prove that standard hurdle-Poisson models with bivariate normal random effects can violate credibility order — your frequency estimate goes down af...
02 Apr 2026

ModelMonitor: Calibration Testing That Actually Tells You What To Do

insurance-monitoring v1.0.0 adds ModelMonitor with check_gmcb and check_lmcb — separate tests for global and local calibration drift, wired into a three-way REDEPLOY/RECALIBRATE...
02 Apr 2026

How Long Does an Inflationary Shock Last? Lessons from Pandemic Mortality Persistence

A new mortality model from Liu & Zhou (2026) shows that cause-specific shocks decay heterogeneously — some fast, some slow. The analogy to UK claims inflation is exact, and the ...
02 Apr 2026

Your NCD Relativities Are Wrong, and the Maths Now Tells You How Wrong

Two January 2026 arXiv papers formalise what motor actuaries have always known informally: NCD creates rational incentives to suppress small claims, and the GLM you're using to ...
02 Apr 2026

Your NCD Table Has a Frequency Bias

insurance-credibility v0.1.9 adds BMSEquilibriumSimulator — Lemaire NPV reporting thresholds, Liang 2-class equilibrium, and a frequency correction for the selection bias in NCD...
02 Apr 2026

From Competitor Quotes to Risk Parameters: ABC for Entry Pricing in Python

A runnable Python implementation of Goffard, Piette, and Peters (ASTIN Bulletin 2025): infer claim frequency and severity from competitor PCW quotes using ABC-SMC with isotonic ...
01 Apr 2026

Applying Bonus-Malus to Driving Behaviour, Not Just Claims

Yanez, Guillen and Nielsen (ASTIN Bulletin 2025) apply a bounded Bonus-Malus System not to claims but to telematics signals themselves, updating weekly. The result: Gini from 0....
01 Apr 2026

The 99th Percentile From 200 Claims: Wasserstein Robust Quantile Regression for Thin Portfolios

Zhang, Mao and Wang (arXiv:2603.14991, March 2026) prove a closed-form equivalent for worst-case quantile regression under Wasserstein distributional uncertainty — a result that...
01 Apr 2026

Protected NCD Is a Fair Value Problem Waiting to Happen

Protected NCD is widely misunderstood by consumers, and the product may not deliver the value it charges for. The Consumer Duty fair value test and the hunger-for-bonus literatu...
01 Apr 2026

Online Conformal Validity on Claims Data: Beta-Mixing Conditions and the Seasonal Caveat

Standard conformal prediction gives valid coverage only when calibration and test data are exchangeable. For insurance models deployed for 12+ months — through claims inflation ...
01 Apr 2026

The Hunger for Bonus: How UK Motor NCD Pricing Gets the Frequency Wrong

Two January 2026 arXiv preprints formalise what UK pricing teams have long intuited: observed claim frequency at high-NCD classes understates true frequency by 15–35%, because p...
01 Apr 2026

Formal Statistical Tests for Insurance Pricing Model Drift

insurance-monitoring v0.10.0 adds PricingDriftMonitor (Brauer/Menzel/Wüthrich arXiv:2510.04556) and CalibrationCUSUM (Franck et al. arXiv:2510.25573): formal statistical tests f...
01 Apr 2026

The Neural GAM Actuaries Can Explain to Their CRO

ANAM (Laub, Pho, and Wong, NAAJ 2025) fits each rating factor as a neural subnetwork with hard monotonicity constraints, exposure offsets, and proper actuarial losses. The insur...
31 Mar 2026

When your coverage guarantee means nothing: optimal regret in online conformal prediction under drift

ACI satisfies its marginal coverage guarantee while producing months of invalid intervals after a claims inflation shock. A new paper proves the minimax-optimal algorithm flushe...
31 Mar 2026

Your Policyholders Are Playing a Game with Your NCD Ladder

Policyholders with good NCD rationally choose not to report small claims. Your frequency model is trained on that suppressed data. Two January 2026 papers formalise what this me...
31 Mar 2026

Gini Drift Tests and Murphy Decomposition: What Insurance Model Monitoring Was Missing

Brauer, Menzel & Wüthrich (arXiv:2510.04556) give us two things we have been missing: a formal hypothesis test for Gini drift and a Murphy score decomposition that tells you whe...
31 Mar 2026

Explainable Boosting Machines for Insurance Pricing — An Interpretable Alternative That Actually Works

EBMs achieve near-XGBoost predictive performance on insurance claims data while remaining fully interpretable by design — no post-hoc SHAP required. We show the Poisson frequenc...
31 Mar 2026

Conformal Prediction with Change Points: When Your Coverage Guarantee Breaks and What to Do About It

Conformal prediction with change points (CPTC, arXiv:2509.02844, NeurIPS 2025) extends adaptive conformal inference to detect structural breaks and reset coverage guarantees pro...
31 Mar 2026

How Long Does an Inflationary Shock Last? A Gamma-Decay Model for Claims Cost Persistence

Fitting one aggregate trend to UK motor claims 2019–2024 embeds a single implicit decay rate across parts shortage, labour shortage, and social inflation — components that norma...
28 Mar 2026

Smoothing Motor Age Curves with Whittaker-Henderson Poisson: a freMTPL2 Benchmark

We fit WhittakerHendersonPoisson to driver age frequencies from 677K French MTPL policies. The Poisson smoother handles count data correctly, REML selects lambda automatically, ...
28 Mar 2026

Tweedie Regression for Insurance Pricing in Python

Why Tweedie GLM is the standard for aggregate loss modelling in insurance, with a complete Python example covering power parameter selection, exposure offset, and comparison wit...
28 Mar 2026

Survival Models for Insurance Lapse Prediction: What Actually Works

Deep learning survival models underperform Cox regression on tabular insurance data. Cure models are the real story post-GIPP. Here is what the research says and what UK pricing...
28 Mar 2026

Structural vs Cyclical Claims Inflation: How to Decompose What You're Actually Seeing

UK motor average claim costs reached a record £5,300 in Q4 2024. But applying a flat 8% trend assumption treats structural and cyclical inflation identically. They have opposite...
28 Mar 2026

Stochastic Reserving in Python: Mack and Bootstrap ODP with chainladder

How to produce a full IBNR distribution in Python using the Mack method and Bootstrap ODP sampling. Covers analytical standard errors, 5,000-simulation bootstrap, percentile tab...
28 Mar 2026

Reserving with chainladder-python Part 3: Neural Networks vs Traditional Methods

When does it make sense to reach beyond chain ladder and bootstrap ODP for neural reserving methods? We compare DeepTriangle, individual RNN approaches, and the Richman-Wüthrich...
28 Mar 2026

Motor Insurance Pricing in Python: A Complete Walkthrough

End-to-end motor insurance pricing in Python using the French MTPL dataset. Frequency-severity GLMs, exposure offsets, coefficient interpretation, validation, and calibration to...
28 Mar 2026

What PRA SS1/23 Validation Looks Like on Real Data: 677K French Motor Policies

Most governance tooling is tested on toy examples with clean DGPs and inflated Gini coefficients. We ran the full insurance-governance validation suite on 677K freMTPL2 policies...
28 Mar 2026

Loss Ratio Trending and Rate Adequacy in Python

The standard rate adequacy workflow — earned premium at current rate level, ultimate losses, trend to future period, expense load, indicated rate change — built in Python with p...
28 Mar 2026

How to Smooth Noisy Insurance Loss Curves

Raw loss ratios by age band are noisy. A 5-year moving average introduces boundary bias and requires a judgment call you cannot defend in an IFRS 17 review. This tutorial shows ...
28 Mar 2026

How to Build a GLM-Equivalent Tariff from a GBM

The GBM sits in a notebook outperforming the production GLM. This tutorial shows how to extract multiplicative rating relativities from a CatBoost Poisson model using shap-relat...
28 Mar 2026

How to Add Prediction Intervals to an Insurance Pricing Model

Point estimates from pricing models are incomplete. This tutorial shows how to add distribution-free prediction intervals to a CatBoost Tweedie model using insurance-conformal —...
28 Mar 2026

GLM Rating Factors in Python: A Practical Workflow for Insurance Pricing

Fit a Tweedie GLM to insurance data in Python, extract multiplicative rating relativities, and visualise factor curves — with working code throughout.
28 Mar 2026

Fairness-Accuracy Tradeoffs in Insurance Pricing — Pareto Frontiers with NSGA-II

Single-objective fairness constraints force a binary choice. NSGA-II finds the full tradeoff surface, so governance committees can make an explicit, documented decision about wh...
28 Mar 2026

Does Whittaker-Henderson Smoothing Actually Work for Insurance Pricing?

We benchmarked Whittaker-Henderson against raw rates and a 5-point weighted moving average on a synthetic UK motor driver age curve with known truth. W-H reduces MSE by 57.2% vs...
28 Mar 2026

Does PSI Actually Catch Pricing Model Drift?

PSI detects covariate shift but not rank collapse. On a synthetic UK motor book where a new risk factor emerges post-deployment, PSI stays GREEN while Gini drops 8 points. The B...
28 Mar 2026

Does Monotonicity-Constrained EBM Actually Work for Insurance Pricing?

On a UK motor DGP with a monotone young-driver requirement, unconstrained EBM violates monotonicity in 31% of runs. Constrained EBM matches GLM monotonicity compliance at 100% w...
28 Mar 2026

Does Bühlmann-Straub Credibility Actually Work?

We benchmarked Bühlmann-Straub credibility against raw experience and manual Z-factors on a 30-segment synthetic UK motor fleet book with a known DGP. On thin schemes, it reduce...
28 Mar 2026

Does Automatic Lambda Selection for Whittaker-Henderson Actually Work?

REML-selected lambda beats manual tuning on a 63-band age curve benchmark: 22% lower MSE on thin tail bands, zero analyst discretion, and principled credible intervals. The hone...
28 Mar 2026

Chain Ladder Reserving in Python: A Practical Tutorial with chainladder

Build loss development triangles, calculate IBNR reserves, and plot development patterns using Python and the chainladder library.
28 Mar 2026

BonusMalus Is Endogenous: DML on 677K French Motor Policies

BonusMalus is built from past claims — a naive regression conflates the causal effect with selection. We ran Double Machine Learning on 677K freMTPL2 policies to isolate what Bo...
28 Mar 2026

Capital Modelling Basics in Python: Aggregate Distributions and Monte Carlo SCR

Build a working Solvency II SCR estimate from scratch using compound distributions and Monte Carlo simulation. Poisson/NegBin frequency, lognormal severity, 50k simulations, VaR...
28 Mar 2026

The Burning Cost Method for Excess of Loss Reinsurance Pricing in Python

A practical Python walkthrough of the burning cost method for pricing excess of loss reinsurance treaties — loss trending, development, pure rate calculation, and sensitivity an...
28 Mar 2026

Bühlmann-Straub on freMTPL2: What Regional Credibility Actually Looks Like

We ran Bühlmann-Straub credibility on the freMTPL2freq dataset — 677K French MTPL policies, 22 regions — and quantified how much thin regions get pulled toward the portfolio mea...
28 Mar 2026

AutoML for Insurance Pricing — Does It Actually Work?

H2O, FLAML, and AutoGluon are genuinely useful tools. None of them handle the log(exposure) offset that makes insurance frequency modelling work. Here is an honest account of wh...
26 Mar 2026

Building a Tweedie GLM for Insurance Pricing in Python

A complete Python tutorial for building a Tweedie GLM for insurance pricing: synthetic motor data, statsmodels, exposure offset, interpreting the p parameter, residual diagnosti...
26 Mar 2026

The PtU Reserving Algorithm in Python: Filling the Gap Left by Richman-Wüthrich

Richman-Wüthrich's one-shot PtU reserving paper (arXiv:2603.11660) ships with R code only. We map the algorithm to Python, explain the censored-claims exposure mechanism that ma...
26 Mar 2026

Model Value in Pounds: Translating Gini Improvement to Loss Ratio

The LRE metric converts Pearson correlation improvement into expected loss ratio change in basis points. Here is how to use it, and where it breaks.
25 Mar 2026

UK Motor Is Going Loss-Making Again — Here's the Data

EY forecasts a 111% net combined ratio for UK motor in 2026. WTW documents a 13% annual premium fall. Here is what the data shows and what pricing teams should do about it.
25 Mar 2026

Severity Interactions in Gamma GLMs: Weaker Signal, Higher Bar

Applying CANN + NID to severity (Gamma) GLMs. Why the signal is weaker than frequency, what configuration changes are needed, and when a severity interaction is worth adding.
25 Mar 2026

Physical Climate Risk in UK Home Insurance Pricing

How UK home insurers should model physical climate risk: UKCP18 projections, Flood Re's 2039 exit, ABI claims data, and practical code using insurance-whittaker, insurance-confo...
25 Mar 2026

Pet Insurance Pricing in 2026: Why the FCA Is Watching

The FCA has explicitly flagged pet insurance for monitoring in its 2026 regulatory priorities. FOS complaint upheld rates hit 52% in Q1 2025 — the highest of any UKGI business l...
25 Mar 2026

The Motor Pricing Floor — How to Know When You've Stopped Burning

How to detect when a motor book has hit the floor of its underwriting cycle — using PSI on new business mix, segment-level A/E, Gini stability, and mSPRT to know when the next m...
25 Mar 2026

Our 12-Module Insurance Pricing Course Is Now Free and Open Source

Modern Insurance Pricing with Python and Databricks - all 12 modules, free, on GitHub. GLMs through causal elasticity, fairness auditing, spatial BYM2 territory models, and mode...
25 Mar 2026

Measuring Rate Change Impact with DiD and ITS in Python

A hands-on tutorial for the RateChangeEvaluator in insurance-causal v0.6.0. DiD when you have a control group. ITS when you don't. Real code, real API.
25 Mar 2026

How to Quantify What a Model Improvement Is Worth in Pounds

A 5pp Gini improvement means nothing to a CFO. The Loss Ratio Error framework from arXiv:2512.03242 converts model correlation into expected loss ratio — and from there into pou...
25 Mar 2026

How to Build a Double-Lift Chart in Python

Build a double-lift chart to compare GLM vs GBM predictions. Bin by prediction ratio, compute A/E per decile, plot with matplotlib. Standard tool for pricing committee model val...
25 Mar 2026

Detecting Model Decay: Gini Drift Testing with Statistical Power

The first Python implementation of the asymptotic Gini drift test from Brauer et al. (2025). A proper z-test for ranking degradation — not a heuristic, not a threshold, a p-value.
25 Mar 2026

What FCA EP25/2 Actually Shows: Cost-Push, Not Regulation, Drove UK Insurance Inflation

FCA EP25/2 published July 2025. Expected claim costs per home policy up 49% from £92 to £138. Average inception premium up only 5%. The data says insurers absorbed the shock — n...
25 Mar 2026

What Can I Change to Lower My Premium? The Consumer Duty Obligation Most Pricing Teams Are Ignoring

FCA Consumer Duty PRIN 2A requires insurers to tell policyholders what they can change to get a better outcome. Most pricing teams have not built this. insurance-recourse does i...
25 Mar 2026

Discrimination-Insensitive Pricing: Beyond Removing the Protected Variable

insurance-fairness v0.6.3 ships DiscriminationInsensitiveReweighter. Here's why dropping the protected column doesn't work, how propensity-based reweighting does, and what the A...
25 Mar 2026

Consumer Duty Fair Value Evidencing: A 12-Step Technical Checklist for Pricing Actuaries (2026)

EP25/2 (the FCA's evaluation of GIPP price-walking remedies) flags ongoing fair value supervision in motor and home. No single technical checklist exists for the pricing actuary...
25 Mar 2026

Commercial Lines Pricing with Python

Fleet motor, property, and liability pricing in Python. Covers Bühlmann-Straub credibility for fleet schemes, GPD large loss loading, MBBEFD ILF tables, and PSI drift detection ...
25 Mar 2026

Causal AI for Pricing Actuaries: A Practical Guide

Why GLM coefficients aren't causal effects, and how to fix that using insurance-causal: DML with CatBoost nuisances, causal forests for heterogeneous treatment effects, and DiD/...
25 Mar 2026

CANN: The Combined Actuarial Neural Network in Python

A clean Python tutorial for the most-cited neural network architecture in actuarial pricing: the Combined Actuarial Neural Network (Schelldorfer & Wüthrich, 2019). Architecture,...
24 Mar 2026

Zero-Inflated and Hurdle Models for Insurance Claims

Standard Tweedie GLMs handle zeros implicitly. When that implicit handling breaks — specialty lines, niche segments, specific peril models — you need ZIP or hurdle models. Here ...
24 Mar 2026

Walk-Forward Cross-Validation for Insurance GLMs in Python

How to implement walk-forward cross-validation for insurance GLMs in Python using insurance-cv. Covers IBNR buffers, fold design, and a full worked example on freMTPL2-style mot...
24 Mar 2026

Tweedie vs Frequency-Severity Split: When to Use Which

The Tweedie GLM and frequency-severity split both model pure premium. They are not interchangeable. Here is how to decide which one you actually need.
24 Mar 2026

The Python Insurance Pricing Benchmark: GLM vs XGBoost vs CatBoost vs LightGBM on freMTPL2

Definitive Python benchmark: Poisson GLM vs XGBoost vs CatBoost vs LightGBM for insurance frequency modelling on freMTPL2. Poisson deviance, Gini coefficient, and A/E calibratio...
24 Mar 2026

Ogden Rate and PPOs: Pricing Large Bodily Injury in Python

How the Ogden discount rate and Periodical Payment Orders change the maths of large BI pricing in the UK — with Python code to calculate lump sum equivalents, discount PPO cash ...
24 Mar 2026

Modelling Social Inflation in UK Motor Severity: A Python Approach

UK motor bodily injury severity has outrun CPI since 2022. This post implements a multiplicative severity separation model and Whittaker-Henderson smoothing in Python to separat...
24 Mar 2026

Migrating from Emblem to Python: What Actually Changes

Migrating from Emblem to Python for insurance GLM pricing: what changes in workflow, what gets easier, what gets harder, and what the transition actually looks like in practice.
24 Mar 2026

How to Export a Factor Table to Excel in Python

Step-by-step: extract CatBoost factor tables with shap-relativities and write a clean Excel file with openpyxl. Formatted output ready to paste into Radar or Emblem.
24 Mar 2026

GLM, GAM, and GBM for UK Motor Pricing in Python

The Python equivalent of the IFoA MLR Working Party's R tutorial: Poisson GLM baseline, EBM GAM, and CatBoost GBM on UK motor data, with the full pipeline from data to governance.
24 Mar 2026

Fitting a Motor Insurance GLM in Python: Poisson Frequency and Gamma Severity with statsmodels

A practical statsmodels tutorial for pricing actuaries: Poisson frequency model with exposure offset, Gamma severity model, overdispersion tests, factor table extraction, and A/...
24 Mar 2026

Does insurance-gam actually work for insurance pricing?

Benchmark results on a known-DGP synthetic UK motor book. EBM beats the GLM by 12.6 Gini points (0.455 vs 0.329). But the deviance number is misleading. We explain why, and when...
24 Mar 2026

Does HMM telematics scoring actually work for insurance pricing?

Benchmark results on a known-DGP synthetic UK motor fleet. HMM state fractions deliver 5–10pp Gini lift over simple aggregates. State classification recovers >50% of true high-r...
24 Mar 2026

Does GBM-to-GLM Distillation Actually Work for Insurance Pricing?

Honest benchmark: does fitting a surrogate GLM on CatBoost pseudo-predictions recover more discriminatory power than a direct GLM? We test it on 30,000 synthetic UK motor policies.
24 Mar 2026

Claims Inflation Decomposition: Taylor Two-Factor Separation in Python

Extract the calendar-year inflation component from a claims development triangle using Taylor's two-factor separation. Python from scratch, then connect to severity trending.
24 Mar 2026

Claims Inflation Adjustment in Pricing Models: Beyond CPI

CPI-adjusting your historical claims data before fitting a pricing model introduces systematic bias. How to apply line-specific inflation indices for motor and home insurance in...
23 Mar 2026

Python vs R for Actuarial Pricing: A Practical Comparison

A practical comparison of Python and R for UK personal lines insurance pricing — data wrangling, GLMs, GBMs, deployment, and Databricks. Honest about where R still wins.
23 Mar 2026

One-Way Analysis in Python: From Scratch to Production

One-way analysis in Python for pricing actuaries: pandas from scratch, credibility-weighted confidence intervals, thin cell handling, GBM shortcuts.
23 Mar 2026

How to Reproduce an Emblem GLM in Python

Reproduce an Emblem frequency-severity GLM in Python: factor tables, one-way plots, deviance residuals, and lift charts using statsmodels, CatBoost, and Polars.
23 Mar 2026

GLM Assumptions in Insurance Pricing: What Actually Matters

Which GLM assumptions actually matter for insurance pricing, which ones you routinely violate without consequence, and the diagnostics worth running before signing off a product...
23 Mar 2026

Getting Started: Three Libraries, One Workflow

A practical walkthrough for pricing analysts: use insurance-causal for causal inference, insurance-conformal for prediction intervals, and insurance-monitoring for drift detecti...
23 Mar 2026

Exposure-Weighted Gini Coefficient in Python

Exposure-weighted Gini for insurance pricing: correct formula, Python implementation, and why ignoring exposure distorts motor model governance.
23 Mar 2026

Does Whittaker-Henderson smoothing actually work for insurance pricing?

Benchmark results on a known-DGP synthetic UK motor age curve. REML recovers the true frequency well in the data-rich middle. The tails are a different story. Numbers, not claims.
23 Mar 2026

Does automated model monitoring actually work for insurance pricing?

Aggregate A/E at 0.94 looks fine. The model has been mispricing under-25s for eight months. Benchmark results on a synthetic UK motor book with three planted failure modes.
23 Mar 2026

Does Bühlmann-Straub credibility actually work for insurance pricing?

Benchmark results on 100 synthetic schemes with known true loss rates. Credibility blending reduces MSE by 25–35% vs the best naive alternative. Numbers, not theory.
23 Mar 2026

How to Build a Burning Cost Model for Insurance Pricing in Python

Build a burning cost model in Python: frequency-severity split, exposure offsets, large loss capping, IBNR adjustment, and combined pure premium for UK pricing.
22 Mar 2026

scikit-learn TweedieRegressor vs insurance-distributional: Why Fixed Tweedie Isn't Enough for Insurance Pricing

sklearn's TweedieRegressor is a well-engineered GLM. It fits a fixed-power Tweedie model correctly. The problem is that insurance pricing needs per-risk variance, not a single p...
22 Mar 2026

Python Insurance Pricing Cookbook: 20 Recipes for Common Tasks

20 short code recipes for common insurance pricing tasks. Each recipe uses a real API from one of our open-source libraries. Copy and adapt.
22 Mar 2026

Insurance Model Monitoring in Python: Gini, A/E, and Double-Lift

Tutorial on monitoring insurance pricing models using actuarial KPIs. Gini tracking, segmented A/E, double-lift for champion/challenger. Why generic drift tools miss what matters.
22 Mar 2026

GLMs for UK Insurance Pricing in Python: What the Generic Tutorials Miss

GLM insurance Python for UK pricing actuaries: exposure handling, Consumer Duty, frequency-severity split, and Emblem/Radar deployment with glum.
22 Mar 2026

CatBoost for Insurance Pricing: Frequency-Severity on freMTPL2

Build a CatBoost frequency-severity pricing model on freMTPL2 using Polars. Poisson frequency, Gamma severity, combined burning cost, SHAP factor extraction, and distillation to...
21 Mar 2026

Why k-Fold CV Is Wrong for Insurance and What to Do Instead

Insurance walk-forward cross-validation prevents the look-ahead bias that makes standard k-fold results useless for prospective evaluation. Complete Python example with insuranc...
21 Mar 2026

Tweedie Regression for Insurance: What sklearn Doesn't Tell You About Exposure

sklearn's TweedieRegressor tutorial gets you to a fitted model in six lines. It also produces predictions that are wrong for any policy with non-annual exposure. Here is the cor...
21 Mar 2026

Insurance Model Monitoring in Python: Beyond Generic Data Drift

Insurance model monitoring in Python that understands exposure weighting, development lags, and Gini drift. Why Evidently and NannyML miss what matters for pricing, and what ins...
20 Mar 2026

Censoring-Corrected GPD Fitting for Capped Claims: Unbiased Tail Index Estimation Under Policy Limits

Standard GPD fitting is biased when claims are capped by policy limits. Most actuaries know this and do it anyway. insurance-severity v0.2.0 fixes it.
20 Mar 2026

FCA Consumer Duty Pricing Fairness in Python

The FCA expects pricing teams to demonstrate their models don't proxy-discriminate under Consumer Duty. Most teams do this in Excel. Here is how to do it properly in Python, usi...
18 Mar 2026

Whittaker-Henderson Smoothing for Rating Tables: The Penalised Least-Squares Method UK Actuaries Should Already Be Using

Every UK pricing actuary smooths experience tables. Most do it with a 5-point moving average or a polynomial fitted by eye.
17 Mar 2026

Bühlmann-Straub Treats Last Year the Same as Five Years Ago

Static credibility weights all years equally. The dynamic Poisson-gamma state-space model weights recent experience more - and quantifies how much more.
15 Mar 2026

Monthly Covariate Shift Monitoring: When to Reweight and When to Retrain

How to run covariate shift detection as a recurring monthly check: monitoring cadence, ESS ratio trends, and the thresholds that trigger a retraining...
15 Mar 2026

Adaptive Conformal Inference for Non-Exchangeable Claims Series: Handling Trend Without Retraining

Standard split conformal prediction requires exchangeability - a condition insurance claims time series systematically violate.
15 Mar 2026

Debiasing Price Elasticity Estimates with Double Machine Learning: Removing the Risk Model's Fingerprint

OLS elasticity in formula-rated books is contaminated by your own risk model. insurance-causal fixes this with CausalForestDML and CatBoost nuisance.
14 Mar 2026

Optimal Binning for GLM Rating Factors: Beyond the Eyeball Test

Automated GLM factor banding for UK insurance pricing: R2VF fused lasso, neural embeddings for high-cardinality categoricals, SKATER spatial clustering.
14 Mar 2026

Per-Segment Large Loss Loading with Quantile GBMs: TVaR and ILFs at Risk Level

December is the season for year-end rate reviews where someone adds a flat 8% large loss loading to every segment regardless of tail weight.
14 Mar 2026

The Python Insurance Pricing Stack: 35 Libraries for Everything Emblem Can't Do

35 open-source Python libraries for UK insurance pricing: GBM-to-GLM distillation, causal inference, FCA fairness auditing, rate optimisation, PRA SS1/23.
14 Mar 2026

One Package, One Install: PRA SS1/23 Validation and MRM Governance Unified

insurance-governance merges insurance-validation and insurance-mrm. PRA SS1/23 statistical validation and MRM governance in one install - no version conflicts.
13 Mar 2026

Credibility-Weighted Broker and Scheme Effects with REML

Two-stage CatBoost plus REML random effects for UK insurance broker adjustments. insurance-multilevel - Buhlmann-Straub credibility weighting, not guesswork.
13 Mar 2026

Foundation Models for Thin Segments: TabPFN and TabICLv2 in Insurance Pricing

TabPFN and TabICLv2 for thin-segment UK insurance pricing. In-context learning at inference, no gradient descent. insurance-thin-data wraps both for actuaries.
13 Mar 2026

Individual Experience Rating Beyond NCD: From Bühlmann-Straub to Neural Credibility

Four-tier experience rating in Python: Buhlmann-Straub, Poisson-Gamma state-space, GBM surrogate, attention credibility. Policy-level multiplicative factors.
13 Mar 2026

Mixture Cure Models for Retention Pricing: Separating Structural Non-Lapsers from the At-Risk Book

Logistic regression treats all non-lapsers the same. Mixture cure models split them into two groups: structural non-lapsers who will never leave, and...
12 Mar 2026

Building a Modern Insurance Pricing Pipeline in Python

Complete UK insurance pricing pipeline in Python: CatBoost GLM distillation, causal inference, FCA fairness auditing, rate optimisation, PRA SS1/23 governance.
12 Mar 2026

GARCH for Claims Inflation: Modelling Volatility That Clusters

GARCH for UK insurance claims inflation: time-varying variance in trend analysis. insurance-garch - Engle (1982) applied to actuarial trend and pricing models.
10 Mar 2026

When You Can't Fit a GLM from Scratch: Transfer Learning for Thin Segments

GLMTransfer borrows statistical strength from a related source book to price thin target segments. Motor-to-fleet, home-to-landlord, and fleet roll-outs.
10 Mar 2026

GAMLSS in Python, Finally

GAMLSS in Python: seven families, RS algorithm, variance as function of covariates. insurance-distributional-glm - the actuarial implementation Python lacked.
09 Mar 2026

Whittaker-Henderson Smoothing for Insurance Pricing

Whittaker-Henderson smoothing for noisy experience rating tables in Python. REML lambda selection, Bayesian confidence intervals, 2D surface smoothing.
09 Mar 2026

Getting Spatial Territory Factors Into Production

From CatBoost frequency model to BYM2 spatial territory factors for Emblem or Radar. Data engineering, MCMC convergence checks, Polars joins - Python.
09 Mar 2026

EBMs for Insurance Pricing: Better Than a GLM, Readable by a Pricing Committee

insurance-gam wraps EBM for UK pricing teams: Poisson/Tweedie loss, exposure offsets, RelativitiesTable, MonotonicityEditor, GLM comparison diagnostics.
08 Mar 2026

How Do You Know Your Sigma Model Is Working?

Three diagnostics prove a GAMLSS sigma submodel is real: quantile residuals, worm plots, split-sample calibration. From insurance-distributional-glm.
07 Mar 2026

Quantile GBMs for Insurance: TVaR, ILFs, and Large Loss Loadings

CatBoost MultiQuantile plus actuarial output layer: TVaR, ILFs, large loss loadings, exceedance probabilities for UK insurance pricing. insurance-quantile.
05 Mar 2026

Distributional GBMs for Insurance: Pricing Variance, Not Just the Mean

insurance-distributional models the full conditional loss distribution, not just the mean. First open-source Python implementation of the ASTIN 2024 Best Paper.
04 Mar 2026

How to Build a Large Loss Loading Model for Home Insurance

Per-risk large loss loadings for UK home insurance using quantile GBMs. Avoids the flat-loading trap by making the loading a function of the risk itself.
04 Mar 2026

GLM Interaction Detection: A Six-Step Walkthrough with CANN, NID, and SHAP

Step-by-step tutorial: plant two interactions in synthetic motor data, detect them with CANN + NID, validate with SHAP, confirm with A/E surfaces, and...
02 Mar 2026

How to Extract GLM-Style Rating Factors from a CatBoost Model

Step-by-step: extract multiplicative CatBoost rating factors using shap-relativities. SHAP decomposition to GLM-format exp(beta) tables with CI and...
01 Mar 2026

From CatBoost to Radar in 50 Lines of Python

Python library distilling CatBoost GBMs into multiplicative GLM factor tables for Radar and Emblem. Open-source GBM-to-GLM distillation for UK pricing teams.
28 Feb 2026

When Credibility Meets CatBoost: Choosing Between Classical and Modern Approaches

Bühlmann-Straub vs CatBoost vs two-stage multilevel for UK motor pricing: when each wins and how insurance-credibility and insurance-multilevel combine them.
27 Feb 2026

Finding the Interactions Your GLM Missed

Automated interaction search for UK motor GLMs using CANN residuals and NID. Bonferroni-corrected shortlist before manual testing - insurance-interactions.
27 Feb 2026

Experience Rating: NCD and Bonus-Malus

Python library for NCD and bonus-malus in UK motor insurance. Optimal claiming thresholds peak at 20% NCD discount, not 65% - derived mathematically.
21 Feb 2026

From GBM to Radar: A Complete Databricks Workflow for Pricing Actuaries

Databricks workflow for UK pricing actuaries: CatBoost plus MLflow tracking, SHAP relativities, and Radar export. End-to-end motor pricing in Python.
17 Feb 2026

SHAP Relativities for Insurance GBMs: GLM-Format Factor Tables in Python

How to extract SHAP relativities from insurance GBMs. Multiplicative factor tables in GLM exp(beta) format, with confidence intervals and exposure weighting. Python, CatBoost, U...
28 Jul 2025

Telematics Risk Scoring: From Raw Trips to GLM Features

How to convert raw telematics trip data into GLM-ready features for UK motor pricing. Covers HMM state segmentation and score calibration to GLM relativities.
13 Jul 2025

Your Model Validation Is a Checklist, Not a Test

PRA SS1/23 requires quantitative pass/fail tests, not narrative. insurance-governance automates the full validation suite and generates auditable HTML reports.
14 May 2025

Your Frequency-Severity Independence Assumption Is Costing You Premium

Your frequency GLM and severity GLM are both correct. Multiplying them is not. How to test and correct for the dependence your pricing model ignores.
15 Mar 2025

Spliced Severity Distributions: When One Distribution Isn't Enough

A practitioner tutorial on fitting spliced composite severity distributions for UK motor claims using insurance-severity.