Techniques

362 articles in this topic

05 Apr 2026

Repricing at the floor: how to find headroom in UK motor without triggering Consumer Duty

UK motor NCR is at 111% (EY Q4 2024). The market is at or near technical floor. Here is how to identify underpriced segments using loss cost trending, A/E monitoring, and Bühlma...
05 Apr 2026

Conformal Prediction Passes 90% Coverage, But Your Young Drivers Get 82%: ConditionalCoverageAssessor Fixes That

insurance-conformal v1.3.1 adds ConditionalCoverageAssessor — a tool for detecting and decomposing conditional coverage failures in conformal prediction intervals. Here is the p...
04 Apr 2026

Whittaker-Henderson Smoothing in Python: A Step-by-Step Tutorial for Insurance Rating Tables

A complete Python tutorial for Whittaker-Henderson smoothing of insurance rating tables. Replace your Excel moving average or SAS graduation with automatic REML lambda selection...
04 Apr 2026

What Postcode Can't Tell You: Flood Pricing Without a JBA Budget

You don't need a JBA licence to build a materially better flood model. A French study on 968,000 policies shows which open data sources actually move the needle — and the answer...
04 Apr 2026

Telematics Pricing in Python: From Raw Trip Data to a GLM in 30 Minutes

A practical Python tutorial for telematics pricing: load raw GPS trip data, classify driving regimes with a Hidden Markov Model, and produce GLM-ready risk features using insura...
04 Apr 2026

Survival Analysis for Insurance Lapse Modelling in Python: A Complete Tutorial

A hands-on Python tutorial for insurance pricing analysts on survival analysis and lapse modelling. Covers Kaplan-Meier, Weibull AFT, mixture cure models, customer lifetime valu...
04 Apr 2026

When Your Distributional Regression Has Too Many Parameters

Regression by composition — the framework that generalises GAMLSS and transformation models — suffers from a subtle non-identifiability when you stack multiple flows. Kadhem (20...
04 Apr 2026

Conformal Risk Control Assumes Your Loss Decreases With Interval Width. Most Insurance Losses Don't.

Conformal risk control (Angelopoulos et al. ICLR 2024) requires monotone loss functions for its finite-sample guarantees. The Winkler score, two-sided regulatory tests, and capi...
04 Apr 2026

Is Your Model Improvement Worth Building? The Loss Ratio Error Framework

C. Evans Hedges (Lemonade, December 2025) derives the first closed-form formula connecting model discrimination to expected loss ratio. LRE translates a correlation improvement ...
04 Apr 2026

glum Insurance Pricing in Python: Fitting, Intervals, Monitoring, Fairness, Governance

glum fits the Tweedie GLM in seconds. Here is how our libraries handle everything around it: distribution-free prediction intervals, PSI/CSI drift monitoring, Consumer Duty prox...
04 Apr 2026

GAM Insurance Pricing in Python: An EBM Tariff Tutorial with insurance-gam

A hands-on tutorial on GAM insurance pricing in Python using the insurance-gam library. Covers EBM tariff construction, shape function extraction, GLM comparison, Shapley values...
04 Apr 2026

Five Libraries, One Pipeline: End-to-End Motor Pricing in Python

A single freMTPL2 motor pipeline running through insurance-gam, insurance-conformal, insurance-monitoring, insurance-fairness, and insurance-governance. No other open-source eco...
04 Apr 2026

Credibility Theory in Python: A Complete Buhlmann-Straub Tutorial for Insurance Pricing

A practical Python tutorial on credibility theory for insurance pricing analysts. Covers Buhlmann-Straub model, the insurance-credibility library, UK motor example, GLM integrat...
04 Apr 2026

Conformal Prediction for Insurance Python: A Frequency-Severity Tutorial

A step-by-step tutorial on conformal prediction for insurance Python models, specifically the frequency-severity decomposition. Covers the calibration subtlety that breaks naive...
04 Apr 2026

When Empirical Bayes Goes Wrong: Lessons From Conformal Inference for Insurance Credibility

Seo and Lim (arXiv:2604.01629, April 2026) show that standard empirical Bayes methods inflate FDR to 0.25–0.35 when the prior is misspecified — even when the nominal rate is 0.1...
04 Apr 2026

Causal Inference for Insurance Pricing in Python: A Complete DML Tutorial

A hands-on tutorial on causal inference for insurance pricing in Python using the insurance-causal library. Covers double machine learning (DML), CatBoost nuisance models, ATE/C...
04 Apr 2026

Confidence Intervals for EBM Shape Functions — Boulevard Regularisation for Insurance Pricing

Boulevard regularisation turns EBM shape functions into kernel ridge regression estimates, giving valid CLT-based confidence intervals. We show why this matters for model govern...
04 Apr 2026

When Does Your Conformal Model Break? Monitoring Coverage Without False Alarms

You deploy a conformal pricing model and check its coverage every month. After twelve months, you see a coverage failure. Is the model broken, or did you just run twelve tests? ...
04 Apr 2026

Amortized Bayesian Credibility: Neural Posteriors Without the MCMC Wait

Habermann et al. (ICLR 2025, arXiv:2408.13230) train a neural network to approximate posteriors for hierarchical Bayesian models — once. After training, any new segment gets a f...
03 Apr 2026

Your Thin-Data Quantile Estimates Are Overfit. Here Is the Fix.

Standard quantile regression overfits badly on small insurance segments. A new closed-form result from Zhang, Mao and Wang (2026) gives a distributionally robust QR estimator wi...
03 Apr 2026

Reserve Models Have a Calibration Problem Nobody Is Testing

Frequency model monitoring has a growing toolkit. Reserve models — which target specific quantiles of the loss distribution, not the mean — have almost nothing. ScoreDecompositi...
03 Apr 2026

Quantile Premium Principle: What the NAAJ 2025 Benchmark Gets Right (and What It Missed)

Zanzouri et al. (NAAJ 2025) benchmark four ML severity models inside the QPP framework. The tau adjustment is elegant. CatBoost was missing from the comparison. Our library alre...
03 Apr 2026

PyINLA: INLA Finally Comes to Python. Here's What Pricing Teams Should Know.

INLA — the method that made Bayesian spatial GLMMs tractable in R — now has a proper Python package. 92–278x faster than PyMC, no R dependency. We explain what it does, why pric...
03 Apr 2026

Post-Selection Inference Fixes Your Frequency Model. Your Severity Model Is Still Broken.

Shen et al. (arXiv:2603.24875) gives valid CIs for Lasso-selected Poisson GLMs. That fixes your frequency model. UK motor pricing is Poisson × Gamma — and the Gamma severity mod...
03 Apr 2026

Post-Selection GLM Inference Is Now Usable in Python

insurance-gam v0.3.0 ships PostSelectionGLM and DataSplitPostSelectionGLM: two classes that produce valid confidence intervals for Poisson frequency models after Lasso variable ...
03 Apr 2026

PAVA in Three Places: Isotonic Regression for Insurance Pricing

The Pool Adjacent Violators Algorithm solves an O(N) monotonicity problem with no parametric assumptions. It appears in three distinct insurance pricing contexts: as the link fu...
03 Apr 2026

Offset vs Ratio Exposure in Tweedie GLMs: When It Matters (and When It Doesn't)

Boucher & Coulibaly (arXiv:2502.11788) prove that offset and ratio exposure handling are equivalent for Poisson frequency models — but diverge for Tweedie pure-premium GLMs, whe...
03 Apr 2026

Validating a Mixture Severity Model: When NE-GMM Earns Its Keep and When GammaGBM Still Wins

NeuralGaussianMixture is now in insurance-distributional v0.4.0. The question is not whether it can fit bimodal severity — it can. The question is whether your data actually nee...
03 Apr 2026

NeuralGaussianMixture vs the insurance-distributional Stack: When to Use What

insurance-distributional now has five distributional model classes. NeuralGaussianMixture is the newest and the most demanding. A routing guide: which model for which problem, a...
03 Apr 2026

Why NLL Fails to Train Mixture Severity Models: The Engineering Fix in NE-GMM

Negative log-likelihood is a proper scoring rule. So why does NLL training collapse a Mixture Density Network to a single component? The answer is in the loss surface geometry, ...
03 Apr 2026

NCD Underreporting Has a Second Problem You Are Probably Missing

The hunger-for-bonus effect biases your NCD frequency relativities. It also biases your severity model. The two errors partially offset each other — but the combined underpricin...
03 Apr 2026

Your Joint Prediction Sets Are 20–40% Too Wide

The Bonferroni correction for joint frequency-severity prediction sets is conservative by construction. Braun et al. (arXiv:2507.20941) show that covariance whitening produces e...
03 Apr 2026

Competitor Quotes Are Risk Data. We Just Pretend They Are Not.

UK personal lines generates hundreds of millions of competitor quotes per year. The industry treats them as competitive positioning data. They are, in fact, risk calibration dat...
03 Apr 2026

From Competitor Quotes to Risk Parameters: Implementing Market-Based Ratemaking in Python

A working Python implementation of Goffard, Piette & Peters (2025) ABC-SMC market-based ratemaking. Forty lines of vectorised numpy, PAVA via scipy, loss ratio corridor tuning, ...
03 Apr 2026

KL Fairness Corrections and Multiplicative GLMs: The Production Deployment Problem

Miao & Pesenti's KL discrimination-insensitive result is theoretically clean. Deploying it in a production GLM-based pricing system is not. The paper is silent on how to extract...
03 Apr 2026

The Interest Rate Effect Nobody Put in Their NCD Model

Holtan (2001) showed that the NCD reporting threshold falls when interest rates rise — the NPV of future premium penalties shrinks, so policyholders become more willing to claim...
03 Apr 2026

Your Hurdle-Poisson Model Might Say Accidents Make Drivers Safer

Lee et al. (arXiv:2602.02398) prove that standard hurdle-Poisson models with bivariate normal random effects can violate credibility order — your frequency estimate goes down af...
03 Apr 2026

Your Peril GLMs Don't Add Up — MinTrace Reconciliation for Insurance Pricing

Separate peril GLMs routinely disagree with each other by 3–8% at cover level. insurance-reconcile applies premium-weighted MinTrace to make your hierarchy coherent without disc...
03 Apr 2026

GAMformer: A Genuinely Clever Idea That Cannot Help You Today

GAMformer produces GAM shape functions in a single forward pass, no hyperparameter search, sub-second inference. For insurance pricing today: three hard blockers — max 500 rows,...
03 Apr 2026

Five Ways Insurance Fraud ML Papers Mislead You (And How to Spot Them)

Insurance fraud ML papers routinely overstate their results through five avoidable errors: wrong evaluation metric, no external baseline, random train/test split, foreign data, ...
03 Apr 2026

Your Monitoring Thresholds Are Made Up

PSI > 0.2 and A/E > 1.15 are industry folklore, not statistics. Conformal SPC replaces them with calibrated p-values that have a finite-sample false alarm rate guarantee, no nor...
03 Apr 2026

Causal SHAP: Fixing Correlated Feature Attribution — and Why It Is Harder Than It Looks for Pricing

IJCNN 2025 paper arXiv:2509.00846 introduces Causal SHAP: it uses causal discovery to estimate a DAG, then computes SHAP values that respect causal structure. The correlated-fea...
03 Apr 2026

Bayesian Nonparametric Severity: Fitting Heavy Tails Without Choosing a Threshold

The hardest part of fitting a GPD is picking the threshold. A new Bayesian nonparametric approach eliminates the choice entirely — and tells you what fraction of your book has i...
03 Apr 2026

Threshold-Free Heavy-Tail Severity: A Genuinely Novel Idea That Isn't Ready for Production

Nieto-Barajas (arXiv:2602.07228) proposes a Bayesian nonparametric mixture of Shifted Gamma-Gamma distributions that eliminates EVT threshold selection entirely and produces a p...
03 Apr 2026

Bayesian Doubly Robust Causal Inference: What Entropic Tilting Adds and Why We Are Watching, Not Building

Orihara, Momozaki & Sugasawa (arXiv:2506.04868) produce a Bayesian posterior over the ATE by tilting the product of independent posteriors to satisfy the DR moment condition. We...
02 Apr 2026

Two-Hump Distributions: Why Your Severity Model Gets the 95th Percentile Wrong

UK motor bodily injury severity is structurally bimodal. A GammaGBM fits one mode between two humps, understating the 95th percentile by 30-40%. NeuralGaussianMixture fixes this...
02 Apr 2026

The Survival Treatment Effect Your Retention Model Is Not Computing

If you have called RetentionUpliftModel with outcome='survival', your model silently ran as binary. There is no pip-installable Python package for survival CATE. We explain the ...
02 Apr 2026

Stop Picking One Mortality Model: Shapley Values Tell You How Much Each Actually Contributes

Bimonte et al. show that age-specific Shapley weights across 15 mortality models outperform any single model at 10-20 year horizons — exactly the range that matters for annuity ...
02 Apr 2026

Your SHAP Bar Chart Has No Error Bars

SHAPInference (in development for shap-relativities): asymptotically valid confidence intervals on global SHAP feature importance using de-biased U-statistics. Every SHAP import...
02 Apr 2026

When Reserves Learn from Experience: What RL-Based Reserving Means for Your Pricing Loss Ratios

Avanzi, Richman, Wüthrich et al. (arXiv:2601.07637) treat individual claim development as a Markov decision process, using Soft Actor-Critic to revise outstanding claim liabilit...
02 Apr 2026

Quantum Cat Pricing: Impressive Physics, Wrong Bottleneck

Kirke (arXiv:2603.15664) applies Quantum Amplitude Estimation to catastrophe insurance tail-risk pricing and claims quadratic speedup over classical Monte Carlo. The maths is re...
02 Apr 2026

Pricing Fairly Without Seeing the Data: Local Differential Privacy for Insurance

insurance-fairness v1.2.0 adds PrivatizedFairPricer: discrimination-free pricing when the sensitive attribute is privatised via local differential privacy. Based on Zhang, Liu &...
02 Apr 2026

When Burr XII Isn't Fat Enough — The PowerBurr Family for Large-Loss Severity Modelling

Burr XII's body and tail are controlled by the same parameters — you can't fix one without breaking the other. Liu & Meng's PowerBurr adds a fourth parameter that decouples them...
02 Apr 2026

Doubly Robust IBNR: The Middle Ground Between Chain-Ladder and Micro-Level Reserving

PopulationSamplingReserve lands in insurance-severity v0.4.0. IBNR as a missing-data problem: the AIPW doubly-robust estimator hedges your bets between the chain-ladder's aggreg...
02 Apr 2026

Beyond Weibull: When Does the Shape of Your Hazard Function Actually Matter?

A new paper models hazard functions as solutions to nonlinear ODEs, producing shapes no standard parametric family can match. The maths is genuinely interesting. The absence of ...
02 Apr 2026

Energy Score-Guided Mixture Models: What the NE-GMM Paper Actually Contributes

Yang et al. (arXiv:2603.27672) fix mode collapse in Mixture Density Networks by adding an analytic Energy Score term to the training objective. The contribution is real and spec...
02 Apr 2026

Can LLMs Pass Their Insurance Exams? The Wrong Question for Pricing Teams

Beauchemin & Khoury (arXiv:2603.07825) benchmark 51 LLMs on Quebec insurance regulatory certification questions. Passing insurance exams is the wrong success metric for pricing ...
02 Apr 2026

P2P Insurance Has a Theory Problem. This Paper Fixes It.

insurance-optimise v0.6.0 adds LinearRiskSharingPool: Cramér-Lundberg ruin analysis for community-based risk pools, based on Denuit, Flores-Contró & Robert (arXiv:2603.29530). W...
02 Apr 2026

Locally Adaptive Score Selection for Conformal Intervals: insurance-conformal v1.2.0

insurance-conformal v1.2.0 adds LCPModelSelector: locally adaptive conformal model and score selection that gives per-prediction tighter intervals while maintaining coverage gua...
02 Apr 2026

You've Just Bought a Book. Your Model Doesn't Know These Risks.

When you acquire a portfolio or enter a scheme, your pricing model was fitted on a different risk population. Weill and Wang (2026) give a kernel GLM framework for correcting th...
02 Apr 2026

How Long Does an Inflationary Shock Last? Lessons from Pandemic Mortality Persistence

A new mortality model from Liu & Zhou (2026) shows that cause-specific shocks decay heterogeneously — some fast, some slow. The analogy to UK claims inflation is exact, and the ...
02 Apr 2026

Individual Claims Reserving: Why a Linear Regression Beats a Transformer

Richman & Wüthrich's one-shot individual claims reserving framework (arXiv:2603.11660) shows that a simple OLS model — using case estimates as the primary feature — matches or b...
02 Apr 2026

Genetic GAMs: An Interesting Idea That EBM Already Solved

Shankar & Cohen automate GAM structure search using NSGA-II evolutionary algorithms. The idea is legitimate; the problem is that EBM already does this better for insurance prici...
02 Apr 2026

Building-Level Flood Pricing: What Flood Re's 2039 Phase-Out Means for Your Models

Flood Re ends in 2039. From that date, 350,000+ currently subsidised properties need risk-reflective pricing. We work through Moriah et al. 2026's sequential GLM using geolocate...
02 Apr 2026

Conformal Fairness When the Protected Attribute Is Missing

Kong, Liu & Yang prove that standard conformal coverage guarantees degrade unevenly when protected attributes are absent at test time. With post-ECJ gender prohibition and GDPR ...
02 Apr 2026

Per-Prediction Audit Trails for Pricing Models: insurance-governance v0.3.0

insurance-governance v0.3.0 adds ExplainabilityAuditTrail: a per-prediction audit log that records SHAP values, fairness flags, and plain-language summaries for every pricing de...
02 Apr 2026

Your SCR Is an Expected Shortfall. Stop Estimating It Via a Quantile Detour.

ExpectedShortfallRegressor in insurance-quantile v0.5.0 implements direct ES regression via the i-Rock method. No quantile detour. Sandwich SEs. Directly relevant to Solvency II...
02 Apr 2026

When Parallel Trends Break: Gaussian Processes for Staggered Rate Change Evaluation

Gevorgyan et al. propose exchangeable multi-task Gaussian processes for causal effect estimation in staggered-adoption designs. The method handles nonlinear trends that break SD...
02 Apr 2026

Dynamic Scale Models for Solvency II: Interesting for Banks, Less So for Insurers

Liu & Luger (arXiv:2603.02357) build a semiparametric VaR/ES forecaster that models scale dynamics through quantile differences rather than GARCH variance equations. The method ...
02 Apr 2026

Your Reserve Already Has a Risk Margin — What RL Actually Adds (and Doesn't)

A new paper (arXiv:2504.09396) uses PPO reinforcement learning with a CVaR constraint to manage a reserve buffer. The framing is interesting — but this is not reserving in the a...
02 Apr 2026

Your Conformal Intervals Lie About Some Policyholders

Marginal 90% coverage can hide severe undercoverage for specific risk profiles. ConditionalCoverageAssessor — new in insurance-conformal v1.2.0 — quantifies it with CVI, decompo...
02 Apr 2026

You Cannot Properly Score Expected Settlement Time

RMSE on closed claims is biased. Corollary 3 of Taggart, Loveday & Louis (arXiv:2603.14835) proves mean settlement time is not scoreable under right-censoring. CensoredForecastE...
02 Apr 2026

Your Threshold Is a Choice, Not a Fact: BNP Heavy-Tail Severity Modelling

Nieto-Barajas (arXiv:2602.07228, 2026) proposes a Bayesian nonparametric mixture of Scaled Generalised Gaussian distributions that eliminates threshold selection entirely. The m...
02 Apr 2026

Your NCD Relativities Are Wrong, and the Maths Now Tells You How Wrong

Two January 2026 arXiv papers formalise what motor actuaries have always known informally: NCD creates rational incentives to suppress small claims, and the GLM you're using to ...
02 Apr 2026

Your NCD Table Has a Frequency Bias

insurance-credibility v0.1.9 adds BMSEquilibriumSimulator — Lemaire NPV reporting thresholds, Liang 2-class equilibrium, and a frequency correction for the selection bias in NCD...
02 Apr 2026

Biased Mean Regression: A Principled Test for Systematic Reserve Errors

Malandii & Uryasev's Biased Mean Quadrangle (arXiv:2603.26901) provides a linear-programming-based method for estimating E[Y]+x — a biased mean offset. For reserving actuaries, ...
02 Apr 2026

Your Premium Sufficiency Guarantee Has an Expiry Date

Every time you re-run conformal risk control calibration on a growing book, you are implicitly doing multiple testing. Hultberg et al. (2026) formalise the fix.
02 Apr 2026

ANAM's Monotonicity Insight Is Real. The tensorflow_lattice Dependency Is Not.

Laub/Pho/Wong's Actuarial Neural Additive Model has a genuine architectural insight in PWLCalibration monotonicity. It also depends on an unmaintained TensorFlow library. EBM is...
02 Apr 2026

From Competitor Quotes to Risk Parameters: ABC for Entry Pricing in Python

A runnable Python implementation of Goffard, Piette, and Peters (ASTIN Bulletin 2025): infer claim frequency and severity from competitor PCW quotes using ABC-SMC with isotonic ...
01 Apr 2026

TabPFN as a Conditional Density Estimator: What the Benchmark Actually Shows for Severity Pricing

Izbicki and Rodrigues (arXiv:2603.26611, March 2026) benchmark TabPFN-2.5, RealTabPFN-2.5 and TabICL-Quantiles as conditional density estimators across 39 datasets. The thin-dat...
01 Apr 2026

Applying Bonus-Malus to Driving Behaviour, Not Just Claims

Yanez, Guillen and Nielsen (ASTIN Bulletin 2025) apply a bounded Bonus-Malus System not to claims but to telematics signals themselves, updating weekly. The result: Gini from 0....
01 Apr 2026

The 99th Percentile From 200 Claims: Wasserstein Robust Quantile Regression for Thin Portfolios

Zhang, Mao and Wang (arXiv:2603.14991, March 2026) prove a closed-form equivalent for worst-case quantile regression under Wasserstein distributional uncertainty — a result that...
01 Apr 2026

Territory Rating Beyond Postcode Lookup Tables

UK territory rating is mostly postcode-to-band lookup tables. That creates both actuarial and regulatory risk. We work through the spatial statistics toolbox that does better: B...
01 Apr 2026

Your Retention Model Is Wrong About When Customers Lapse

Most retention models measure whether a customer lapses. surv-iTMLE measures when — and what your pricing intervention caused. We explain the estimand, why left truncation is mo...
01 Apr 2026

SPQRx: Semi-Parametric Severity Modelling Without Threshold Selection

Threshold selection is the Achilles' heel of extreme value theory in insurance pricing. Majumder and Richards (arXiv:2504.19994) eliminate it by blending a spline-neural-network...
01 Apr 2026

Shape-Adaptive Conformal Prediction: Why Your Intervals Are Wrong for Skewed Claims

Standard conformal prediction gives symmetric intervals calibrated on average. For right-skewed claim distributions, that average includes a lot of zero claims pulling threshold...
01 Apr 2026

Semi-Structured Multi-State Models for Policy Lapse: A Mortgage Paper Worth Watching

Medina-Olivares, Xia, Lessmann and Klein (arXiv:2603.26309, March 2026) build a semi-structured neural network model for mortgage delinquency transitions. The method combines a ...
01 Apr 2026

Two Ways to Control Risk in Automated Underwriting: Conditional vs Marginal

Two rigorous frameworks for automated underwriting triage — SelectiveConformalRC (SCRC) controls expected loss on your auto-priced book; SCoRE controls total deployed risk via e...
01 Apr 2026

SafeDriver-IQ: Telematics Scoring via Inverse Crash Probability

Roy, Singh, and Das (arXiv:2603.14841) build a 0-100 driver safety score by inverting a crash classifier and multiplying in condition-specific penalty factors. The maths is clea...
01 Apr 2026

QPP ratemaking: when quantile loading genuinely helps, and the exposure offset gap

A follow-up to our QPP introduction: the honest case for quantile-based loading (it works for heavy-tailed lines and low-frequency risks, it does not work below the zero mass), ...
01 Apr 2026

Your GLM Confidence Intervals Are Wrong After Variable Selection

Every UK pricing GLM pipeline that uses Lasso variable selection then reports Wald confidence intervals is producing coverage rates of 70–80% at a nominal 95%. A March 2026 pape...
01 Apr 2026

Your GLM Confidence Intervals Are Wrong After Lasso — Here Is the Bias-Corrected Fix

PenalizedGLMInference in insurance-gam v0.5.0 implements Manna et al. arXiv:2410.01008: bias-corrected confidence intervals for Poisson, Gamma, and Tweedie GLMs after Lasso or e...
01 Apr 2026

Online Conformal Validity on Claims Data: Beta-Mixing Conditions and the Seasonal Caveat

Standard conformal prediction gives valid coverage only when calibration and test data are exchangeable. For insurance models deployed for 12+ months — through claims inflation ...
01 Apr 2026

The Hunger for Bonus: How UK Motor NCD Pricing Gets the Frequency Wrong

Two January 2026 arXiv preprints formalise what UK pricing teams have long intuited: observed claim frequency at high-NCD classes understates true frequency by 15–35%, because p...
01 Apr 2026

MGA Entry Pricing: A Four-Stage Architecture from Day Zero to Year Three

An MGA launching on a UK PCW needs prices on day one with zero claims history. Here is the full architecture: market ABC as the prior, Bühlmann-Straub blending as claims arrive,...
01 Apr 2026

Mahalanobis Conformal Prediction: When Ellipsoids Beat Rectangles (and When They Do Not)

Braun et al. (arXiv:2507.20941) replace the standard hyperrectangular joint prediction set with an ellipsoid built from a Mahalanobis nonconformity score. For d=2 (frequency + s...
01 Apr 2026

Uncertainty for Free: Prediction Intervals on Your GBM Pricing Model Without Retraining

LoBoostCP in insurance-conformal v1.0.0 implements Santos et al. arXiv:2602.22432 — local conformal prediction that uses the leaf structure of your existing GBT to calibrate pre...
01 Apr 2026

Formal Statistical Tests for Insurance Pricing Model Drift

insurance-monitoring v0.10.0 adds PricingDriftMonitor (Brauer/Menzel/Wüthrich arXiv:2510.04556) and CalibrationCUSUM (Franck et al. arXiv:2510.25573): formal statistical tests f...
01 Apr 2026

Five Ways to Get Market Data for Entry Pricing — and What Each Actually Delivers

A UK MGA at launch has five routes to market data: competitor quote reverse-engineering, rate indices from Consumer Intelligence, ABI aggregate statistics, capacity provider dat...
01 Apr 2026

EIOPA's AI Governance Opinion: What UK Pricing Teams Need to Do

EIOPA published its AI Governance Opinion (EIOPA-BoS-25-360) in August 2025. It names the actuarial function as responsible for AI controls, endorses SHAP and LIME explicitly, a...
01 Apr 2026

Double/Debiased Machine Learning for Insurance Pricing: A Practitioner Guide

DML removes the omitted variable bias that makes naive GLM price elasticity estimates wrong by 20–80%. We explain why it works, show the two core insurance applications — price ...
01 Apr 2026

DebiasedGLM: Honest Confidence Intervals for Lasso-Selected Rating Models

insurance-gam v0.4.0 adds DebiasedGLM — the first Python implementation of debiased Lasso confidence intervals for Poisson, Gamma, and Tweedie GLMs. It corrects the bias that ma...
01 Apr 2026

CRPS-Optimal Conformal Binning: When Actuarial Scoring Drives the Interval

Paolo Toccaceli's CRPS-Optimal Binning for Conformal Regression (arXiv:2603.22000) partitions the covariate space using dynamic programming to minimise LOO-CRPS, then calibrates...
01 Apr 2026

Cross-Product Claim Scores: Do They Work in the UK?

Verschuren (2021) showed that a Dutch insurer's home claim history predicts motor risk, and vice versa. The framework is technically clean. The UK structural context — aggregato...
01 Apr 2026

Full Reserve Distributions: Continuous-Time Bootstrapping for IBNR and IBNER

Baradel (arXiv:2603.11258) extends continuous-time bootstrapping to Schnieper's model, separating IBNR (new claims) from IBNER (cost development of known claims) and producing t...
01 Apr 2026

Your Prediction Intervals Are Unfair (And You Haven't Checked)

Vadlamani et al. (ICLR 2025, arXiv:2505.16115) formalise fairness at the prediction-set level. A model can be statistically valid at 90% coverage while covering elderly policyho...
01 Apr 2026

Distribution-Free Process Monitoring: Replace Your Arbitrary Thresholds with Calibrated Ones

Burger (arXiv:2512.23602, Dec 2025) applies conformal prediction to insurance model monitoring, replacing PSI > 0.2 and A/E > 1.15 with thresholds that are calibrated from data ...
01 Apr 2026

Conformal Prediction for Censored Survival Outcomes: ConformalisedSurvival in insurance-conformal v1.2.0

Standard conformal prediction fails with right-censored survival data because you never observe the true event time for censored policies. ConformalisedSurvival in insurance-con...
01 Apr 2026

Prediction Intervals That Survive an Instrument: Conformal IV Regression for Pricing Teams

Standard conformal prediction breaks under instrumental variable regression — the calibration residuals are not exchangeable. Kato (arXiv:2603.25509, March 2026) fixes this by r...
01 Apr 2026

Conformal Control Charts: Replacing PSI Thresholds With a Proper False Alarm Rate

PSI thresholds of 0.1 and 0.2 are industry convention, not statistical calibration. Burger (arXiv:2512.23602, Dec 2025) replaces them with conformal p-values, giving a distribut...
01 Apr 2026

Why Your CI Cause-of-Death Models Disagree With Each Other (And How to Fix It)

Independent Lee-Carter models per cause-of-death produce forecasts that do not sum to total mortality — a coherence failure that flows directly into CI and LTC reserves. Nigri, ...
01 Apr 2026

ADAS Near-Miss Counts as a Risk Signal: A Zero-Inflated Poisson Framework

Zhang, Guillen, Li, Li and Chen (arXiv:2509.02614) build a group-based zero-inflated Poisson model for ADAS near-miss event counts using 354 commercial drivers across 8.1 millio...
01 Apr 2026

The Neural GAM Actuaries Can Explain to Their CRO

ANAM (Laub, Pho, and Wong, NAAJ 2025) fits each rating factor as a neural subnetwork with hard monotonicity constraints, exposure offsets, and proper actuarial losses. The insur...
31 Mar 2026

Conformal Prediction for Lapse Timing When Your Book Has Shifted

Shin, Lee and Kang (arXiv:2512.03738, Dec 2025) provide the first finite-sample coverage guarantee for time-to-event prediction under covariate shift. Here is what it means for ...
31 Mar 2026

Transfer Learning for Thin Portfolios: What Works, What Doesn't, and Why DANN Is the Wrong Tool

When you launch a new product with no claims history, you borrow from a related portfolio. Transfer learning formalises this. But the most-cited deep learning method for domain ...
31 Mar 2026

Scoring a Trip as a Function, Not a Feature Vector

Standard telematics pricing throws away most of the information in a trip by reducing it to 100+ scalar features. Two new papers from Toronto's Badescu group show what you recov...
31 Mar 2026

Tab-TRM: Best on the Benchmark, Not the Right Starting Point

Tab-TRM sets the French MTPL benchmark at 23.589×10⁻² Poisson deviance, beating PIN ensemble by 0.3%. The linearisation result — Tab-TRM is approximately a state-space model — i...
31 Mar 2026

Risk-Informed Renewal Classification: Bridging the Pricing-Retention Silo in UK Motor

Boonkrong et al. (MDPI Risks 14(3):57, March 2026) show that adding actuarial pricing features to a renewal classifier materially improves prediction — the insight being that th...
31 Mar 2026

Retention-Aware UBI Pricing: Risk Score, Churn Elasticity, and Consumer Duty

Li, Luo, Zhang, Huang and Jiang (IME 2025) combine telematics risk scoring with individual price sensitivity estimation and constrained discount allocation. Here is how to adapt...
31 Mar 2026

Setting premiums at the 85th percentile: quantile premium pricing with neural networks

The quantile premium principle maps a single number — your risk appetite parameter tau — to per-risk safety loadings. Zanzouri et al. (NAAJ 2025) shows QRNN outperforms tree-bas...
31 Mar 2026

Privacy Risk Benchmarking for Synthetic Insurance Data: What the Standard Metrics Get Wrong

Membership Inference Attacks are essentially uninformative on tabular insurance data — a finding from Zuo et al. (arXiv:2602.09288) with direct consequences for how UK pricing t...
31 Mar 2026

Pricing SME Cyber with Python - What Works and What Doesn't

UK SME cyber is growing at 13% CAGR with 2.8% standalone penetration. The data is terrible, the commercial tools are six figures, and the best published frequency model explains...
31 Mar 2026

A New Archimedean Family from Information Theory: Power-Divergence Copulas

Pearse & Bondell (arXiv:2510.06177, October 2025) derive a new Archimedean copula family from Cressie-Read phi-divergences. The generator table maps KL divergence to one copula,...
31 Mar 2026

When your coverage guarantee means nothing: optimal regret in online conformal prediction under drift

ACI satisfies its marginal coverage guarantee while producing months of invalid intervals after a claims inflation shock. A new paper proves the minimax-optimal algorithm flushe...
31 Mar 2026

Your Policyholders Are Playing a Game with Your NCD Ladder

Policyholders with good NCD rationally choose not to report small claims. Your frequency model is trained on that suppressed data. Two January 2026 papers formalise what this me...
31 Mar 2026

Balance and Fairness Are the Same Problem — Multicalibration in Insurance Pricing

Denuit, Michaelides & Trufin (arXiv:2603.16317) prove that autocalibration and group fairness are mathematically equivalent. A GBM that is well-calibrated overall but miscalibra...
31 Mar 2026

From Competitor Quotes to Risk Parameters: Implementing Market-Based Ratemaking in Python

The Goffard/Piette/Peters ABC-SMC method infers risk parameters from competitor quotes with no claims history. No Python implementation exists - only IsoPriceR in R. Here is the...
31 Mar 2026

LLM Feature Engineering for Insurance Pricing — What Actually Works

Three published frameworks use LLMs to generate tabular features and beat classical search tools on generic benchmarks. None has been tested on an actuarial dataset. We explain ...
31 Mar 2026

When Your Calibration Set No Longer Matches Your Book — KMM-CP for Covariate Shift

Laghuvarapu, Deb and Sun (arXiv:2603.26415, March 2026) replace per-test-point density ratio estimation with bounded QP weights on the calibration set. Here is what that buys yo...
31 Mar 2026

Separating Structural Inflation from the Underwriting Cycle with insurance-trend v0.1.5

The new InflationDecomposer in insurance-trend v0.1.5 uses the Harvey structural time series model to separate persistent cost inflation from cyclical effects. Using the raw tre...
31 Mar 2026

GAMs vs Neural Additive Models: A Decision Guide for UK Pricing Teams

arXiv:2510.24601 reviewed 143 papers across 430 datasets and found no consistent accuracy advantage for neural networks over GAMs on tabular data by 2024. What that means for th...
31 Mar 2026

Your XL Pricing Has a Truncation Bias. Here Is the Fix.

Ignoring policy limits and IBNR censoring when fitting tail distributions biases your tail index by ~15%. For UK motor TPBI with xi in the 0.50–0.67 range, even a 15% upward bia...
31 Mar 2026

Why your rate change evaluation should be doubly robust

Sun, Xie & Zhang (arXiv:2503.11375) combine parallel trends and synthetic control into a single estimator that remains consistent if either assumption holds. We explain the math...
31 Mar 2026

Distribution-Free Prediction Intervals for Insurance Pricing — Conformal Methods That Actually Work

GLM prediction intervals targeting 95% coverage achieved 57.8% actual coverage on real personal injury data. Conformal prediction fixes this without distributional assumptions. ...
31 Mar 2026

Differential Privacy for Insurance Synthetic Data: Why DP-CTGAN Fails and What Actually Works

DP-CTGAN produces near-random output at epsilon=1 on datasets under 50K rows — which is most insurance portfolios. AIM via smartnoise-synth is the correct tool. Here is the full...
31 Mar 2026

Climate Claims Forecasting with Deep Learning and Copulas

Dey (arXiv:2601.11949) builds a two-step pipeline — MLP for precipitation-to-claims, Gumbel copula for climate model uncertainty — that is methodologically sound, Canadian-only,...
31 Mar 2026

Constrained Neural Networks for Insurance Pricing — Architectural vs Post-Hoc

If you are using XGBoost's monotone_constraints or applying isotonic regression post-hoc, you may already have everything you need. Or you may not. The answer depends on your mo...
31 Mar 2026

Conformal Prediction Intervals for Insurance Pricing Models

Parametric Tweedie intervals over-cover low-risk policies and under-cover the high-risk tail. Conformal prediction fixes this with a finite-sample guarantee that does not rely o...
31 Mar 2026

Conformal Prediction with Change Points: When Your Coverage Guarantee Breaks and What to Do About It

Conformal prediction with change points (CPTC, arXiv:2509.02844, NeurIPS 2025) extends adaptive conformal inference to detect structural breaks and reset coverage guarantees pro...
31 Mar 2026

Drift Localisation: Which Passengers Got Wet?

Hinder et al. (arXiv:2602.19790, ESANN 2026) introduce bootstrap conformal p-values for identifying which individual observations are affected by drift. We explain why this is a...
31 Mar 2026

Testing Conditional Coverage in Conformal Prediction — The ERT Diagnostic

Conformal prediction gives valid marginal coverage but says nothing about conditional coverage — your intervals can fail for young drivers or flood-zone properties while the por...
31 Mar 2026

Conditional Coverage and Conformal Prediction Model Selection: CVI and CC-Select

Marginal coverage guarantees say nothing about which policyholders are being undercovered. CVI decomposes conditional coverage into undercoverage risk and overcoverage cost. CC-...
31 Mar 2026

Competing Risks Calibration: Why Your Fine-Gray Validation Is Wrong

D-calibration and ICI are mathematically invalid for competing-risks models. If F_k(inf|x) < 1 — which is always true for lapse, claim, and MTA competing causes — the probabilit...
31 Mar 2026

How Long Does an Inflationary Shock Last? A Gamma-Decay Model for Claims Cost Persistence

Fitting one aggregate trend to UK motor claims 2019–2024 embeds a single implicit decay rate across parts shortage, labour shortage, and social inflation — components that norma...
31 Mar 2026

BYM2 vs Contiguity-Constrained Clustering for Rating Territories

Wang, Shi, and Cao (NAAJ 2025) propose clustering NN-encoded residuals under a spatial contiguity constraint to produce hard territory zones. Here is how that differs from BYM2 ...
31 Mar 2026

Beyond Flood Zone 3: Building-Level Property Risk with Python

EA NaFRA is open, 2m resolution, and free. So is the EPC register. So are OS building footprints. A UK pricing actuary who has actually tried to use them explains what you can a...
31 Mar 2026

Bayesian Hierarchical Credibility — When Bühlmann-Straub Isn't Enough

Bühlmann-Straub pulls every segment toward the same grand mean. When that mean is wrong for your segment, the correction makes things worse. We explain when to reach for Poisson...
31 Mar 2026

Chain Ladder Through the Lens of Mortality Modelling: APC Decomposition for Loss Reserving

Pittarello, Hiabu, and Villegas (NAAJ 2025) showed that chain ladder is the age-only special case of an Age-Period-Cohort model borrowed from demography. We explain what this me...
31 Mar 2026

Actuarial Neural Additive Models: Interpretable Deep Learning for Insurance Pricing

Laub, Pho and Wong's ANAM gives each rating factor its own neural network sub-model, sums them like a GAM, and adds three things a generic NAM cannot do: hard monotonicity, expo...
30 Mar 2026

Adaptive EVT Threshold Selection for Large Loss Pricing: Two Methods That Actually Work

Standard UK practice for POT threshold selection is to pick a round number and hope. Two recent papers — EQD (Murphy et al. 2024) and BMA (Jessup et al. 2025) — give automated m...
28 Mar 2026

ZeroInflatedTweedieGBM: The So & Valdez (2024) Implementation

The first pip-installable implementation we know of for So & Valdez (2024) Scenario 2: a two-stage CatBoost model that separates structural zero probability from Tweedie severit...
28 Mar 2026

Zero-Inflated Tweedie GBM for Insurance Pricing

insurance-distributional v0.3.0 ships ZeroInflatedTweedieGBM — the first open-source implementation of So & Valdez (2024) Scenario 2. When standard Tweedie gets structural zeros...
28 Mar 2026

Smoothing Motor Age Curves with Whittaker-Henderson Poisson: a freMTPL2 Benchmark

We fit WhittakerHendersonPoisson to driver age frequencies from 677K French MTPL policies. The Poisson smoother handles count data correctly, REML selects lambda automatically, ...
28 Mar 2026

Tweedie Regression for Insurance Pricing in Python

Why Tweedie GLM is the standard for aggregate loss modelling in insurance, with a complete Python example covering power parameter selection, exposure offset, and comparison wit...
28 Mar 2026

Tabular Foundation Models for Insurance Pricing — Do They Work?

An honest assessment of where tabular foundation models stand in March 2026 — what the benchmarks actually show, what's missing for insurance pricing, and which models are worth...
28 Mar 2026

Survival Models for Insurance Lapse Prediction: What Actually Works

Deep learning survival models underperform Cox regression on tabular insurance data. Cure models are the real story post-GIPP. Here is what the research says and what UK pricing...
28 Mar 2026

Structural vs Cyclical Claims Inflation: How to Decompose What You're Actually Seeing

UK motor average claim costs reached a record £5,300 in Q4 2024. But applying a flat 8% trend assumption treats structural and cyclical inflation identically. They have opposite...
28 Mar 2026

Stochastic Reserving in Python: Mack and Bootstrap ODP with chainladder

How to produce a full IBNR distribution in Python using the Mack method and Bootstrap ODP sampling. Covers analytical standard errors, 5,000-simulation bootstrap, percentile tab...
28 Mar 2026

Selection Bias in Technical Pricing — Why Your Loss Model Is Wrong and How to Fix It

Your GLM or GBM was trained on policyholders who chose to buy from you at your price. That is not a random sample of the market. The mechanism, what it does to your frequency an...
28 Mar 2026

Probabilistic Gradient Boosting for Insurance Pricing — Beyond Point Predictions

XGBoostLSS, LightGBMLSS, NGBoost, and PGBM can all output a full conditional distribution rather than a point prediction. The Chevalier & Côté benchmark (EAJ 2025) tested 11 alg...
28 Mar 2026

Reserving with chainladder-python Part 3: Neural Networks vs Traditional Methods

When does it make sense to reach beyond chain ladder and bootstrap ODP for neural reserving methods? We compare DeepTriangle, individual RNN approaches, and the Richman-Wüthrich...
28 Mar 2026

Motor Insurance Pricing in Python: A Complete Walkthrough

End-to-end motor insurance pricing in Python using the French MTPL dataset. Frequency-severity GLMs, exposure offsets, coefficient interpretation, validation, and calibration to...
28 Mar 2026

What PRA SS1/23 Validation Looks Like on Real Data: 677K French Motor Policies

Most governance tooling is tested on toy examples with clean DGPs and inflated Gini coefficients. We ran the full insurance-governance validation suite on 677K freMTPL2 policies...
28 Mar 2026

Mixture Density Networks for Insurance Severity

Gamma GLMs fit a single mode to severity data that often has two or three. Mixture Density Networks output the full conditional distribution — mixing weights, component means, a...
28 Mar 2026

Loss Ratio Trending and Rate Adequacy in Python

The standard rate adequacy workflow — earned premium at current rate level, ultimate losses, trend to future period, expense load, indicated rate change — built in Python with p...
28 Mar 2026

Linear Regression Beats Neural Networks for Individual Claims Reserving

Richman and Wüthrich's March 2026 paper shows linear regression with projection-to-ultimate factors closes 44% of the gap over chain ladder — and neural networks add nothing at ...
28 Mar 2026

KAN: What If Your Neural Network Learned Its Own Link Functions?

Kolmogorov-Arnold Networks replace fixed activations with learnable splines on edges, letting the model discover its own functional forms. Here is what that means for insurance ...
28 Mar 2026

How to Smooth Noisy Insurance Loss Curves

Raw loss ratios by age band are noisy. A 5-year moving average introduces boundary bias and requires a judgment call you cannot defend in an IFRS 17 review. This tutorial shows ...
28 Mar 2026

How to Add Prediction Intervals to an Insurance Pricing Model

Point estimates from pricing models are incomplete. This tutorial shows how to add distribution-free prediction intervals to a CatBoost Tweedie model using insurance-conformal —...
28 Mar 2026

EU AI Act Article 13: what transparency actually requires for a pricing model

Article 13 of the EU AI Act is not about SHAP values. It is about deployer-facing documentation — what the underwriter or product team needs to interpret and use a pricing model...
28 Mar 2026

Your Elasticity Model Has a 3x Bias

Most UK motor insurers think they know their price elasticity. They are probably wrong by a factor of 3–5, in the direction that makes them systematically mispricing. The eviden...
28 Mar 2026

Dynamic Pricing for UK Motor and Home Insurance — Build or Buy?

A practitioner's guide to dynamic pricing in UK insurance: what GIPP actually permits, why your elasticity model likely has a 3-5x bias, and an honest assessment of whether Earn...
28 Mar 2026

Does Whittaker-Henderson Smoothing Actually Work for Insurance Pricing?

We benchmarked Whittaker-Henderson against raw rates and a 5-point weighted moving average on a synthetic UK motor driver age curve with known truth. W-H reduces MSE by 57.2% vs...
28 Mar 2026

Does Sarmanov Copula Frequency-Severity Modelling Actually Work?

The standard UK motor pricing formula multiplies E[N] by E[S] and assumes independence. On a 15,000-policy benchmark with planted omega=3.5, that assumption understates portfoli...
28 Mar 2026

Does PSI Actually Catch Pricing Model Drift?

PSI detects covariate shift but not rank collapse. On a synthetic UK motor book where a new risk factor emerges post-deployment, PSI stays GREEN while Gini drops 8 points. The B...
28 Mar 2026

Does Monotonicity-Constrained EBM Actually Work for Insurance Pricing?

On a UK motor DGP with a monotone young-driver requirement, unconstrained EBM violates monotonicity in 31% of runs. Constrained EBM matches GLM monotonicity compliance at 100% w...
28 Mar 2026

Does HMM Telematics Risk Scoring Actually Work for Insurance Pricing?

HMM-derived driving state features improve Gini by 5–10 percentage points over raw trip averages on a state-structured DGP. The reason is temporal: the HMM knows that aggressive...
28 Mar 2026

Does Bühlmann-Straub Credibility Actually Work?

We benchmarked Bühlmann-Straub credibility against raw experience and manual Z-factors on a 30-segment synthetic UK motor fleet book with a known DGP. On thin schemes, it reduce...
28 Mar 2026

Does Automatic Lambda Selection for Whittaker-Henderson Actually Work?

REML-selected lambda beats manual tuning on a 63-band age curve benchmark: 22% lower MSE on thin tail bands, zero analyst discretion, and principled credible intervals. The hone...
28 Mar 2026

Chain Ladder Reserving in Python: A Practical Tutorial with chainladder

Build loss development triangles, calculate IBNR reserves, and plot development patterns using Python and the chainladder library.
28 Mar 2026

BonusMalus Is Endogenous: DML on 677K French Motor Policies

BonusMalus is built from past claims — a naive regression conflates the causal effect with selection. We ran Double Machine Learning on 677K freMTPL2 policies to isolate what Bo...
28 Mar 2026

The Burning Cost Method for Excess of Loss Reinsurance Pricing in Python

A practical Python walkthrough of the burning cost method for pricing excess of loss reinsurance treaties — loss trending, development, pure rate calculation, and sensitivity an...
28 Mar 2026

Bühlmann-Straub on freMTPL2: What Regional Credibility Actually Looks Like

We ran Bühlmann-Straub credibility on the freMTPL2freq dataset — 677K French MTPL policies, 22 regions — and quantified how much thin regions get pulled toward the portfolio mea...
27 Mar 2026

Multicalibration: Portfolio Balance and Fairness Are the Same Test

Denuit, Michaelides and Trufin (March 2026) unify autocalibration and non-discrimination into a single actuarial test. If your model fails it, you have a pricing problem and a r...
27 Mar 2026

Model Drift Detection on 677k Policies: PSI, A/E, and Gini Tests on freMTPL2

We fitted a Poisson GLM on the first third of freMTPL2 (677k French motor policies) and monitored it across two later temporal segments without refitting. PSI, A/E ratios with W...
27 Mar 2026

What a Real Fairness Audit Finds: Gender Bias Testing on 67,856 Motor Policies

We ran insurance-fairness against ausprivauto0405 — a real Australian motor dataset with an explicit Gender field. Here is what FairnessAudit, MulticalibrationAudit, and Indirec...
27 Mar 2026

Why Random Cross-Folding Is Wrong for Time-Series Causal Inference

Ciganovic et al. (March 2026) show that standard DML cross-fitting leaks future information when your data is a time series. Their fix — Reverse Cross-Fitting — has direct impli...
27 Mar 2026

What Conformal Prediction Actually Guarantees (And What It Doesn't)

Sesia & Favaro's March 2026 survey of conformal prediction is the clearest account yet of what finite-sample distribution-free guarantees mean - and why the marginal/condition...
26 Mar 2026

Portfolio-Anchored Telematics Risk Scoring with Wavelets

Lee, Badescu, and Lin (2026) replace ad-hoc event counts with a principled actuarial risk index: MODWT decomposes the acceleration signal, a Gaussian-Uniform mixture anchors tai...
26 Mar 2026

Building a Tweedie GLM for Insurance Pricing in Python

A complete Python tutorial for building a Tweedie GLM for insurance pricing: synthetic motor data, statsmodels, exposure offset, interpreting the p parameter, residual diagnosti...
26 Mar 2026

TPBI Multi-State Modelling After the Whiplash Reforms

The Civil Liability Act 2018 split UK TPBI into two structurally different populations. Standard frequency-severity models treat them as one. Here is why that matters and what a...
26 Mar 2026

The PCW Endogeneity Problem: Why Your Conversion Model Is Biased

Most UK insurers fit a logistic regression on PCW quote data and call it a demand model. It is biased in at least three distinct ways. Here is the causal structure that explains...
26 Mar 2026

Tail Scoring Rules: Why CRPS Fails in the Tail and What to Use Instead

Brehmer & Strokorb (2019) proved that no proper scoring rule applied to raw data can discriminate tail indices. Bladt & Øhlenschlæger (arXiv:2603.24122) fix this by scoring norm...
26 Mar 2026

Spatial Panel GBMs: A Better Way to Price Geography

Balzer and Benlahlou (arXiv:2603.14543) extend gradient boosting to spatial panel data. Here is what it does, how it compares to BYM2 and Blier-Wong, and when a UK pricing team ...
26 Mar 2026

Spatial Error Correction vs Spatial Smoothing: Two Different Questions in Territory Rating

Balzer and Benlahlou's spatial GBM uses GMM pre-estimation and a Cochrane-Orcutt transformation to handle spatial autocorrelation in gradient boosting. It is a different tool fr...
26 Mar 2026

Robust Discrete Pricing Optimisation via Knapsack

A new paper from ETH Zürich (arXiv:2603.18653) frames the conversion from technical price to commercial premium as a Multiple-Choice Knapsack Problem. Under 1% revenue cost for ...
26 Mar 2026

The PtU Reserving Algorithm in Python: Filling the Gap Left by Richman-Wüthrich

Richman-Wüthrich's one-shot PtU reserving paper (arXiv:2603.11660) ships with R code only. We map the algorithm to Python, explain the censored-claims exposure mechanism that ma...
26 Mar 2026

Panel DML with Instrumental Variables: When DiD Isn't Enough

A new paper combines panel fixed effects, double machine learning, and instrumental variables. The headline result is not the estimator — it's that ML covariate adjustment frequ...
26 Mar 2026

Text Embeddings on Claims Data: The Pipeline, the Papers, and the Limits

How to turn insurance claims descriptions into GLM features using sentence-transformer embeddings and PCA. What Troxler & Schelldorfer (2024, BAJ) actually showed, what the Kagg...
26 Mar 2026

Claims Lifecycle Modelling: The Python Gap and How to Bridge It with Poisson GLMs

Python has no equivalent of R's msm package for continuous-time multi-state modelling of claims. We explain the mathematics, show why a Poisson GLM substitution works for most p...
26 Mar 2026

What Actually Drives Flood Claims: Evidence from a Marginal Contribution Analysis

Moriah et al. (2026) run a sequential model-building exercise on a French home insurance portfolio to measure what each data layer — hydrological zoning, rainfall intensity, bui...
26 Mar 2026

Is Your Pricing Engine a Material System Under FCA PS26/2?

FCA PS26/2 (March 2026) creates mandatory incident reporting and material third party registers for all authorised insurers. Every pricing actuary who owns a rating API, renewal...
26 Mar 2026

EV Motor Insurance Pricing: Beyond the Flat Surcharge

Why the standard flat EV surcharge is wrong in two directions simultaneously, what the claims data actually shows, and how to build a severity model that handles the bimodal str...
26 Mar 2026

Estimating PCW Conversion Elasticity with Double Machine Learning

How to estimate a causally identified price elasticity from PCW quote data in Python, using commercial loading variation as an instrument and CatBoost nuisance models. The pract...
26 Mar 2026

Does Conformal Prediction Actually Work for Insurance Claims?

Parametric Tweedie intervals undercover high-risk policies by 10–15 percentage points. We tested conformal prediction on 50,000 UK motor policies to find out whether the fix act...
26 Mar 2026

Credibility vs GBM for Thin Segments: A Decision Guide

When you have fewer than 5,000 policies in a segment, should you use Bühlmann-Straub credibility or a GBM with transfer learning? The answer depends on whether you have a relate...
26 Mar 2026

Connected-Car Data Sources for Insurance Pricing: Beyond the Black Box

OEM APIs, smartphone SDKs, and charging data — what connected-car data sources UK pricing teams can actually access, and how to turn them into rating factors.
26 Mar 2026

Conformal Prediction vs Bootstrap Intervals for Insurance Pricing

Conformal prediction and the parametric bootstrap both produce prediction intervals for insurance pricing models. They answer different questions, have different computational c...
26 Mar 2026

Conformal Prediction for Solvency II SCR Validation

Conformal prediction gives finite-sample valid 99.5% risk bounds for individual policies — useful for premium risk SCR validation and model validation consistent with Solvency I...
26 Mar 2026

Causal Effects at Extreme Quantiles: The TIEE Estimator

Li and Castro-Camilo (arXiv:2603.23309, March 2026) unify inverse probability weighting and extreme value extrapolation in a single estimating equation. Here is what it does, wh...
26 Mar 2026

Boulevard: EBM Confidence Intervals Without Bootstrapping

Fang, Tan, Pipping, and Hooker (AISTATS 2026) show that replacing additive boosting with a moving-average update makes EBMs converge to kernel ridge regression — and that means ...
25 Mar 2026

Which Uncertainty Quantification Method? A Decision Framework for Insurance Pricing Actuaries

A structured decision framework for choosing between conformal prediction, distributional GBM, Bühlmann-Straub credibility, GLM bootstrap, and GAM uncertainty. Model type, data ...
25 Mar 2026

UBI Adverse Selection: When Telematics Discounts Drive Away Your Best Risks

The adverse selection trap in opt-in UBI: why telematics discounts attract the risks you least want to retain, and what to do about it.
25 Mar 2026

TabPFN v2 and the Thin-Segment Problem: A Pricing Actuary's Read

TabPFN v2 (Nature 637:319–326, 2025) does zero-shot prediction on datasets up to 10K rows. Here is what that actually means for the pricing segments where your current models ar...
25 Mar 2026

Tab-TRM: The January 2026 Insurance Neural Architecture

Tab-TRM (arXiv:2601.07675) is a 14,820-parameter recursive model that beats CatBoost on French MTPL while connecting to GLM theory. We explain the architecture, the numbers, and...
25 Mar 2026

Synthetic Insurance Data That Preserves Correlations: MICE-RF

Havrylenko et al. (2025) show that MICE with random forests outperforms CTGAN and VAEs on the freMTPL2 benchmark. We explain why it works, where it fails, and how to run it.
25 Mar 2026

Severity Interactions in Gamma GLMs: Weaker Signal, Higher Bar

Applying CANN + NID to severity (Gamma) GLMs. Why the signal is weaker than frequency, what configuration changes are needed, and when a severity interaction is worth adding.
25 Mar 2026

Reinforcement Learning for Individual Claims Reserving: What Avanzi, Richman, and Wüthrich Propose

Avanzi, Richman, and Wüthrich reformulate individual claims reserving as a Markov Decision Process. We explain why it matters, what it actually does, and when a UK reserving act...
25 Mar 2026

Renewal Classification — When Risk-Based Pricing Conflicts With Retention Targets

How to combine lapse hazard models with causal price elasticity under PS21/5 constraints for UK motor and home renewal pricing.
25 Mar 2026

PS21/5 End-to-End: Renewal Pricing Optimisation That Actually Satisfies the FCA

The complete PS21/5 compliance workflow: CATE estimation with insurance-causal, ENBP-constrained optimisation with insurance-optimise, fairness audit with insurance-fairness, an...
25 Mar 2026

Probabilistic Gradient Boosting for Insurance: Which Method Should Pricing Actuaries Use?

Chevalier & Côté (EAJ 2025) benchmark nine GBM variants on five insurance datasets. We read it so you don't have to, then show where insurance-distributional fits in.
25 Mar 2026

Privacy-Preserving Pricing: Federated Learning and Differential Privacy

UK GDPR constrains what pricing data you can share across entities. Federated learning and differential privacy offer a way around the constraint — but only if you understand wh...
25 Mar 2026

Pricing a New Product with No Claims History

Zero internal claims data is not a reason to guess blindly. Here is a structured sequence of five approaches — from Bühlmann-Straub credibility priors through transfer learning ...
25 Mar 2026

Physical Climate Risk in UK Home Insurance Pricing

How UK home insurers should model physical climate risk: UKCP18 projections, Flood Re's 2039 exit, ABI claims data, and practical code using insurance-whittaker, insurance-confo...
25 Mar 2026

Parametric Insurance: Trigger Calibration and Basis Risk

SOA and CAS research from late 2025 has sharpened the methods for calibrating parametric triggers and quantifying basis risk. Here is what that means in practice for UK flood an...
25 Mar 2026

One-Shot Individual Claims Reserving: What the Richman-Wüthrich Paper Actually Shows

arXiv:2603.11660 proposes direct projection-to-ultimate on individual claims data. The honest finding: linear regression on claim status and incurred already beats aggregate cha...
25 Mar 2026

Our 12-Module Insurance Pricing Course Is Now Free and Open Source

Modern Insurance Pricing with Python and Databricks - all 12 modules, free, on GitHub. GLMs through causal elasticity, fairness auditing, spatial BYM2 territory models, and mode...
25 Mar 2026

Measuring Rate Change Impact with DiD and ITS in Python

A hands-on tutorial for the RateChangeEvaluator in insurance-causal v0.6.0. DiD when you have a control group. ITS when you don't. Real code, real API.
25 Mar 2026

Market-Based Ratemaking Without Claims History

Goffard, Piette, and Peters (ASTIN Bulletin 2025) show how to calibrate insurance rates using competitor premiums and no internal claims data — using ABC and isotonic regression...
25 Mar 2026

How to Quantify What a Model Improvement Is Worth in Pounds

A 5pp Gini improvement means nothing to a CFO. The Loss Ratio Error framework from arXiv:2512.03242 converts model correlation into expected loss ratio — and from there into pou...
25 Mar 2026

How to Build a Double-Lift Chart in Python

Build a double-lift chart to compare GLM vs GBM predictions. Bin by prediction ratio, compute A/E per decile, plot with matplotlib. Standard tool for pricing committee model val...
25 Mar 2026

Your Pricing Model Knows the Average. Your Customers Don't Care About the Average.

The average treatment effect hides a 5x spread in price elasticity across a UK motor book. GATES, CLAN, and RATE tell you the size, who's who, and whether the ranking is actiona...
25 Mar 2026

GAMLSS vs Conformal: Head-to-Head on the Same Dataset

Two approaches to prediction intervals for insurance severity: distributional GAMLSS (insurance-distributional-glm) vs distribution-free conformal (insurance-conformal). Same sy...
25 Mar 2026

EVT Meets Machine Learning: Three Papers Worth Reading for Severity Modellers

Three recent papers on EVT and ML — from generalisation bounds for tail learning to Bayesian nonparametric splicing — and what they actually imply for UK severity models.
25 Mar 2026

EVT and ML for Tail Variable Importance: Which Covariates Drive Your Largest Claims?

A covariate that predicts mean severity well may tell you almost nothing about your 99th percentile claims. Here is how to identify which rating factors actually drive large los...
25 Mar 2026

Does DML Causal Inference Actually Work for Insurance Pricing?

We ran Double Machine Learning against a naive GLM on a 50,000-policy UK motor telematics book. The GLM overestimated the treatment effect by 50–90%. Here is what that means for...
25 Mar 2026

Discrimination-Insensitive Pricing: Beyond Removing the Protected Variable

insurance-fairness v0.6.3 ships DiscriminationInsensitiveReweighter. Here's why dropping the protected column doesn't work, how propensity-based reweighting does, and what the A...
25 Mar 2026

Conformal Prediction for Solvency II Capital Requirements: Model-Free SCR Estimation and Governance Validation

How solvency_capital_range() produces model-free 99.5% SCR bounds, how SCRReport produces the coverage validation table for regulatory submission, and why interval_width is the ...
25 Mar 2026

Commercial Lines Pricing with Python

Fleet motor, property, and liability pricing in Python. Covers Bühlmann-Straub credibility for fleet schemes, GPD large loss loading, MBBEFD ILF tables, and PSI drift detection ...
25 Mar 2026

Causal AI for Pricing Actuaries: A Practical Guide

Why GLM coefficients aren't causal effects, and how to fix that using insurance-causal: DML with CatBoost nuisances, causal forests for heterogeneous treatment effects, and DiD/...
25 Mar 2026

CANN: The Combined Actuarial Neural Network in Python

A clean Python tutorial for the most-cited neural network architecture in actuarial pricing: the Combined Actuarial Neural Network (Schelldorfer & Wüthrich, 2019). Architecture,...
25 Mar 2026

Bayesian Last Layer — Uncertainty That Scales to Insurance Datasets

Fiedler & Lucia's Bayesian Last Layer gives you calibrated posterior uncertainty from a neural network at near-zero additional cost. Here is what it does, why it is the right to...
25 Mar 2026

Bayesian Hierarchical Multi-Level Pricing

Bühlmann-Straub credibility breaks when your hierarchy has more than two levels or when the random effects interact with pricing factors. Here is when to upgrade to a full Bayes...
25 Mar 2026

Actuarial Neural Additive Model: What the Paper Actually Does (arXiv:2509.08467)

Laub, Pho and Wong's ANAM paper enforces smoothness and monotonicity architecturally, not as penalties. Here is what the mechanism actually is, why it matters more than the benc...
24 Mar 2026

Zero-Inflated and Hurdle Models for Insurance Claims

Standard Tweedie GLMs handle zeros implicitly. When that implicit handling breaks — specialty lines, niche segments, specific peril models — you need ZIP or hurdle models. Here ...
24 Mar 2026

Tweedie vs Frequency-Severity Split: When to Use Which

The Tweedie GLM and frequency-severity split both model pure premium. They are not interchangeable. Here is how to decide which one you actually need.
24 Mar 2026

The Python Insurance Pricing Benchmark: GLM vs XGBoost vs CatBoost vs LightGBM on freMTPL2

Definitive Python benchmark: Poisson GLM vs XGBoost vs CatBoost vs LightGBM for insurance frequency modelling on freMTPL2. Poisson deviance, Gini coefficient, and A/E calibratio...
24 Mar 2026

Ogden Rate and PPOs: Pricing Large Bodily Injury in Python

How the Ogden discount rate and Periodical Payment Orders change the maths of large BI pricing in the UK — with Python code to calculate lump sum equivalents, discount PPO cash ...
24 Mar 2026

Modelling Social Inflation in UK Motor Severity: A Python Approach

UK motor bodily injury severity has outrun CPI since 2022. This post implements a multiplicative severity separation model and Whittaker-Henderson smoothing in Python to separat...
24 Mar 2026

LLM Feature Engineering for Insurance Pricing: What the Research Actually Shows

What large language models can genuinely contribute to insurance pricing feature engineering — text embeddings, zero-shot classification, synthetic features — and where the evid...
24 Mar 2026

How to Export a Factor Table to Excel in Python

Step-by-step: extract CatBoost factor tables with shap-relativities and write a clean Excel file with openpyxl. Formatted output ready to paste into Radar or Emblem.
24 Mar 2026

Your Pricing Model Knows the Average Effect. That Is Not Enough.

Using causal forests and GATES/CLAN/RATE inference to find which customers respond most to a price or discount change — not just the average effect.
24 Mar 2026

GLM, GAM, and GBM for UK Motor Pricing in Python

The Python equivalent of the IFoA MLR Working Party's R tutorial: Poisson GLM baseline, EBM GAM, and CatBoost GBM on UK motor data, with the full pipeline from data to governance.
24 Mar 2026

Fitting a Motor Insurance GLM in Python: Poisson Frequency and Gamma Severity with statsmodels

A practical statsmodels tutorial for pricing actuaries: Poisson frequency model with exposure offset, Gamma severity model, overdispersion tests, factor table extraction, and A/...
24 Mar 2026

Does insurance-gam actually work for insurance pricing?

Benchmark results on a known-DGP synthetic UK motor book. EBM beats the GLM by 12.6 Gini points (0.455 vs 0.329). But the deviance number is misleading. We explain why, and when...
24 Mar 2026

Does HMM telematics scoring actually work for insurance pricing?

Benchmark results on a known-DGP synthetic UK motor fleet. HMM state fractions deliver 5–10pp Gini lift over simple aggregates. State classification recovers >50% of true high-r...
24 Mar 2026

Safe Model Deployment for Insurance Pricing: Champion/Challenger with a Full Audit Trail

insurance-deploy provides the champion/challenger infrastructure, audit trail, and ICOBS 6B compliance tooling that MLflow does not. Here is how to use it.
24 Mar 2026

Conformalised Quantile Regression: Prediction Intervals That Actually Adapt to Risk

ConformalisedQuantileRegression in insurance-conformal v0.6.2 gives you statistically guaranteed prediction intervals that are wide for high-risk segments and narrow for low-ris...
24 Mar 2026

Claims Inflation Decomposition: Taylor Two-Factor Separation in Python

Extract the calendar-year inflation component from a claims development triangle using Taylor's two-factor separation. Python from scratch, then connect to severity trending.
23 Mar 2026

Python vs R for Actuarial Pricing: A Practical Comparison

A practical comparison of Python and R for UK personal lines insurance pricing — data wrangling, GLMs, GBMs, deployment, and Databricks. Honest about where R still wins.
23 Mar 2026

One-Way Analysis in Python: From Scratch to Production

One-way analysis in Python for pricing actuaries: pandas from scratch, credibility-weighted confidence intervals, thin cell handling, GBM shortcuts.
23 Mar 2026

How to Reproduce an Emblem GLM in Python

Reproduce an Emblem frequency-severity GLM in Python: factor tables, one-way plots, deviance residuals, and lift charts using statsmodels, CatBoost, and Polars.
23 Mar 2026

GLM Assumptions in Insurance Pricing: What Actually Matters

Which GLM assumptions actually matter for insurance pricing, which ones you routinely violate without consequence, and the diagnostics worth running before signing off a product...
23 Mar 2026

Getting Started: Three Libraries, One Workflow

A practical walkthrough for pricing analysts: use insurance-causal for causal inference, insurance-conformal for prediction intervals, and insurance-monitoring for drift detecti...
23 Mar 2026

Does Whittaker-Henderson smoothing actually work for insurance pricing?

Benchmark results on a known-DGP synthetic UK motor age curve. REML recovers the true frequency well in the data-rich middle. The tails are a different story. Numbers, not claims.
23 Mar 2026

Does DML causal inference actually work for insurance pricing?

We ran the benchmarks. On a synthetic UK motor book with nonlinear confounding, naive logistic GLM overestimates the telematics treatment effect by 50–90%. DML recovers the grou...
23 Mar 2026

Does conformal prediction actually work for insurance pricing?

Benchmark results on a known-DGP synthetic motor book. Conformal hits 90% across all deciles. Parametric Tweedie under-covers the top decile by 10–15pp. Numbers, not theory.
23 Mar 2026

Does Bühlmann-Straub credibility actually work for insurance pricing?

Benchmark results on 100 synthetic schemes with known true loss rates. Credibility blending reduces MSE by 25–35% vs the best naive alternative. Numbers, not theory.
23 Mar 2026

How to Build a Burning Cost Model for Insurance Pricing in Python

Build a burning cost model in Python: frequency-severity split, exposure offsets, large loss capping, IBNR adjustment, and combined pure premium for UK pricing.
22 Mar 2026

Three Ways to Detect Proxy Discrimination in Your Pricing Model: When They Agree and When They Don't

Mutual information, proxy R-squared, and SHAP proxy scores all flag proxy discrimination but catch different things. A practical guide to interpreting conflicting signals in ins...
22 Mar 2026

scikit-learn TweedieRegressor vs insurance-distributional: Why Fixed Tweedie Isn't Enough for Insurance Pricing

sklearn's TweedieRegressor is a well-engineered GLM. It fits a fixed-power Tweedie model correctly. The problem is that insurance pricing needs per-risk variance, not a single p...
22 Mar 2026

Insurance Model Monitoring in Python: Gini, A/E, and Double-Lift

Tutorial on monitoring insurance pricing models using actuarial KPIs. Gini tracking, segmented A/E, double-lift for champion/challenger. Why generic drift tools miss what matters.
22 Mar 2026

GLMs for UK Insurance Pricing in Python: What the Generic Tutorials Miss

GLM insurance Python for UK pricing actuaries: exposure handling, Consumer Duty, frequency-severity split, and Emblem/Radar deployment with glum.
22 Mar 2026

FCA Proxy Discrimination Testing in Python: A Practical Guide

Under Consumer Duty and the Equality Act 2010, non-life insurers must test whether rating factors act as proxies for protected characteristics. Here is exactly how to run that t...
22 Mar 2026

CatBoost for Insurance Pricing: Frequency-Severity on freMTPL2

Build a CatBoost frequency-severity pricing model on freMTPL2 using Polars. Poisson frequency, Gamma severity, combined burning cost, SHAP factor extraction, and distillation to...
20 Mar 2026

MAPIE vs insurance-conformal: Why Generic Conformal Prediction Breaks on Insurance Data

MAPIE is the standard Python library for conformal prediction, but it wasn't designed for insurance. Here is what goes wrong with exposure-weighted portfolios and Tweedie models...
19 Mar 2026

Multi-Step Conformal Prediction for Claims Forecasts: Horizon-Dependent Intervals with Valid Coverage

Single-quantile conformal methods apply one interval width across all forecast horizons. At 12 months ahead the interval is too narrow;
19 Mar 2026

Vine Copula Stress Portfolios for Solvency II: Preserving Joint Tail Structure in Synthetic Scenario Generation

Resampling your real portfolio to generate stress test scenarios destroys the joint tail structure that capital models depend on.
19 Mar 2026

EconML vs insurance-causal: Causal Inference for Insurance Pricing

EconML is the standard Python library for causal ML. It was not built for insurance pricing, Poisson/Gamma exposure models, or the dual-selection bias problems specific to renew...
18 Mar 2026

Whittaker-Henderson Smoothing for Rating Tables: The Penalised Least-Squares Method UK Actuaries Should Already Be Using

Every UK pricing actuary smooths experience tables. Most do it with a 5-point moving average or a polynomial fitted by eye.
18 Mar 2026

Trend Fitting Through Regime Changes: Changepoint Detection Before the Rate Indication

Pricing teams fit log-linear trends through experience with structural breaks. The straight line is wrong on both sides. How to detect and correct it.
18 Mar 2026

DoWhy vs insurance-causal: Which Causal Inference Library Should Insurers Use?

DoWhy is the most rigorous general-purpose causal inference library in Python — DAG specification, formal identification, refutation tests. It was not built for insurance pricin...
17 Mar 2026

DML Works at 1,000 Policies Now. Here Is What Changed.

insurance-causal v0.3.1 fixes over-partialling in DML for small insurance books. Adaptive CatBoost regularisation makes causal estimates reliable at n≥1k.
17 Mar 2026

Bühlmann-Straub Treats Last Year the Same as Five Years Ago

Static credibility weights all years equally. The dynamic Poisson-gamma state-space model weights recent experience more - and quantifies how much more.
16 Mar 2026

Importance-Weighted Evaluation for Portfolio Composition Shift: Diagnosing the Mismatch Before It Shows in Loss Ratios

Density ratio correction for portfolio composition shift - CatBoost classifier, importance-weighted evaluation, insurance-covariate-shift library
16 Mar 2026

The EBM Is Sitting in Your Notebook Because You Can't Show It to the Committee

GLMComparison and MonotonicityEditor in insurance-gam close the governance gap between EBM shape functions and the GLM factor table your pricing committee...
16 Mar 2026

Conformal Reserve Ranges: Finite-Sample Coverage Guarantees for IBNR Intervals

Bootstrap and expert-judgment reserve ranges look like probability statements but carry no frequentist coverage guarantee.
15 Mar 2026

BYM2 Territory Modelling: Posterior Uncertainty Intervals and Year-on-Year Stability from Spatial Smoothing

Two things independent credibility cannot give you: a quantified uncertainty per sector, and stable factors year-on-year.
15 Mar 2026

Monthly Covariate Shift Monitoring: When to Reweight and When to Retrain

How to run covariate shift detection as a recurring monthly check: monitoring cadence, ESS ratio trends, and the thresholds that trigger a retraining...
15 Mar 2026

Adaptive Conformal Inference for Non-Exchangeable Claims Series: Handling Trend Without Retraining

Standard split conformal prediction requires exchangeability - a condition insurance claims time series systematically violate.
15 Mar 2026

Debiasing Price Elasticity Estimates with Double Machine Learning: Removing the Risk Model's Fingerprint

OLS elasticity in formula-rated books is contaminated by your own risk model. insurance-causal fixes this with CausalForestDML and CatBoost nuisance.
14 Mar 2026

Optimal Binning for GLM Rating Factors: Beyond the Eyeball Test

Automated GLM factor banding for UK insurance pricing: R2VF fused lasso, neural embeddings for high-cardinality categoricals, SKATER spatial clustering.
14 Mar 2026

Per-Segment Large Loss Loading with Quantile GBMs: TVaR and ILFs at Risk Level

December is the season for year-end rate reviews where someone adds a flat 8% large loss loading to every segment regardless of tail weight.
14 Mar 2026

EBM, ANAM, or PIN: Choosing an Interpretable Architecture for UK Insurance Pricing

Three interpretable architectures for UK insurance pricing: EBM, ANAM, and PIN via insurance-gam. Refuse the GLM-vs-GBM accuracy trade-off with factor tables.
14 Mar 2026

OLS Elasticity in a Formula-Rated Book Measures the Wrong Thing

CausalForestDML separates causal price effect from risk-lapse correlation in UK motor renewal. insurance-elasticity - per-customer CATE and ENBP optimiser.
13 Mar 2026

Synthetic Difference-in-Differences for Rate Change Evaluation

DiD and Callaway-Sant'Anna for rate change attribution. insurance-causal-policy quantifies what your rate change actually achieved, with FCA Consumer Duty-aligned evidence output.
13 Mar 2026

Actuarial Neural Additive Models: Exact Interpretability with Tweedie Loss

Actuarial Neural Additive Models for UK pricing: exact interpretability, Tweedie loss, guaranteed monotonicity. insurance-gam - beyond SHAP and EBM limitations.
13 Mar 2026

Credibility-Weighted Broker and Scheme Effects with REML

Two-stage CatBoost plus REML random effects for UK insurance broker adjustments. insurance-multilevel - Buhlmann-Straub credibility weighting, not guesswork.
13 Mar 2026

Trend Selection Is Not Actuarial Judgment: A Python Approach

Python insurance trend library: log-linear OLS/WLS, bootstrap CIs, ONS deflation, superimposed inflation, structural break detection - insurance-trend.
13 Mar 2026

HMM-Based Telematics Risk Scoring for Insurance Pricing

Continuous-time HMM for telematics risk scoring in UK motor pricing. Latent driving regimes from GPS data - actuarially interpretable features for Poisson GLM.
13 Mar 2026

Foundation Models for Thin Segments: TabPFN and TabICLv2 in Insurance Pricing

TabPFN and TabICLv2 for thin-segment UK insurance pricing. In-context learning at inference, no gradient descent. insurance-thin-data wraps both for actuaries.
13 Mar 2026

Frequency and Severity Are Two Outputs. You Have One Prediction Interval.

Joint conformal prediction sets for frequency and severity in UK insurance. Fan and Sesia coordinate-wise standardization - simultaneous coverage across both.
13 Mar 2026

Individual Experience Rating Beyond NCD: From Bühlmann-Straub to Neural Credibility

Four-tier experience rating in Python: Buhlmann-Straub, Poisson-Gamma state-space, GBM surrogate, attention credibility. Policy-level multiplicative factors.
13 Mar 2026

Frequency-Severity Dependence in UK Motor: A Shared-Trunk Neural Architecture

Shared-trunk neural model for frequency-severity dependence in UK motor pricing. Explicit dependence testing where two-part GLMs assume independence - Python.
13 Mar 2026

Coverage Is the Wrong Guarantee for Pricing Actuaries

Conformal risk control for UK insurance: coverage calibrated to financial shortfall, not miscoverage rate. insurance-conformal - beyond standard intervals.
13 Mar 2026

Composite Severity Regression: Getting the Tail Right Without Throwing Away the Body

Spliced severity for UK motor BI: lognormal body, GPD tail above a policyholder-specific threshold. insurance-severity - mode-matching, 2,818 lines, 106 tests.
13 Mar 2026

When Did Your Loss Ratio Actually Change?

Bayesian Online Changepoint Detection for UK insurance loss ratios. Poisson-Gamma conjugates, regulatory event priors, Consumer Duty evidence pack - Python.
13 Mar 2026

Mixture Cure Models for Retention Pricing: Separating Structural Non-Lapsers from the At-Risk Book

Logistic regression treats all non-lapsers the same. Mixture cure models split them into two groups: structural non-lapsers who will never leave, and...
12 Mar 2026

Doubly Robust Causal Inference for Insurance: TMLE With Poisson Outcomes

Doubly robust TMLE for insurance pricing with Poisson outcomes and exposure offsets. insurance-tmle - first Python library with the implementation AIPW lacks.
12 Mar 2026

Beyond Lognormal: Normalizing Flows for Insurance Severity Modelling

Neural Spline Flows for bimodal UK motor BI severity - no family assumption. insurance-nflow: TVaR, ILF curves, reinsurance layer costs, fat-tail transform.
12 Mar 2026

The Telematics Score That Forgets Where It's Been

Joint longitudinal-survival model for telematics: driving trajectory not current score. insurance-jlm - Wulfsohn-Tsiatis SREM with mid-term repricing in Python.
12 Mar 2026

GARCH for Claims Inflation: Modelling Volatility That Clusters

GARCH for UK insurance claims inflation: time-varying variance in trend analysis. insurance-garch - Engle (1982) applied to actuarial trend and pricing models.
12 Mar 2026

Vine Copulas for Multi-Peril Home: The Flood-Subsidence Correlation That Costs 9% in Mispriced Revenue

Vine copulas for multi-peril UK home pricing. Flood-subsidence correlation costs ~9% in mispriced revenue. insurance-copula: BIC selection, PML simulation.
12 Mar 2026

Treating Competing Risks as Censored Is Biasing Your Retention and Home Insurance Pricing

Fine-Gray subdistribution hazard for UK insurance competing risks. Separates lapse, MTC, and NTU correctly - insurance-survival Python, not naive censoring.
12 Mar 2026

Causal Fixed Effects for Rate Change Evaluation: Using causalfe on Insurance Panel Data

Causal Forests with Fixed Effects for UK insurance panel data. Rate change evaluation by segment - beyond before-and-after loss ratios. causalfe Python.
12 Mar 2026

Heterogeneous Lapse Effects with Bayesian Causal Forests: Beyond the Average Elasticity

Bayesian Causal Forests for heterogeneous lapse effects in UK insurance pricing. Segment-level elasticity with posteriors - insurance-bcf wrapping stochtree.
12 Mar 2026

Continuous Treatment Causal Inference for Insurance Pricing: insurance-causal

Automatic Debiased ML via Riesz Representers for continuous price elasticity. insurance-causal - no GPS density blow-up at tails. UK personal lines Python.
12 Mar 2026

Borrowing Experience You Don't Have

Transfer learning for thin-segment UK insurance pricing: Tian-Feng GLM algorithm, CatBoost source-as-offset, CANN fine-tuning, negative transfer diagnostics.
11 Mar 2026

MinTrace Reconciliation for Insurance Pricing Hierarchies

MinTrace reconciliation for insurance pricing hierarchies: optimal joint adjustment across peril models and portfolio GLM. Exposure-weighted, Python.
11 Mar 2026

Survival Models for Insurance Retention

Survival models for UK personal lines retention: cure models, survival-adjusted CLV, actuarial lapse tables, MLflow deployment. What lifelines does not do.
11 Mar 2026

Does the Risk Actually Drop at 25? Using Regression Discontinuity to Test Your Age Threshold

Regression Discontinuity Design tests if UK motor risk drops at age 25. Exposure-weighted Poisson outcomes, geographic boundaries, Consumer Duty output.
11 Mar 2026

Double GLM for Insurance: Every Risk Gets Its Own Dispersion

Double GLM gives every UK insurance policy its own dispersion parameter. insurance-dispersion - policy-level Solvency II variance and risk-adequate loading.
11 Mar 2026

Separating Structural Non-Claimers from Risk: Mixture Cure Models for Insurance Pricing

Mixture cure models for UK motor: separates non-claimers from susceptibles. Per-policyholder cure fraction scoring - insurance-survival Python library.
10 Mar 2026

When You Can't Fit a GLM from Scratch: Transfer Learning for Thin Segments

GLMTransfer borrows statistical strength from a related source book to price thin target segments. Motor-to-fleet, home-to-landlord, and fleet roll-outs.
10 Mar 2026

Rate Change Evaluation: Did the Premium Increase Cause the Lapses?

A 12% rate increase on young motor drivers. An 8% lapse spike three months later. Here is how to tell whether the rate change caused it — using synthetic difference-in-differences.
10 Mar 2026

Discrimination-Free Pricing in Python: Causal Paths, Optimal Transport, and the FCA

Discrimination-free UK insurance pricing via Wasserstein barycenter and causal path decomposition. Satisfies FCA Consumer Duty proxy discrimination rules.
10 Mar 2026

GAMLSS in Python, Finally

GAMLSS in Python: seven families, RS algorithm, variance as function of covariates. insurance-distributional-glm - the actuarial implementation Python lacked.
10 Mar 2026

GLMs Predict Means. DRN Predicts Everything Else.

Distributional Refinement Networks wrap any GLM to produce a full predictive distribution. insurance-severity - neural severity modelling for UK motor pricing.
09 Mar 2026

Whittaker-Henderson Smoothing for Insurance Pricing

Whittaker-Henderson smoothing for noisy experience rating tables in Python. REML lambda selection, Bayesian confidence intervals, 2D surface smoothing.
09 Mar 2026

Getting Spatial Territory Factors Into Production

From CatBoost frequency model to BYM2 spatial territory factors for Emblem or Radar. Data engineering, MCMC convergence checks, Polars joins - Python.
09 Mar 2026

Nested GLMs with Neural Network Embeddings for Insurance Ratemaking

Handle 800+ vehicle makes and 9,000+ postcode sectors in a multiplicative GLM using neural embeddings and spatial clustering. Auditable Python pipeline.
09 Mar 2026

Why Generic Synthetic Data Fails Actuarial Fidelity Tests

Actuarially faithful synthetic data via vine copulas and AIC-selected marginals. insurance-synthetic fixes Poisson semantics and tail behaviour SDV gets wrong.
09 Mar 2026

Calibration Testing That Goes Beyond the Residual Plot

The full calibration framework for insurance pricing: balance property test, auto-calibration by price cohort, and Murphy decomposition...
09 Mar 2026

Double Machine Learning for Insurance Pricing: Benchmarks and Pitfalls

Where double machine learning beats naive regression for insurance pricing — and where it does not. Benchmarks on 100,000-policy synthetic UK motor data with known ground truth....
08 Mar 2026

How Do You Know Your Sigma Model Is Working?

Three diagnostics prove a GAMLSS sigma submodel is real: quantile residuals, worm plots, split-sample calibration. From insurance-distributional-glm.
08 Mar 2026

Tracking Trend Between Model Updates with GAS Filters

GAS filters track claims frequency and severity trend between GLM refits. Step-by-step tutorial using insurance-gas on UK motor data.
07 Mar 2026

Quantile GBMs for Insurance: TVaR, ILFs, and Large Loss Loadings

CatBoost MultiQuantile plus actuarial output layer: TVaR, ILFs, large loss loadings, exceedance probabilities for UK insurance pricing. insurance-quantile.
06 Mar 2026

ICC Diagnostics for Group Factor Selection: Which Broker, Scheme, and Fleet Effects Justify REML

ICC diagnostics for multiple group factors in insurance pricing. When broker, scheme, fleet, and postcode sector effects are worth modelling with REML...
06 Mar 2026

Density Ratio Detection for Channel Mix Drift: Correcting Predictions Before the Loss Ratio Reacts

When a new aggregator partnership or competitor exit changes your new business mix, models trained on the old distribution misprice silently.
05 Mar 2026

Distributional GBMs for Insurance: Pricing Variance, Not Just the Mean

insurance-distributional models the full conditional loss distribution, not just the mean. First open-source Python implementation of the ASTIN 2024 Best Paper.
04 Mar 2026

How to Build a Large Loss Loading Model for Home Insurance

Per-risk large loss loadings for UK home insurance using quantile GBMs. Avoids the flat-loading trap by making the loading a function of the risk itself.
04 Mar 2026

GLM Interaction Detection: A Six-Step Walkthrough with CANN, NID, and SHAP

Step-by-step tutorial: plant two interactions in synthetic motor data, detect them with CANN + NID, validate with SHAP, confirm with A/E surfaces, and...
03 Mar 2026

Proxy Discrimination in UK Motor Pricing: Detection and Correction

Detect and correct proxy discrimination in UK insurance using SHAP and insurance-fairness. Protected characteristic leakage detection under FCA Consumer Duty.
02 Mar 2026

Covariate Shift in Motor Pricing: Detection, Correction, and Conformal Intervals

The foundational walkthrough for insurance-covariate-shift: density ratio estimation, ESS/KL diagnostics, importance weighting, shift-robust conformal...
02 Mar 2026

How to Extract GLM-Style Rating Factors from a CatBoost Model

Step-by-step: extract multiplicative CatBoost rating factors using shap-relativities. SHAP decomposition to GLM-format exp(beta) tables with CI and...
01 Mar 2026

Double Machine Learning for Insurance Price Elasticity

Double Machine Learning fixes biased price elasticity in insurance quote data. insurance-optimise: conversion, retention, elasticity, demand curves, FCA GIPP.
01 Mar 2026

From CatBoost to Radar in 50 Lines of Python

Python library distilling CatBoost GBMs into multiplicative GLM factor tables for Radar and Emblem. Open-source GBM-to-GLM distillation for UK pricing teams.
28 Feb 2026

Blending GLMs and GBMs for UK Pricing: Cross-Validated Weights, Not a Choice Between Them

How to combine GLM and GBM predictions for production pricing: cross-validated blend weights, PRA interpretability, and when blending actually helps. Once the blended model is v...
28 Feb 2026

When Credibility Meets CatBoost: Choosing Between Classical and Modern Approaches

Bühlmann-Straub vs CatBoost vs two-stage multilevel for UK motor pricing: when each wins and how insurance-credibility and insurance-multilevel combine them.
28 Feb 2026

Recalibrate or Refit? The Murphy Decomposition Makes it a Data Question

Assumes familiarity with the Murphy decomposition framework. Focuses on the operational question: given a monitoring alert, how do you read GMCB vs LMCB...
27 Feb 2026

Finding the Interactions Your GLM Missed

Automated interaction search for UK motor GLMs using CANN residuals and NID. Bonferroni-corrected shortlist before manual testing - insurance-interactions.
27 Feb 2026

Experience Rating: NCD and Bonus-Malus

Python library for NCD and bonus-malus in UK motor insurance. Optimal claiming thresholds peak at 20% NCD discount, not 65% - derived mathematically.
23 Feb 2026

BYM2 Spatial Smoothing for Territory Ratemaking

BYM2 spatial model in PyMC for UK territory ratemaking. Borrows strength across neighbouring postcode sectors - statistically correct vs k-means banding.
21 Feb 2026

From GBM to Radar: A Complete Databricks Workflow for Pricing Actuaries

Databricks workflow for UK pricing actuaries: CatBoost plus MLflow tracking, SHAP relativities, and Radar export. End-to-end motor pricing in Python.
21 Feb 2026

Constrained Rate Optimisation and the Efficient Frontier

Rate changes that meet a target loss ratio, respect movement caps, and minimise cross-subsidy. Linear programming for UK personal lines pricing teams.
19 Feb 2026

Conformal Prediction Intervals for Insurance Pricing Models

Distribution-free conformal prediction intervals for insurance GBMs. Per-risk coverage guarantees, not confidence intervals for the mean. Python library.
19 Feb 2026

Bühlmann-Straub Credibility in Python: Blending Thin Segments with Portfolio Experience

Buhlmann-Straub credibility in Python for UK personal lines. Blend thin-segment experience with portfolio rates - mathematically equivalent to mixed models.
17 Feb 2026

SHAP Relativities for Insurance GBMs: GLM-Format Factor Tables in Python

How to extract SHAP relativities from insurance GBMs. Multiplicative factor tables in GLM exp(beta) format, with confidence intervals and exposure weighting. Python, CatBoost, U...
17 Feb 2026

Bayesian Hierarchical Models for Thin-Data Pricing

Partial pooling for thin rating cells in UK motor pricing. bayesian-pricing stabilises sparse segments with hierarchical Bayesian models - no data discarded.
28 Jul 2025

Telematics Risk Scoring: From Raw Trips to GLM Features

How to convert raw telematics trip data into GLM-ready features for UK motor pricing. Covers HMM state segmentation and score calibration to GLM relativities.
14 May 2025

Your Frequency-Severity Independence Assumption Is Costing You Premium

Your frequency GLM and severity GLM are both correct. Multiplying them is not. How to test and correct for the dependence your pricing model ignores.
15 Mar 2025

Spliced Severity Distributions: When One Distribution Isn't Enough

A practitioner tutorial on fitting spliced composite severity distributions for UK motor claims using insurance-severity.