Probability of Default (PD) — also called default probability or credit default probability — is the likelihood that an obligor (borrower or counterparty) will fail to meet contractual debt obligations within a specified horizon, typically one year. PD is a forward-looking, per-obligor risk metric used by credit risk teams, investors and regulators. It differs from an observed default rate (a historical cohort frequency) because PD is an estimated probability for each exposure based on information available today.
Intuitively, PD answers: "Given what I know now about this borrower and the macro environment, what is the chance they will default within my chosen horizon?" You can estimate PD at the instrument level (loan, bond) or obligor level (company, household). PD is combined with Exposure at Default (EAD) and Loss Given Default to compute expected loss.
PD underpins most credit processes and risk measures:
PD also influences product-level decisions for lenders and secured portfolio risk management.
The canonical expected loss formula is:
Example: PD = 2% (0.02), EAD = AUD 100,000 and LGD = 45% (0.45):
PD also enters regulatory capital formulas under IRB, where capital is calibrated to unexpected loss (UL) which depends on PD, LGD and asset correlation assumptions.
PIT vs TTC is central to model design and use:
Point-in-Time (PIT) Estimates conditioned on current macro and borrower signals; cyclical and responsive to near-term conditions. Best for pricing, provisioning and forward-looking stress tests, but volatile and requires frequent recalibration and macro overlays.
Through-the-Cycle (TTC) Smoothed estimates over the credit cycle, representing an average default likelihood. Stable and useful for long-term capital allocation and comparability, but can understate near-term risk and is less useful for provisioning and pricing.
Conversion & calibration: You can derive PIT PDs from TTC models using a cycle factor or macro model (e.g., forecasted GDP/unemployment). Model governance must document whether PDs are reported PIT or TTC and how conversions are performed.
Three broad PD estimation approaches exist:
Each approach has trade-offs in data needs, interpretability and use-case fit. Statistical models are flexible and widely used; structural models suit firms with observable market data; market-implied PDs are timely but can be noisy.
Statistical methods are pervasive for retail and SME lending and for obligors without liquid equity markets.
Logistic regression Binary outcome: default within horizon (1) or survival (0). The model specification is logit(PD) = β₀ + β₁X₁ + … + βₖXₖ. Score-to-PD conversion: PD = 1 / (1 + exp(−score)). Typical predictors include financial ratios (interest cover, leverage), payment history, utilisation, borrower age, industry and macro variables. Calibration requires choosing a training window and handling class imbalance (defaults are rare) with sampling or penalisation.
Survival / hazard models Useful for modelling time-to-default and handling censoring (prepayments, cures). They estimate an instantaneous hazard λ(t|X) and convert it to a cumulative PD over the required horizon. Use when time at risk varies across accounts or when censoring is material.
Machine learning Tree-based methods and boosting often improve ranking but can miscalibrate probabilities. Convert ML scores to PDs via Platt scaling (sigmoid), isotonic regression or binning calibration. Use explanation tools (SHAP, partial dependence) to retain interpretability for governance.
Practical tips:
Structural models relate a firm's default risk to economic fundamentals and market prices. They leverage equity prices and balance-sheet information to estimate a firm's distance-to-default. Strengths include connecting fundamentals to default risk and using market signals. Limitations include the requirement for liquid market data, reliance on simplifying assumptions, and potential to miss defaults occurring between maturity dates.
Structural models are complementary to statistical and market-implied approaches, especially for large corporates with liquid equity.
Market-implied PDs use traded instruments that reflect credit risk. A simple approximation under a constant hazard rate h and assumed recovery R is:
and the 1-year PD is
Caveats: CDS spreads include liquidity, counterparty and funding premia. Recovery R is uncertain; different R values materially change PD. Bond-implied PDs require stripping risk-free and liquidity premia.
For market context and methodologies see the Reserve Bank of Australia (RBA) at https://www.rba.gov.au/ and the Bank for International Settlements (BIS) at https://www.bis.org/.
Ratings-to-PD mappings use historical default studies (Moody's, S&P). Transition matrices show empirical probabilities of moving between rating buckets over time (e.g., 1-year transitions).
Uses include:
When using mappings, perform vintage analysis and update tables to reflect current default environments.
Example 1 — Loan-level PD (logistic)
Model: logit(PD) = −3.0 + 0.8·(leverage) − 0.5·(interest cover) + 0.6·(payment delinquencies)
Borrower data: leverage = 2.0, interest cover = 4.0, delinquencies = 1
Calculation:
Example 2 — CDS-implied PD
Input data: 1-year CDS spread s = 300 bps = 0.03; assume recovery R = 40% (0.40)
Calculation:
Spreadsheet steps: For logistic PD, implement score formula then PD = 1 / (1 + EXP(−score)). For CDS PD, input spread and recovery; hazard = spread / (1 − recovery); PD = 1 − EXP(−hazard × horizon).
Key data needs include sufficient history of defaults and survivals (including censored observations), granular borrower characteristics (financials, payment records, behavioural data), macro indicators for PIT calibration (GDP growth, unemployment, house prices), and market data for structural/market-implied models (equity prices, CDS spreads).
Common pitfalls to avoid:
Validation checklist:
Discrimination: Test AUC (ROC area) and KS statistic.
Calibration: Create calibration plots (predicted PD bins vs observed defaults) and calculate Brier score:
where yi is the 0/1 default outcome.
Backtesting PDs: Perform unconditional backtest (compare average predicted PD to observed default rate) and conditional backtest (test calibration by score band or rating). Use appropriate statistical tests but account for low default frequency.
Stress & sensitivity testing: Apply macro stress scenarios and re-run PDs. Test sensitivity to recovery assumptions for market-implied PDs and to model choices.
Governance & frequency: Document purpose, inputs, assumptions and limitations. Recalibrate PIT PDs at least quarterly or on material performance drift; TTC PDs less frequently. Maintain independent validation, version control and audit trail in line with supervisory expectations.
| Step | Purpose |
|---|---|
| Data lineage check | Verify completeness and censorship handling |
| Discrimination tests (AUC, KS) | Assess rank ordering |
| Calibration plots & Brier score | Assess probability accuracy |
| Backtesting by vintage | Track out-of-time performance |
| Sensitivity to recovery/macro | Quantify fragility |
| Governance documentation | Ensure auditability and controls |
Under the Basel framework, IRB approaches require institutions to estimate PD, LGD and EAD with robust models and governance. PD estimates feed risk-weighted assets and capital via functions that consider default correlation and PD level. For Basel material see the BIS at https://www.bis.org/.
Regulatory expectations in Australia align with Basel principles. Maintain credible model governance, independent validation and documented calibration. Model outputs used for regulatory capital are subject to supervisor review; APRA publishes prudential guidance and thematic reviews at https://www.apra.gov.au/. ASIC covers conduct and disclosure where credit decisioning affects consumers at https://asic.gov.au/. The Reserve Bank provides credit and financial stability data that inform macro overlays at https://www.rba.gov.au/. Relevant legislation and insolvency frameworks are published on the Australian Government legislation site at https://www.legislation.gov.au/. For tax and administrative context that can affect credit outcomes see the Australian Taxation Office at https://www.ato.gov.au/.
When preparing regulatory submissions or internal capital models, ensure PD methods, PIT/TTC treatment and mapping logic are transparent, auditable and evidence-based.
Limitations: PD estimates are model-dependent and sensitive to data, assumptions and economic shifts. Market-implied PDs reflect risk premia and liquidity; structural models rely on strong assumptions. Rare default events create statistical uncertainty; account for model risk in capital and governance.
Best practices:
PD is an estimated probability for an individual obligor over a horizon; the observed default rate is a historical proportion of defaults in a cohort. PDs can be PIT or TTC; observed default rates are retrospective.
One year is standard for regulatory and many commercial uses, but multi-year PDs are used for portfolio planning or lifetime expected loss (e.g., IFRS 9).
PIT PDs often need quarterly recalibration or when macro conditions change; TTC PDs recalibrated less frequently (annually). Both require ongoing monitoring.
It depends on data and use-case. Statistical methods are versatile and common; structural models suit listed corporates; market-implied PDs are timely but noisier.
Use logistic mapping, Platt scaling, isotonic regression or empirical binning with observed default rates. Validate out-of-sample.
Very sensitive — small changes in assumed recovery materially change implied PD. Report sensitivity ranges.
This article is general information only and is not legal, tax or financial advice.