Model bias in machine learning is a lifecycle issue that can enter through problem framing, data collection, labeling, feature design, optimization objectives, and deployment feedback loops. Effective practice pairs subgroup measurement, using complementary fairness and performance metrics, with targeted mitigations, strong governance, documentation, and ongoing monitoring.
Context: why “model bias” shows up in real systems
Machine learning systems do not operate in a vacuum: they are trained on historical data, optimized toward explicit objectives, and deployed into socio-technical processes. In practice, “model bias” usually refers to systematic performance or outcome differences that disadvantage certain groups, especially when the model is used for decisions that affect people (credit, hiring, healthcare, public services).
Definitions and scope (avoid ambiguity)
Statistical bias (estimation bias): a systematic error between an estimated quantity and the true quantity; important in measurement and evaluation.
Societal or fairness-related bias: systematic disparities in model outcomes or errors across groups that are ethically, legally, or operationally unacceptable.
Protected attributes and sensitive characteristics: traits such as race, sex, age, disability status, and other characteristics defined by regulation or policy; these vary by jurisdiction and context.
Fairness is not a single metric: “fair” depends on the decision context, the harm model, and the constraints; many fairness definitions cannot be satisfied simultaneously (for example, calibration by group and equalized odds generally cannot both hold when base rates differ).
Where bias enters the ML lifecycle
Bias can be introduced at multiple points; treating it only as a “modeling issue” is a common failure mode.
1) Problem framing and target definition
Wrong objective: optimizing a proxy (e.g., “likelihood to repay”) when the business process actually needs “ability to repay” can replicate historical exclusion.
Label-choice risk: targets derived from human decisions (arrests, approvals, performance ratings) encode prior policies and discretion.
Decision thresholding: even with a well-calibrated score, the chosen cutoff and downstream workflow can create disparate impacts.
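To make the thresholding point concrete, the short sketch below applies one global cutoff to two groups with different (synthetic, purely illustrative) score distributions and compares the resulting selection rates.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical, synthetic risk scores for two groups whose distributions differ.
scores_a = rng.beta(a=6, b=3, size=10_000)  # group A skews toward higher scores
scores_b = rng.beta(a=4, b=5, size=10_000)  # group B skews toward lower scores

cutoff = 0.6  # one global decision threshold

selection_rate_a = (scores_a >= cutoff).mean()
selection_rate_b = (scores_b >= cutoff).mean()

print(f"Group A selection rate: {selection_rate_a:.1%}")
print(f"Group B selection rate: {selection_rate_b:.1%}")
# Even if each group's scores were individually well calibrated, the shared
# cutoff produces very different selection rates, i.e. a potential disparate impact.
```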
2) Data collection and representation
Sampling and coverage gaps: under-representation of groups, regions, languages, device types, or edge cases; “missing not at random” patterns.
Historical bias: the dataset reflects unequal access, discrimination, or different treatment by institutions.
Measurement and instrument bias: what is recorded (and how) differs by group (e.g., different documentation practices, reporting behavior, sensor quality).
Aggregation bias: a single global model ignores meaningful subgroup differences in feature-outcome relationships.
3) Feature engineering and data transformations
Proxy features: variables that correlate strongly with a protected attribute (ZIP code, school, name embeddings) can recreate sensitive information; a lightweight screening sketch follows this list.
Leakage from the future or from decisioning: features that encode prior decisions can lock in feedback loops (e.g., “number of prior denials”).
Normalization/encoding choices: transformations can amplify group differences (e.g., imputation strategies that systematically distort minority groups).
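One lightweight screen for proxy features is to test how well each candidate feature, on its own, predicts the sensitive attribute. The sketch below is a minimal version of that check, assuming a pandas DataFrame with a binary sensitive-attribute column; the column and feature names in the usage comment are hypothetical.

```python
import pandas as pd
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

def proxy_screen(df: pd.DataFrame, feature_cols: list[str], sensitive_col: str) -> pd.Series:
    """Score each candidate feature by how well it alone predicts a binary
    sensitive attribute (higher AUROC => stronger proxy)."""
    y = df[sensitive_col]
    scores = {}
    for col in feature_cols:
        X = pd.get_dummies(df[[col]], drop_first=True)  # one-hot encode categoricals
        scores[col] = cross_val_score(
            LogisticRegression(max_iter=1000), X, y, cv=5, scoring="roc_auc"
        ).mean()
    return pd.Series(scores).sort_values(ascending=False)

# Usage sketch (DataFrame, feature names, and "group" column are hypothetical):
# print(proxy_screen(df, ["zip_code", "school", "referral_channel"], sensitive_col="group"))
```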
4) Labeling and ground truth creation
Annotator bias and inconsistency: subjective tasks (toxicity, sentiment, “quality”) vary across cultures and groups.
Unequal label quality: some groups have noisier or less complete labels, which increases error rates for those groups.
5) Modeling and optimization
Loss functions and class imbalance: maximizing overall accuracy typically prioritizes majority groups.
Regularization and hyperparameters: choices can increase underfitting for minority groups.
Post-processing rules: business rules applied after model scoring can reintroduce disparities.
6) Deployment, feedback loops, and drift
Selective labels: you may only observe outcomes for people who pass a gate (e.g., only funded loans have repayment labels), creating biased evaluation.
Behavioral response: users adapt to the model (gaming, avoidance), shifting data distributions.
Temporal drift: performance changes over time, and subgroup drift often appears earlier than global drift.
A practical bias detection approach (measure before you mitigate)
Bias detection should be implemented as a repeatable evaluation practice, not a one-off analysis.
Step 1: define “who” and “what harm”
Identify relevant groups (protected classes, plus operational segments such as geography, language, disability accommodations, channel, device).
Define the decision and harms: false positives vs false negatives often have different real-world costs by group.
Confirm what data you are allowed to use for fairness analysis (privacy, consent, legal restrictions); governance must be explicit.
Step 2: establish a measurement set and baselines
Use a stable evaluation dataset with clear provenance and time boundaries.
Ensure enough sample size per group; if not, treat results as high-uncertainty and plan additional data collection (see the uncertainty sketch after this list).
Compare to baselines (simple model, rules-based approach, or prior version) to avoid “improvements” that worsen equity.
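To support the sample-size caveat above, the sketch below (assuming statsmodels is available) reports each group's error rate with a 95% Wilson confidence interval; wide or overlapping intervals indicate that an observed gap may be noise rather than signal. Column names are hypothetical.

```python
import pandas as pd
from statsmodels.stats.proportion import proportion_confint

def per_group_error_ci(df: pd.DataFrame, group_col: str, error_col: str) -> pd.DataFrame:
    """Error rate per group with a 95% Wilson confidence interval.
    `error_col` is expected to be 0/1, with 1 meaning the model erred on that row."""
    rows = []
    for group, sub in df.groupby(group_col):
        n, k = len(sub), int(sub[error_col].sum())
        lo, hi = proportion_confint(k, n, alpha=0.05, method="wilson")
        rows.append({"group": group, "n": n, "error_rate": k / n,
                     "ci_low": lo, "ci_high": hi})
    return pd.DataFrame(rows)
```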
Step 3: evaluate with a portfolio of metrics
No single metric is sufficient; use a small set aligned to the decision.
Performance metrics by group: precision/recall, false positive rate (FPR), false negative rate (FNR), AUROC/AUPRC (with caution when prevalence differs).
Error parity metrics: equal opportunity compares TPR across groups; equalized odds compares both TPR and FPR.
Outcome parity metrics: demographic parity / selection rate parity measure differences in positive outcomes (appropriate only in certain settings).
Calibration by group: for score-based decisions, verify that predicted risk aligns with observed outcomes per group.
Uncertainty and confidence intervals: report statistical uncertainty to avoid overreacting to small-sample noise.
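A minimal sketch of per-group metric computation, assuming an evaluation DataFrame with hypothetical columns y_true (0/1 labels), y_pred (0/1 decisions), y_score (predicted risk), and group; production code should add bootstrap or analytic confidence intervals per the uncertainty note above.

```python
import numpy as np
import pandas as pd

def group_metrics(df: pd.DataFrame, group_col: str = "group") -> pd.DataFrame:
    """Selection rate, TPR, FPR, FNR, and a crude calibration gap per group.
    Expects columns y_true (0/1), y_pred (0/1), and y_score in [0, 1]."""
    rows = []
    for group, g in df.groupby(group_col):
        y, yhat, s = g["y_true"], g["y_pred"], g["y_score"]
        pos, neg = (y == 1), (y == 0)
        rows.append({
            "group": group,
            "n": len(g),
            "selection_rate": yhat.mean(),
            "tpr": yhat[pos].mean() if pos.any() else np.nan,  # equal opportunity input
            "fpr": yhat[neg].mean() if neg.any() else np.nan,
            "fnr": (1 - yhat[pos].mean()) if pos.any() else np.nan,
            # Mean predicted risk vs observed base rate, as a first calibration check.
            "calibration_gap": s.mean() - y.mean(),
        })
    return pd.DataFrame(rows).set_index("group")

# Gaps between groups (e.g. max TPR minus min TPR) can then be tracked as release metrics.
```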
Step 4: investigate root causes, not just symptoms
Slice analysis: drill down within groups (intersectional analysis) and by key conditions (region, channel, product type).
Data lineage checks: validate whether disparities come from upstream systems, transformations, or labeling.
Counterfactual checks where appropriate: test whether small changes in sensitive/proxy features change predictions disproportionately.
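Where a counterfactual probe is appropriate and permitted, one simple version is to set a sensitive or proxy feature to a fixed alternative value and measure how far predictions move. The sketch below assumes a fitted scikit-learn-style binary classifier with predict_proba and a feature DataFrame; the feature name in the usage comment is hypothetical, and single-feature flips ignore correlated features, so treat results as a screen rather than proof.

```python
import pandas as pd

def counterfactual_shift(model, X: pd.DataFrame, column: str, new_value) -> pd.Series:
    """Absolute change in predicted probability when `column` is set to `new_value`
    for every row, holding all other features fixed. Large shifts suggest the model
    leans heavily on that (proxy) feature."""
    baseline = model.predict_proba(X)[:, 1]
    X_cf = X.copy()
    X_cf[column] = new_value
    return pd.Series(abs(model.predict_proba(X_cf)[:, 1] - baseline), index=X.index)

# Usage sketch (model, X_test, and the column name are hypothetical):
# shifts = counterfactual_shift(model, X_test, "zip_code_cluster", "A")
# shifts.describe()  # summarizes how sensitive predictions are to that single change
```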
Mitigation strategies mapped to where bias originates
Mitigation works best when it addresses the causal source; “fairness fixes” that ignore root causes often fail in production.
Data and label interventions (pre-model)
Improve representation: targeted data acquisition, sampling strategies, and coverage testing.
Reweighting or resampling: reduce imbalance while monitoring overfitting and distribution shift; a reweighing sketch follows this list.
Label quality programs: clearer labeling guidelines, annotator training, adjudication, and audits for inter-annotator agreement.
Remove leakage and problematic proxies: apply feature governance and document rationale for inclusion/exclusion.
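As one concrete form of reweighting, the sketch below computes the classic reweighing weights w(g, y) = P(g)·P(y) / P(g, y), under which group membership and the label are independent in the weighted sample; the resulting weights can be passed to most scikit-learn estimators via sample_weight (frame and column names are hypothetical).

```python
import pandas as pd

def reweighing_weights(df: pd.DataFrame, group_col: str, label_col: str) -> pd.Series:
    """Per-row weights w(g, y) = P(g) * P(y) / P(g, y). After weighting, group
    membership and the label are statistically independent in the training data,
    which counteracts group-specific label imbalance."""
    p_group = df[group_col].value_counts(normalize=True)
    p_label = df[label_col].value_counts(normalize=True)
    p_joint = df.groupby([group_col, label_col]).size() / len(df)

    def weight(row):
        g, y = row[group_col], row[label_col]
        return (p_group[g] * p_label[y]) / p_joint[(g, y)]

    return df.apply(weight, axis=1)

# Usage sketch (hypothetical names):
# w = reweighing_weights(train_df, group_col="group", label_col="label")
# model.fit(X_train, y_train, sample_weight=w)  # then re-check subgroup metrics and overfitting
```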
Model-time interventions (in-model)
Fairness-aware objectives/constraints: incorporate constraints to reduce specific disparities (e.g., limit FNR gaps) while explicitly tracking accuracy trade-offs; a training sketch follows this list.
Group-aware modeling: when justified and permissible, model architectures or training regimes that better capture subgroup patterns.
Robustness techniques: stress tests for worst-case subgroup performance rather than only average performance.
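A sketch of a fairness-constrained training run, assuming the open-source fairlearn package (reductions API) is installed and that an equalized-odds-style constraint actually matches the decision's harm model; the data here is synthetic and purely illustrative.

```python
# Sketch only: assumes the fairlearn package (reductions API) and scikit-learn.
import numpy as np
from fairlearn.reductions import ExponentiatedGradient, EqualizedOdds
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)

# Synthetic stand-in data; replace with real features, labels, and group column.
X = rng.normal(size=(2000, 5))
group = rng.choice(["a", "b"], size=2000)
y = (X[:, 0] + 0.5 * (group == "b") + rng.normal(scale=0.5, size=2000) > 0).astype(int)

# Trade accuracy against an equalized-odds constraint (TPR/FPR gaps across groups).
mitigator = ExponentiatedGradient(
    estimator=LogisticRegression(solver="liblinear"),
    constraints=EqualizedOdds(),
)
mitigator.fit(X, y, sensitive_features=group)
y_pred = mitigator.predict(X)

# Re-evaluate both accuracy and the subgroup metrics afterwards: constraints
# shift trade-offs rather than eliminate them.
```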
Decision and policy interventions (post-model)
Threshold optimization by policy: choose cutoffs using utility and harm analysis; document rationale and approvals (a cost-based sketch follows this list).
Human-in-the-loop controls: escalation paths for borderline cases, appeals, and override tracking; human review must be audited for consistency to avoid reintroducing bias.
Process redesign: sometimes the right fix is not in the model but in the workflow (additional verification channels, alternative evidence, user support).
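One way to make the cutoff choice explicit and auditable is to pick the threshold that minimizes expected cost under stated false-positive and false-negative costs, then record those costs alongside the resulting per-group error rates. A minimal sketch, with hypothetical cost values:

```python
import numpy as np

def choose_threshold(y_true: np.ndarray, y_score: np.ndarray,
                     cost_fp: float, cost_fn: float) -> float:
    """Return the score cutoff that minimizes cost_fp * FP + cost_fn * FN
    on the evaluation set."""
    thresholds = np.unique(y_score)
    costs = []
    for t in thresholds:
        pred = y_score >= t
        fp = np.sum(pred & (y_true == 0))
        fn = np.sum(~pred & (y_true == 1))
        costs.append(cost_fp * fp + cost_fn * fn)
    return float(thresholds[int(np.argmin(costs))])

# Usage sketch with hypothetical costs: a missed positive judged 5x as harmful as a false alarm.
# cutoff = choose_threshold(y_val, scores_val, cost_fp=1.0, cost_fn=5.0)
# Document the costs and the chosen cutoff, then report per-group FPR/FNR at that cutoff.
```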
Governance and documentation (treat fairness as a data management capability)
Bias control is a governance problem as much as a modeling problem.
Accountability and RACI: define who owns fairness requirements, who approves releases, and who responds to incidents.
Policy-aligned controls: align fairness evaluation with data governance practices (data classification, access control, retention, and auditability).
Documentation artifacts: maintain model cards, datasheets for datasets, decision logs for threshold and policy changes, and a risk register for known limitations.
Change management: require fairness regression testing as part of the analytics/model development lifecycle (CI checks, release gates).
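Fairness regression testing can ride on the existing release gates; the sketch below is a pytest-style check in which group_metrics is the per-group metrics helper sketched earlier, load_release_eval_frame and the gap limits are hypothetical, and the acceptable gaps should come from documented policy rather than the test author.

```python
# Illustrative pytest-style release gate. `group_metrics` is the per-group metrics
# helper sketched earlier; `load_release_eval_frame` is a hypothetical loader for
# the frozen release evaluation dataset.
from fairness_checks import group_metrics, load_release_eval_frame  # hypothetical module

MAX_TPR_GAP = 0.05        # hypothetical policy limit on the equal-opportunity gap
MAX_SELECTION_GAP = 0.10  # hypothetical policy limit on the selection-rate gap

def test_subgroup_gaps_within_policy():
    metrics = group_metrics(load_release_eval_frame())
    tpr_gap = metrics["tpr"].max() - metrics["tpr"].min()
    sel_gap = metrics["selection_rate"].max() - metrics["selection_rate"].min()
    assert tpr_gap <= MAX_TPR_GAP, f"TPR gap {tpr_gap:.3f} exceeds policy limit"
    assert sel_gap <= MAX_SELECTION_GAP, f"Selection-rate gap {sel_gap:.3f} exceeds policy limit"
```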
Monitoring in production (bias is a lifecycle issue)
Track global and subgroup metrics on a schedule aligned to decision volume and risk (daily/weekly for high-volume, high-impact systems).
Monitor data drift and label drift by group; disparities often emerge via upstream process changes.
Watch for selective labels: when outcomes are only observed for a subset, build monitoring that explicitly accounts for censoring and feedback loops.
Establish incident thresholds and playbooks (pause model, roll back, trigger review, notify stakeholders).
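A minimal sketch of scheduled subgroup monitoring, assuming decisions are logged to DataFrames with hypothetical columns y_score, y_pred, and group; it compares the current window against a frozen reference window using a population stability index per group plus selection-rate shifts, and returns alerts when hypothetical thresholds are breached.

```python
import numpy as np
import pandas as pd

def psi(reference: np.ndarray, current: np.ndarray, bins: int = 10) -> float:
    """Population stability index between reference and current score samples."""
    edges = np.unique(np.quantile(reference, np.linspace(0, 1, bins + 1)))
    ref_counts = np.histogram(np.clip(reference, edges[0], edges[-1]), bins=edges)[0]
    cur_counts = np.histogram(np.clip(current, edges[0], edges[-1]), bins=edges)[0]
    ref_pct = np.clip(ref_counts / len(reference), 1e-6, None)
    cur_pct = np.clip(cur_counts / len(current), 1e-6, None)
    return float(np.sum((cur_pct - ref_pct) * np.log(cur_pct / ref_pct)))

def monitor(reference: pd.DataFrame, current: pd.DataFrame, group_col: str = "group",
            psi_alert: float = 0.2, selection_gap_alert: float = 0.10) -> list[str]:
    """Alert messages for per-group score drift and selection-rate shifts.
    Thresholds here are hypothetical; set them per the documented incident playbook."""
    alerts = []
    for g in current[group_col].unique():
        ref_g = reference[reference[group_col] == g]
        cur_g = current[current[group_col] == g]
        if ref_g.empty:
            alerts.append(f"Group {g} appears in production with no reference data")
            continue
        drift = psi(ref_g["y_score"].to_numpy(), cur_g["y_score"].to_numpy())
        if drift > psi_alert:
            alerts.append(f"Score drift for group {g}: PSI={drift:.2f}")
        gap = abs(cur_g["y_pred"].mean() - ref_g["y_pred"].mean())
        if gap > selection_gap_alert:
            alerts.append(f"Selection-rate shift for group {g}: {gap:.2f}")
    return alerts
```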
Common pitfalls to avoid
Treating fairness as “remove sensitive attributes and you are done”; proxies and structural bias remain.
Reporting only overall accuracy and ignoring subgroup error rates.
Using a fairness metric without confirming it matches the decision context and harm model.
Mitigating on the training set but failing to validate on time-based splits and production slices.
Ignoring downstream business rules and human processes that undo technical mitigations.
Key takeaways
Model bias typically originates across problem framing, data, labeling, optimization, and deployment; mitigation must be mapped to the true source.
Bias detection should be systematic: define groups and harms, evaluate with multiple fairness and performance metrics, and quantify uncertainty.
Sustainable mitigation requires governance, documentation, and production monitoring, not just algorithmic changes.