Every Day Statistics
Statistical Self-Defence
Start here
The reference library

Concepts, plainly defined.

Every concept in the curriculum. Each one a single idea, in plain words. Use the curriculum to learn; use the library to remember.

Anchoring and Numerical Estimation
The first number you encounter shapes every estimate that follows, even when you know that number was chosen at random. Anchoring operates in salary negotiations, retail pricing, and courtroom sentencing. Understanding it changes how you read any situation where someone else names a number first.
Anti-Curriculum
Base Rate Neglect
The most consequential cognitive error in everyday probabilistic reasoning. When a specific piece of evidence arrives, the mind discards the background frequency of the event in the population. The cab problem shows you exactly how this happens, and Bayes' theorem shows you exactly how to stop it.
Anti-Curriculum Essential
Bayes as the Synthesis: The Calibrated Mind
The final unit. Bayes' theorem is not just a calculation — it is a description of how a rational mind should work. Every cognitive failure in this area is, at bottom, a failure to apply Bayes. Calibration. Superforecasting. Prediction markets. And the three questions that, if you ask them habitually, will change how you think.
Anti-Curriculum
Bayes vs Null-Hypothesis Testing
The two dominant frameworks in statistics answer completely different questions. Null-hypothesis testing tells you how surprising your data would be if nothing were going on. Bayesian inference tells you how probable your hypothesis is, given what you now know. These are not the same thing, and the difference matters every time a drug is approved, a trial proceeds to verdict, or a screening programme is designed.
Measurement
Cherry-Picking — Time Periods, Subgroups, and Studies
Any dataset contains patterns. The selection of which patterns to present is a powerful tool. This unit covers cherry-picked time periods, subgroup analyses, and selective citation of studies.
Detection Essential
Conditional Probability
The probability of an event changes depending on what you already know. P(A|B) is not the same as P(B|A), and confusing the two has sent innocent people to prison and led doctors to badly misread test results. This unit gives you the tool that sits at the heart of Bayes' theorem.
Foundations Essential
Confidence Intervals — What the Uncertainty Band Actually Says
Almost everyone who reads a confidence interval misreads it. The misreading is not a minor technicality. It is the difference between knowing something is uncertain and believing it is not.
Measurement
Correlation Is Not Causation
The most frequently stated statistical fact and the least frequently applied one. Three structural reasons why correlation does not imply causation — confounding, reverse causation, and chance — with the Bradford Hill criteria as a framework for when it might.
Detection Essential
Descriptive Statistics: The Full Picture
Mean and median were the start. This unit adds variance, standard deviation, percentiles, quartiles, the five-number summary, box plots, and weighted averages — the full toolkit for understanding what a dataset actually looks like, and for noticing when someone has shown you only part of it.
Measurement
Expected Value: The Number Behind Every Bet, Ticket, and Policy
What probability theory predicts will happen on average, over many trials. How to calculate it, when to use it, when it misleads you, and why the people selling you lottery tickets and insurance policies already know it and are counting on you not to.
Foundations
First Encounter with Bayes' Theorem
The most important single idea in the curriculum, introduced intuitively. A positive test result for a rare disease is probably wrong, even when the test is 99% accurate. Understanding why requires prior probability, likelihood, and posterior belief — the three ingredients of Bayesian reasoning.
Foundations
Hypothesis Testing: The Full Framework
The complete null-hypothesis significance testing procedure, assembled from confidence intervals, p-values, and error types. One-tailed versus two-tailed tests, degrees of freedom, multiple comparisons, and the one trick that inflates false discoveries without anyone technically cheating.
Measurement
Numbers at Scale
A million and a billion sound like cousins. They are not. Most numerical manipulation in public life depends on you not feeling the difference. This unit corrects that, permanently.
On-Ramp
Overconfidence and Calibration
People's confidence in their own judgements reliably exceeds their accuracy. This is not a personality flaw — it is a systematic feature of how human minds form predictions. Understanding it, and the correctives that actually work, is the difference between someone whose plans routinely collapse and someone who consistently delivers.
Anti-Curriculum
P-Hacking and Data Dredging
Statistical significance can be manufactured without fraud, simply by making enough analysis choices until the right number appears. This is p-hacking, and it is built into how academic publishing works. Understanding it changes how you read every headline that says 'scientists find'.
Detection
P-Values: The Most Abused Number in Science
The p-value is the most cited and most misunderstood number in public science. Almost everyone who uses it is using it wrong, including many scientists. Here is what it actually means, why the 0.05 threshold is completely arbitrary, and why a result can be both statistically significant and completely pointless.
Measurement
Percentages, Fractions, and Ratios
The vocabulary of proportion. Percentages, fractions, and ratios are the building blocks of almost every statistical claim — and the source of some of the most reliably effective manipulation. This unit covers the percentage point versus percentage change distinction, what 'X times more likely' actually means, and why every percentage needs a denominator.
On-Ramp
Publication Bias and the File Drawer Problem
The studies that get published are not a random sample of all studies conducted. Positive findings are published; negative findings are buried. The result is a scientific literature systematically tilted toward conclusions that support the people funding the research — and a public making decisions based on a distorted picture.
Detection
Reading Graphs Critically
A graph communicates through visual impression before you process the numbers. That impression can be engineered. Five tricks — the truncated y-axis, the compressed x-axis, area distortion, dual axes, and the missing baseline — account for the vast majority of graph manipulation you will encounter.
On-Ramp Essential
Regression to the Mean
Extreme measurements are followed by less extreme ones — not because anything changed, but because that is how randomness works. Misattributing this statistical law to an intervention is one of the most common errors in medicine, management, and sport.
Detection
Risk Communication: The Full Toolkit
The synthesis unit. NNT, NNH, absolute vs relative risk, natural frequencies, icon arrays, sensitivity, specificity, PPV, NPV, and how PPV collapses when you screen a low-prevalence population. The HIV test worked through in full. The mammography debate as a case study in honest versus dishonest risk communication.
Detection
Sample Size: How Many Is Enough?
A thousand randomly chosen people can accurately represent sixty million. A million carefully selected people might tell you almost nothing. Sample size matters, but size alone is not the point. How the sample was gathered is what determines whether the number is meaningful.
Measurement
Sampling: What a Study Tells You
Every study is a sample. The quality of the conclusion depends on the quality of the sample. Population vs sample, random sampling, convenience sampling, self-selection bias, and stratified sampling — with the 1936 Literary Digest disaster as the object lesson.
Measurement Essential
Survivorship Bias
We only see the planes that came back. Abraham Wald noticed this in 1943 and saved bomber crews' lives. The same blind spot distorts investment returns, entrepreneurship mythology, and medical evidence — wherever failures disappear from the data before you see it.
Detection Essential
The Availability Heuristic
We judge the probability of an event by how easily examples come to mind. Vivid, recent, or emotionally charged events are overweighted. This one cognitive shortcut shapes public policy, insurance markets, and personal decisions in ways that have nothing to do with actual risk.
Anti-Curriculum
The Conjunction Fallacy
People consistently judge a specific, detailed scenario as more probable than a broader one that contains it. This violates a foundational rule of probability. Understanding why it happens, and what it looks like in the wild, is the protection.
Anti-Curriculum
The Ecological Fallacy
Using statistics about groups to draw conclusions about individuals is a structural error with serious real-world consequences — in criminal profiling, public health policy, and cross-national research. Here is how to recognise it and its mirror image.
Detection
The Gambler's Fallacy
Past random events are taken to influence future random events, even when there is no causal mechanism connecting them. The casino industry is built, in part, on this error. Understanding it — really understanding it, not just nodding at it — changes how you read any streak.
Anti-Curriculum
The Hot Hand Fallacy: And Why It Is Different
The belief that a player on a hot streak will keep performing well was declared a cognitive illusion in 1985. Then in 2018 a mathematician found a flaw in the original analysis. The hot hand may be real after all. This unit covers what changed, why it matters, and how it connects to a deeper point about dependent processes.
Anti-Curriculum
The Laws of Probability
The four formal rules that govern how probabilities combine — addition for exclusive events, addition for non-exclusive events, multiplication for independent events, and the complementary rule. Plus the specific way these rules get weaponised to mislead.
Foundations
The Misleading Graph — A Comprehensive Guide
Every trick in the visual statistics arsenal: the truncated axis, the inverted axis, the dual axis, the area distortion, and the missing uncertainty band. With documented real-world examples from governments, media, and financial advertising.
Detection Essential
The Normal Distribution
Why the bell curve appears everywhere, what it actually implies, and what happens when the people running your pension fund assume it holds when it does not. The 68-95-99.7 rule, z-scores, and the normality assumption that helped cause the 2008 financial crisis.
Measurement
The Prosecutor's Fallacy
Confusing P(evidence|innocent) with P(innocent|evidence). This error has contributed to wrongful convictions. The Sally Clark case, DNA database searches, and how to spot the tell in any courtroom probabilistic argument.
Detection
The Relative Risk Trick
The single most important unit in the curriculum. The most common trick in medicine, policy, and advertising. A treatment that reduces risk from 2% to 1% is '50% effective' — but reduces your absolute risk by one percentage point.
Detection Essential
The Replication Crisis
Most published scientific findings do not replicate. This is not a failure of individual scientists — it is a structural consequence of how science is conducted and reported. Here is what it means, why it happened, and how to read a single study properly.
Measurement
The Representativeness Heuristic
Judgements of probability are replaced by judgements of similarity to a prototype. The conjunction fallacy, base rate neglect, and the small sample fallacy are all the same mistake wearing different clothes. When you understand the single mechanism behind all three, you become much harder to fool.
Anti-Curriculum
The Three Averages: First Contact
The word 'average' conceals three different calculations, each giving a different answer. This unit shows why mean and median diverge in skewed distributions — and how choosing between them is one of the most common tools of numerical manipulation.
On-Ramp Essential
The Unrepresentative Sample
The politics of who gets studied and who does not. WEIRD psychology populations, male-dominant clinical trials, the Tuskegee legacy, volunteer bias, and attrition bias — and how conclusions drawn from one group get quietly applied to everyone else.
Detection Essential
Type I and Type II Errors
Every statistical test can fail in exactly two ways. Which failure you find more tolerable is not a technical question. It is a values question, and the answer determines how the system treats you.
Measurement
Variance as Spread
Two data sets with identical averages can be entirely different in character. Variance and standard deviation measure how far outcomes scatter around the centre — and in finance, medicine, and public policy, ignoring spread is how people get badly misled.
Foundations
What Is Probability?
Two legitimate interpretations of probability, both useful, often confused. The frequentist view treats probability as long-run frequency; the Bayesian view treats it as a degree of belief. The distinction matters every time you read a forecast, a poll, or a risk figure in public life.
Foundations
What Is Randomness?
Most people have a wrong model of what randomness looks like. Random sequences contain runs and clusters as a matter of course. Understanding this protects you from one of the most seductive errors in statistics: seeing a pattern in noise and inventing a cause for it.
Foundations
What Is Risk? The Intuitive Primer
Before formal probability, you need an intuitive vocabulary for risk. This unit covers absolute risk, baselines, and why your gut feeling about risk is systematically, predictably wrong — and how that gets exploited.
On-Ramp Essential
When Distributions Are Not Normal
Real data is rarely bell-shaped. Skew, fat tails, bimodal distributions, log-normal distributions, and power laws each produce systematic errors when treated as if they were normal. This unit shows what those errors look like and why they cost people money, elections, and lives.
Measurement