biostatistics rapid review • build from 2x2 to board questions
The 2Γ2 Table
// everything flows from here. learn this once, use it forever.
Every board question about risk, treatment effect, or diagnostic tests is secretly asking you to fill in a 2Γ2 table and do arithmetic. Master the table, and the formulas become obvious.
Disease +
Disease β
TOTAL
Exposed +
aTP / exposed + sick
bFP / exposed + healthy
a+b
Exposed β
cFN / unexposed + sick
dTN / unexposed + healthy
c+d
TOTAL
a+c
b+d
N
KEY INSIGHT
"Exposed" = treated group, risk factor group, or test positive β depending on question type. "Disease" = outcome, disease, whatever you're measuring.
The math is identical regardless of framing.
HOW TO READ A QUESTION
Step 1: Draw the 2Γ2 mentally.
Step 2: Fill in a, b, c, d from the stem or table.
Step 3: Ask β "what are they actually asking for?" and apply one formula.
STUDY DESIGN MATTERS FOR WHICH CALC YOU CAN USE
RCT / Cohort Can calculate RR and OR Case-Control Can only calculate OR (you selected cases and controls β incidence is artificial) Cross-Sectional / Dx Study Sensitivity, Specificity, PPV, NPV
Risk Measures
// six numbers. this is the entire section.
π―
Absolute Risk (AR)
a / (a+b)
"What fraction of exposed people got sick?"
Also called incidence or attack rate.
Unexposed AR = c/(c+d)
β Case-control β cannot calculate AR (no true incidence data)
βοΈ
Relative Risk (RR)
a/(a+b) Γ· c/(c+d)
= [a/(a+b)] / [c/(c+d)] RR = 1: no difference RR > 1: harmful RR < 1: protective
Only valid for cohort/RCT.
β 95% CI includes 1 β not statistically significant
π²
Odds Ratio (OR)
(aΓd) / (bΓc)
"Diagonal cross multiply."
Use for case-control studies.
When disease is rare (<10%), OR β RR.
Interpreted identically to RR.
β OR always farther from 1 than RR β never mistake them
π‘οΈ
Absolute Risk Reduction (ARR)
c/(c+d) β a/(a+b)
= [c/(c+d)] β [a/(a+b)]
Always control minus treatment. This is the most clinically meaningful number.
β Boards: when asked "best reflects clinical benefit" β choose ARR or NNT
π’
Relative Risk Reduction (RRR)
ARR / ARcontrol
Often cited in drug ads to make results look impressive. RRR is misleading when baseline risk is very low.
"50% RRR" sounds great until ARR = 0.002%.
β Pharma ads use RRR. The exam will ask you to see through it
π
NNT / β οΈ NNH
1 / ARR
Number needed to treat = 1/ARR
Number needed to harm = 1/ARI (absolute risk increase) Lower NNT = better treatment. Higher NNH = safer drug. Always round UP.
β NNT of 1 = perfect drug. NNT of 1000 = barely any benefit
BOARD EXAM PATTERN β MEMORIZE THIS
When a question gives you a drug trial and asks what "best describes the benefit":
β They want ARR or NNT, almost always.
When they cite "the drug reduced risk by 40%" β that's RRR, and they're testing whether you can calculate the actual ARR.
When it's a case-control study β OR only. No RR possible.
THE MISLEADING AD TRAP
Drug trial: 2% placebo events, 1% drug events.
Ad claims: "50% reduction in events!" (That's RRR.)
The real ARR = 1%. NNT = 100.
The boards will ask which statistic "most accurately reflects clinical benefit." Answer: ARR or NNT.
Live Calculator
// sliders update everything live β watch the ROC curve move
Drag sliders or type values. All stats and the ROC curve update instantly.
Dis+
Disβ
Tot
Exp+
20
180
200
Expβ
40
160
200
Tot
60
340
400
ROC Curve β operating point moves with your values
// sensitivity, specificity, PPV, NPV β and why prevalence matters more than you think
π
Sensitivity (SnNout)
a / (a+c)
TP / all who have disease.
"How good is this test at finding disease?" High Sens β good screening test.
Negative result on high-Sens test = rules OUT disease.
β SeNsitivity: high β Negative rules out (SnNout). Does NOT change with prevalence.
π
Specificity (SpPin)
d / (b+d)
TN / all who don't have disease.
"How good is this test at correctly labeling healthy people?" High Spec β good confirmatory test.
Positive result on high-Spec test = rules IN disease.
β Specificity: high β Positive rules in (SpPin). Does NOT change with prevalence.
β
PPV β Positive Predictive Value
a / (a+b)
"If the test is positive, what's the chance the patient actually has it?" PPV depends heavily on prevalence.
Same test β lower PPV in low-prevalence population.
β Low prevalence β lousy PPV even with a great test. Most positives = false positives.
β
NPV β Negative Predictive Value
d / (c+d)
"If the test is negative, what's the chance they truly don't have it?" NPV also depends on prevalence.
Higher prevalence β more false negatives β lower NPV.
β High prevalence β lower NPV. More disease = more missed cases.
π
LR+ (Likelihood Ratio +)
Sens / (1βSpec)
How much does a positive result shift probability?
LR+ > 10 = strong evidence for disease.
LR+ 2β5 = modest shift. LR is independent of prevalence.
How much does a negative result shift probability?
LRβ < 0.1 = strong evidence against disease.
LRβ 0.2β0.5 = modest shift down.
Post-test odds = Pre-test odds Γ LR
β LRβ < 0.1 = strong rule-out. LR is prevalence-independent, unlike PPV/NPV.
THE PREVALENCE TRAP β MOST COMMON WRONG ANSWER
A test has 99% sensitivity and 99% specificity. Prevalence = 1 in 1000.
Test is positive. What is the PPV? Answer: only ~9%! (999 healthy people β ~10 false positives. 1 true positive. PPV = 1/11.)
This is why mass screening programs with rare diseases have lousy PPV β and why confirmatory tests exist.
MNEMONIC
SnNout: High SeNsitivity β Negative rules out SpPin: High Specificity β Positive rules in
PPV & NPV change with prevalence. Sens & Spec do NOT. LR does NOT.
CHANGING THE THRESHOLD
Lower threshold β more positives β Sensitivity β, Specificity β
Raise threshold β fewer positives β Sensitivity β, Specificity β
They always move in opposite directions β this is the ROC curve tradeoff.
Board Question Drill
// 10 questions. full board format. worked solutions.
0
correct
0
wrong
0/10
answered
Q01 / π NNT
A randomized controlled trial evaluates a new statin in patients with known coronary artery disease. Over 5 years, 8% of patients in the statin group experienced a myocardial infarction, compared to 12% of patients in the placebo group.
What is the number needed to treat (NNT) to prevent one MI over 5 years?
Treat 25 patients for 5 years to prevent 1 MI. β
Q02 / π² ODDS RATIO
A case-control study examines the association between smoking and bladder cancer. Among 100 cases (bladder cancer), 60 are smokers. Among 100 controls, 30 are smokers.
What is the odds ratio for the association between smoking and bladder cancer?
STEP 2 β OR formula: OR = (aΓd)/(bΓc) = (60Γ70)/(30Γ40) = 4200/1200 = 3.5
Case-control β OR is the only valid measure. Cannot calculate RR. β
Q03 / ππ SENSITIVITY & SPECIFICITY
A new rapid test for pulmonary embolism is evaluated in 400 patients. Results:
PE Present
PE Absent
Test +
90
40
Test β
10
260
What is the sensitivity and specificity of this test?
a=90, b=40, c=10, d=260
Sensitivity = a/(a+c) = 90/100 = 90%
Specificity = d/(b+d) = 260/300 = 86.7% β 87%
90% sensitivity β a negative result helps rule OUT PE (SnNout). β
Q04 / π’ THE MISLEADING AD
A pharmaceutical rep presents data on a new anticoagulant. In a RCT of 10,000 patients: 1% of the treatment group and 2% of the placebo group had a stroke over 3 years. The rep claims the drug "reduces stroke risk by 50%."
Which of the following most accurately describes the clinical benefit of this drug?
STEP 1: The rep cited RRR = (2%-1%)/2% = 50%. Technically correct but misleading.
STEP 2 β ARR:0.02 β 0.01 = 0.01 = 1%
STEP 3 β NNT:1/0.01 = 100
Treat 100 patients for 3 years to prevent 1 stroke. The boards always choose ARR/NNT over RRR. β
Q05 / β PPV AND PREVALENCE
A screening test for a rare genetic disorder has sensitivity 99% and specificity 99%. Disease prevalence = 1 in 1,000. A randomly selected patient tests positive.
Even with an excellent test, rare disease = most positives are false positives. β
Q06 / βοΈπ‘οΈπ INTEGRATED
A cohort study follows 500 hypertensive patients on ACE inhibitor and 500 matched controls for 10 years. 30 ACE inhibitor patients develop CKD; 60 control patients develop CKD.
Which of the following BEST describes the effect of ACE inhibitors on CKD?
a=30, b=470, c=60, d=440
RR = (30/500)/(60/500) = 0.5
ARR = 0.12 β 0.06 = 6%
NNT = 1/0.06 = 16.7 β round up to 17
Choice A says NNT=16 (rounds wrong). Choice D is exact. β
Q07 / π LIKELIHOOD RATIO
A bedside test for Lyme disease has sensitivity 80% and specificity 95%. A patient from an endemic area tests positive.
What is the likelihood ratio positive (LR+) for this test?
LR+ = Sens / (1 β Spec)
LR+ = 0.80 / (1 β 0.95) = 0.80 / 0.05 = 16
LR+ > 10 = strong evidence for disease. This is a good confirmatory test. E (0.21) is the LRβ. β
Q08 / ππ CHANGING THE THRESHOLD
A researcher lowers the positivity threshold for a urine dipstick test for UTI, capturing more patients as "positive." Which of the following is MOST LIKELY to occur?
Select the best answer.
Lowering the threshold = calling more results "positive."
More positives β more TP captured β Sensitivity β
But more FP also captured β Specificity β
Sens and Spec always move in opposite directions as threshold shifts. This is the ROC tradeoff. PPV and LR+ both fall (more false positives dilute them). β
Q09 / π² STUDY DESIGN β WHICH STAT
Investigators identify 300 patients with newly diagnosed lung cancer and 300 age-matched patients without cancer. They survey both groups about lifetime asbestos exposure. Which measure of association is most appropriate?
Select the best answer.
This is a case-control study β you selected based on outcome (cancer vs no cancer).
Because you selected on outcome, you cannot calculate true incidence β no RR, no AR, no ARR, no NNT.
OR is the only valid association measure in case-control studies. β
Q10 / TYPE I / TYPE II ERROR
A randomized trial comparing two antibiotic regimens uses Ξ± = 0.05 and Ξ² = 0.20 (power = 80%). The trial ends with p = 0.18, showing no statistically significant difference. The researchers suspect the sample size was too small.
Which error is most likely occurring?
Type I error (Ξ±) = false positive β concluding there IS a difference when there isn't. p < 0.05.
Type II error (Ξ²) = false negative β MISSING a real difference. More likely with small samples.
p = 0.18 β not significant. But this could mean no effect OR the sample was too small to detect it β Type II error.
Power = 1 β Ξ² = 80%. Increasing sample size β power β Type II error risk. β
Rapid Cheatsheet
// print this in your head before entering the exam room
Risk Measures
π― ARa/(a+b)incidence in exposedCase-control β cannot calculate AR
RCT/CohortRR, OR, ARR, NNTgold standard for causation
Case-ctrlOR onlyno incidence β no RRGood for rare diseases; remember: selected on outcome
Cross-sectprevalence, ORsnapshot in time
Interpretation & Errors
RR=1 / OR=1no association
CI crosses 1not significantfor RR and OR
CI crosses 0not significantfor differences (ARR)
p < 0.05statistically significantβ clinically meaningful
Type I (Ξ±)false positive β Ξ± = 0.05concluding effect when nonep < 0.05 = Type I risk β€ 5%
Type II (Ξ²)false negative β Ξ² = 0.20missing a real effectSmall sample size β β Type II risk
Power1 β Ξ² = 80%β sample size β β power
EXAM STRATEGY β 30 SECOND ALGORITHM
1. What study type? β determines which stats are valid
2. Build 2Γ2 (a, b, c, d) from the data given
3. What are they asking? β apply one formula
4. Does the answer make biologic sense?
If RRR vs ARR question β always choose ARR/NNT as "most meaningful"
π Flash Cards
// tap to flip Β· Space = flip Β· β = got it Β· β = again
0 / 13 learned
Tap card to flip | Space = flip | β = Got it | β = Again
π―
AR β Absolute Risk
tap to reveal formula & killer fact
a / (a+b)
Fraction of exposed people who got sick. Also called incidence or attack rate.
β Case-control β cannot calculate AR. No true incidence data.