Milano-Cortina 2026 · Men's Snowboard Halfpipe Final

What 144 Scores Reveal About Olympic Judging

For every scored performance in the final, we have all six individual judge ratings. Here's what the data shows about consensus, bias, and what actually drives the score.

12
Competitors
24
Scored performances
144
Individual judge ratings
SCROLL ↓
Key Findings at a Glance
🎯
Perfect Agreement
All 6 judges gave Ruka Hirano exactly 90 — twice, on two different routines. Every other scored run had judges disagreeing by at least a point.
💥
Crashes Create Confusion
When a rider falls, judges have to guess how much credit to give. The result: they disagree 70% more on crash scores than on clean landings.
⚖️
The System Works
The scoring rules drop the highest and lowest judge to limit bias. It works — that adjustment only shifts scores by 0.17 points on average.
🔍
Relief Bias? Probably Not
Do judges score higher after watching a string of crashes? When we compare the same rider in different crash contexts, there's no effect.

🏂 Read the Scotty James Story — crash, comeback, and a gamble for gold →

The Event

Three Rounds of Chaos and Precision

The worst qualifier goes first. The best goes last. Same order, every round. Round 1 was carnage — six consecutive wipeouts before a single clean landing. By Round 3, four riders had crashed out entirely, and the gold medalist didn't even bother to drop in.

Every Performance, Color-Coded
Positions 1–12 (left to right = worst to best qualifier) · Hover for details
R1
PATES
77.5
JOSEY
11.75
WANG
17.75
LEE
24.75
IVES
43
HIRANO
27.5
GUSELI
35
HIRANO
90
BARBIERI
75
YAMADA
92
TOTSUKA
91
JAMES
48.75
R2
PATES
JOSEY
70.25
WANG
17.25
LEE
24.75
IVES
HIRANO
86.5
GUSELI
HIRANO
90
BARBIERI
YAMADA
TOTSUKA
95
JAMES
93.5
R3
PATES
JOSEY
WANG
76
LEE
87.5
IVES
HIRANO
GUSELI
88
HIRANO
91
BARBIERI
YAMADA
92
TOTSUKA
JAMES
Clean run
Wipeout (scored)
Crash (DNI)
Skipped / No improvement
↑ BACK TO TOP

Two Runs. Six Judges. One Number.

Ruka Hirano's Round 1 and Round 2 runs each received identical scores from all six judges: 90 across the board. The runs even featured different trick sequences — R2 included a triple cork upgrade over R1 — yet all six judges independently landed on the same number both times. The next-tightest agreement among all other runs was a 1-point spread.

Judge Score Spread Per Run
Each dot = one judge's score · Horizontal spread = disagreement · Purple = perfect consensus

Perfect consensus either means the performance was so unambiguous that all judges independently converged — or it suggests anchoring, where early scores influence later ones. We can't distinguish which from this data, but we can say it's statistically remarkable.

↑ BACK TO TOP
The Disagreement

When Do Judges Diverge?

Judges agree remarkably well on clean runs. But when a rider crashes mid-run, consensus breaks down. The question becomes: how much credit for what they showed?

2.0 pts
Avg judge spread
on clean runs
3.4 pts
Avg judge spread
on wipeouts
70%
More disagreement
on crashes
Wipeout Scores: Tricks Completed Before Crash
More tricks landed = higher score, regardless of what they were attempting

And some judges are consistently more generous or strict than others — across every run, not just wipeouts.

How Each Judge Deviates from the Panel Average
Each dot = one run's deviation · Lollipop = average tendency · Left = strict, Right = generous
But the safety net works.
The trimmed mean (drop the highest and lowest score) shifts the final result by just 0.17 points on average. Medal rankings would be identical with or without it.
↑ BACK TO TOP

Does a Crash Streak Help the Next Clean Run?

The hypothesis: after watching multiple riders crash in a row, judges feel relief when someone finally lands — and unconsciously score them higher. We tracked the number of consecutive crashes immediately before each clean run.

Verdict: probably not.
The group average is +1.96 pts higher after crash streaks — but within-rider comparisons tell a different story. Ruka scored highest after zero crashes (91 vs 90). Yamada scored identically (92) regardless. The one positive case came with a trick upgrade, not a bias effect.
↑ BACK TO TOP

What We Learned

Olympic halfpipe judging is more consistent than you'd expect. Six independent judges land within 2 points of each other on clean runs, and the trimmed mean effectively neutralizes individual tendencies.

The biggest source of disagreement isn't bias — it's ambiguity. Wipeouts force judges to estimate what could have been, and that's where scores diverge. As for contrast bias from watching crashes? The data from this competition says no.

Based on individual judge scores from 24 scored performances across 3 rounds.
All data from the official Olympics.com results.

Read the Scotty James Story →