Milano-Cortina 2026 · Men's Snowboard Halfpipe Final

What 144 Scores Reveal About Olympic Judging

For every scored performance in the final, we have all six individual judge ratings. Here's what the data shows about consensus, bias, and what actually drives the score.

Competitors

Scored performances

144

Individual judge ratings

SCROLL ↓

Key Findings at a Glance

🎯

Perfect Agreement

All 6 judges gave Ruka Hirano exactly 90 — twice, on two different routines. Every other scored run had judges disagreeing by at least a point.

💥

Crashes Create Confusion

When a rider falls, judges have to guess how much credit to give. The result: they disagree 70% more on crash scores than on clean landings.

⚖️

The System Works

The scoring rules drop the highest and lowest judge to limit bias. It works — that adjustment only shifts scores by 0.17 points on average.

🔍

Relief Bias? Probably Not

Do judges score higher after watching a string of crashes? When we compare the same rider in different crash contexts, there's no effect.

🏂 Read the Scotty James Story — crash, comeback, and a gamble for gold →

The Event

Three Rounds of Chaos and Precision

The worst qualifier goes first. The best goes last. Same order, every round. Round 1 was carnage — six consecutive wipeouts before a single clean landing. By Round 3, four riders had crashed out entirely, and the gold medalist didn't even bother to drop in.

Every Performance, Color-Coded

Positions 1–12 (left to right = worst to best qualifier) · Hover for details

PATES

77.5

JOSEY

11.75

WANG

17.75

LEE

24.75

IVES

HIRANO

27.5

GUSELI

HIRANO

BARBIERI

YAMADA

TOTSUKA

JAMES

48.75

PATES

—

JOSEY

70.25

WANG

17.25

LEE

24.75

IVES

—

HIRANO

86.5

GUSELI

—

HIRANO

BARBIERI

—

YAMADA

—

TOTSUKA

JAMES

93.5

PATES

—

JOSEY

—

WANG

LEE

87.5

IVES

—

HIRANO

—

GUSELI

HIRANO

BARBIERI

—

YAMADA

TOTSUKA

—

JAMES

—

Clean run

Wipeout (scored)

Crash (DNI)

Skipped / No improvement

↑ BACK TO TOP

The Consensus

Two Runs. Six Judges. One Number.

Ruka Hirano's Round 1 and Round 2 runs each received identical scores from all six judges: 90 across the board. The runs even featured different trick sequences — R2 included a triple cork upgrade over R1 — yet all six judges independently landed on the same number both times. The next-tightest agreement among all other runs was a 1-point spread.

Judge Score Spread Per Run

Each dot = one judge's score · Horizontal spread = disagreement · Purple = perfect consensus

Perfect consensus either means the performance was so unambiguous that all judges independently converged — or it suggests anchoring, where early scores influence later ones. We can't distinguish which from this data, but we can say it's statistically remarkable.

↑ BACK TO TOP

The Disagreement

When Do Judges Diverge?

Judges agree remarkably well on clean runs. But when a rider crashes mid-run, consensus breaks down. The question becomes: how much credit for what they showed?

2.0 pts

Avg judge spread
on clean runs

3.4 pts

Avg judge spread
on wipeouts

70%

More disagreement
on crashes

Wipeout Scores: Tricks Completed Before Crash

More tricks landed = higher score, regardless of what they were attempting

And some judges are consistently more generous or strict than others — across every run, not just wipeouts.

How Each Judge Deviates from the Panel Average

Each dot = one run's deviation · Lollipop = average tendency · Left = strict, Right = generous

But the safety net works.

The trimmed mean (drop the highest and lowest score) shifts the final result by just 0.17 points on average. Medal rankings would be identical with or without it.

↑ BACK TO TOP

The Hypothesis

Does a Crash Streak Help the Next Clean Run?

The hypothesis: after watching multiple riders crash in a row, judges feel relief when someone finally lands — and unconsciously score them higher. We tracked the number of consecutive crashes immediately before each clean run.

Verdict: probably not.

The group average is +1.96 pts higher after crash streaks — but within-rider comparisons tell a different story. Ruka scored highest after zero crashes (91 vs 90). Yamada scored identically (92) regardless. The one positive case came with a trick upgrade, not a bias effect.

↑ BACK TO TOP

What We Learned

Olympic halfpipe judging is more consistent than you'd expect. Six independent judges land within 2 points of each other on clean runs, and the trimmed mean effectively neutralizes individual tendencies.

The biggest source of disagreement isn't bias — it's ambiguity. Wipeouts force judges to estimate what could have been, and that's where scores diverge. As for contrast bias from watching crashes? The data from this competition says no.

Based on individual judge scores from 24 scored performances across 3 rounds.
All data from the official Olympics.com results.

Read the Scotty James Story →