Methodology

How this works. What's measured. What's locked.

The Challenge

Can a human beat 5 LLMs at predicting the 2026 World Cup?

Jack hosts. Three friends play. Five AIs compete: ChatGPT, Claude, Gemini, Perplexity, Grok. Anyone can submit a bracket. All picks become public after kickoff.

The Competitors

AIs (5)

  • ChatGPT — OpenAI
  • Claude — Anthropic
  • Gemini — Google
  • Perplexity
  • Grok — xAI

Humans

  • Jack (host)
  • Garrett, Zack, Tristan
  • Public submitters

Two Phases, Three Games

The competition runs in two phases. Each phase is scored separately, plus a combined leaderboard tracks total points across both.

Phase 1 — Advancement

Open now · Locks June 11, 19:00 UTC

Place all 48 teams into 8 advancement tiers based on how far you think they go in the tournament:

  • 16 teams exit at group stage
  • 16 teams lose in Round of 32
  • 8 teams lose in Round of 16
  • 4 teams lose in Quarterfinals
  • 1 team finishes 4th
  • 1 team finishes 3rd
  • 1 team is runner-up
  • 1 team wins the World Cup

Phase 2 — Knockout Picks

Opens June 27 · Locks June 27, 19:00 UTC

Once the group stage ends and the actual knockout bracket is set, pick the winner of every knockout match — 32 matches total, from Round of 32 to the Final.

Three Leaderboards

  • Phase 1— advancement scoring only
  • Phase 2— knockout pick scoring only
  • Combined— sum of both

Scoring

No partial credit. Wrong tier or wrong pick = 0 points.

Phase 1 — Per team

TierTeamsPointsTier max
Group stage exit16116
R32 exit16232
R16 exit8432
QF exit4832
4th place11212
3rd place11616
Runner-up12424
Champion13232
Phase 1 perfect score196

Phase 2 — Per match

RoundMatchesPointsRound max
Round of 32 winner16232
Round of 16 winner8432
Quarterfinal winner4832
Semifinal winner21632
3rd place match winner11212
Final winner13232
Phase 2 perfect score172

Combined perfect score: 368 points.

AI Methodology

To make this a fair, auditable fight:

  • All 5 AIs receive the same prompt in their respective web interfaces.
  • No web search is allowed. Predictions rely on training data only.
  • Output is structured JSON, validated against the schema before being stored.
  • Raw prompts and raw outputs are stored and timestamped for every AI submission.
  • Once submitted, AI brackets are locked. They cannot be re-run, regenerated, or edited.

Human Submission Rules

  • One bracket per email address per phase.
  • Display name required. Cannot impersonate Jack or any of the AIs.
  • Picks save as a draft in your browser as you make them — close the tab and come back, your picks are preserved.
  • After clicking Submit, your bracket is saved to the database. You can edit your picks until the phase locks.
  • All submissions become public once the phase locks.

The Asymmetry

Humans can edit their bracket until the phase locks. AIs cannot — they run once and lock immediately.

This is intentional. Humans always have the option to chase information: lineups, weather, injury news, form. The AIs get only what was in their training data.

If a human beats the AIs, they did it with more information available to them. The AIs are competing handicapped by design. That's what makes the question Can a human beat 5 LLMs? interesting in the first place.

Timeline

DateEvent
Late May 2026Phase 1 opens. AI Phase 1 predictions locked.
June 11, 19:00 UTCMatch 1 kickoff. Phase 1 locks for everyone.
June 11 — 26Group stage matches played.
June 27Phase 2 opens. AI Phase 2 predictions locked.
June 27, 19:00 UTCRound of 32 begins. Phase 2 locks for everyone.
June 28 — July 19Knockout rounds. Leaderboards update live.
July 19Final. Champion crowned. Final leaderboards locked.

Raw AI prompts and outputs are stored in the database with timestamps. After Phase 1 lock, they will be published here for audit.