Python Script
The processing script is the single source of truth for the dataset.
What it does
- Loads
data/raw/fifaworldcup2026-groupstage.csv
- Loads
data/raw/fifaworldcup2026-basecamps.csv
- Standardizes team names
- Geocodes training sites
- Finds nearby commercial airports
- Calculates total one-way air travel distance per team
- Writes
data/processed/travel_distances.json
- Copies the same payload to
docs/data/travel_distances.json
Run it
uv run python scripts/process_data.py
Output
The JSON includes:
- team name
- total one-way distance in kilometers
- matches counted
- generation timestamp
- short methodology note