Python Script

The processing script, scripts/process_data.py, is the single source of truth for the dataset.

What it does

Run it

uv run python scripts/process_data.py

Output

The JSON includes: