Usage Scenario 1

Data loading and cleaning

Inspect the unified FIFA archive, verify schema coverage, and track the cleaning rules that prepare the player snapshot table for downstream analysis.

Source: csv-archive. Preview rows are sampled from the current player snapshot view.
Rows

144.3K

Columns

110

Seasons

15, 16, 17, 18, 19, 20, 21, 22

Position fields

27

Tidy player snapshot cache

Archive inspection
Preview rows and column profile
sofifa_idshort_nameplayer_positionsoverallpotentialvalue_eurwage_eurleague_namenationality_nameseasongender
226324C. LloydCM, CAM, LM, ST9191nullnullnullUnited States16female
226328M. RapinoeLM, CM9090nullnullnullUnited States16female
226334A. WambachST9090nullnullnullUnited States16female
226362L. NécibLM, CAM9090nullnullnullFrance16female
226373N. KeßlerCM8989nullnullnullGermany16female
Cleaning checklist
Data preparation status

Load yearly CSV snapshots

ready

Loaded 144323 rows from the yearly FIFA CSV snapshots into one unified table.

Normalize numeric market fields

ready

Converted ratings, IDs, value_eur, wage_eur, release_clause_eur, and ability columns into nullable numeric types.

Split position rating strings

ready

Parsed 27 position columns into base, modifier, and effective rating fields.

Write tidy parquet cache

ready

Parquet cache written successfully.

Null hotspots
Columns requiring special handling

club_loaned_from

94%

Post-cleaning null rate from the generated tidy dataset.

nation_team_id

93%

Post-cleaning null rate from the generated tidy dataset.

nation_position

93%

Post-cleaning null rate from the generated tidy dataset.

nation_jersey_number

93%

Post-cleaning null rate from the generated tidy dataset.

nation_logo_url

93%

Post-cleaning null rate from the generated tidy dataset.