Dashboard Week: Star Wars Statistics

For today's challenge DS25 was tasked to work on data about Star Wars, including all kinds of demographics about the people that responded, with sections for each movie and for each character.

The data was in the form of a 268 pages PDF, and we had to split into two groups: half of the people will look at the movies and the other half will look at the characters.

After having downloaded the data, Ollie Matthews and I looked at the tables 103-177 for the individual characters, and I specifically dealt with cleaning up the fields' structure, which originally looked like this:

For the preparation I used the following steps:

And this is the final data:

Challenges:

  • Understanding the structure of the original data and tables and how to import it in Alteryx;
  • There were a lot of unknown characters () that were instead of the value "1", as well as "-" instead of brackets that needed replacing;
  • Cleaning up the data required a lot of trial and error and an extensive use of Multi-Field Formula tools.

[Picture by Tommy van Kessel 🤙 on Unsplash]

Author:
Edoardo Aversa
Powered by The Information Lab
1st Floor, 25 Watling Street, London, EC4M 9BR
Subscribe
to our Newsletter
Get the lastest news about The Data School and application tips
Subscribe now
© 2025 The Information Lab