What is Melbourne Cup?

The Melbourne Cup is Australia’s most famous horse race held over a distance of 3200 meters at Flemington Racecourse in Melbourne. Held on the first Tuesday of November each year the ‘race that stops a nation is an opportunity to try and back a winner and claim bragging rights over your friends, workmates, and family. In this blog, I’ll show my process of creating a dashboard based on Melbourne Cup data.

 

Data Preparation – Web Scraping & Iteration Macro

I wanted to use the last 10 years’ data to do the analysis. But I found some websites’ URLs are different (2011-2019 & 2020-2021), so I need to do separate jobs in collecting the data.

 

a) Create an Iteration Macro to collect the data from 2011-2019

Figure 1. Iteration Macro for collecting data from 2011-2019

 

b) Create an Iteration Macro to collect the data from 2020-2021

Figure 2. Iteration Macro for collecting data from 2020-2021

 

c) Implement the Iteration Workflow

Figure 3. Implement the Iteration

 

Data Wrangling

I got all the data. But some websites had missing values or dirty data. I need to do some data cleansing.

 

a) Data Cleansing for the two datasets and Union them

Figure 4. Data Wrangling

 

b) Create a Dataset about the trainers

The trainer column had some concatenated values. To fix that, I created another workflow.

Figure 5. Create a workflow to Cross Tab the concatenated Trainers

 

Dashboard Creation

After getting all the data, put them in Tableau and create visualizations.

 

a) Make an introduction on my dashboard

Figure 6. Introduction of the Dashboard

 

b) Create the horse racing graph to show the moment of victory from 2011-2021

Figure 7. Horse Racing Graph

 

c) List the champion horses, jockeys, and trainers

Figure 8. Champion List

 

d) Arrange the Layout and Finalise the Dashboard

Figure 9. Final Dashboard

 

You can click here to interact with the dashboard functionality. If there are any problems, please feel free to point them out. Besides, you can reach out to me on LinkedIn. I will try my best to answer your questions about Tableau or Alteryx.

 

Joe Chan
Author: Joe Chan

Joe has an IT background with a master's degree in UNSW, majoring in AI and Data Science. During his studies, he realized Data is one of the most valuable assets a business can have and potentially has a tremendous impact on its long-term success. After graduation, his desire to level up his data analytics skills led him to join The Data School. He is interested in Data Wrangling, Data Visualization, and Machine Learning, eager to be a great Data Analyst to help businesses grow.