2020: Week 28

Week 28 by Carl Allchin

This week's challenge comes from our fellow Tableau Challenge Sports Viz Sunday. The data set in question has sadly gotten a little messed and manipulated from the lovely Summer Olympics data set they had produced. As we would have been approaching the start of Japan 2020, we thought we would test some of the skills you've developed in your data preparation training to see if you are ready to compete at the top level!

Input
There is one Excel file for the task this week with two separate tabs. 
  1. Host Cities of the Summer Olympics - this is your main data set this week
  2. Scaffold - this will help with one part of the task
Input 1 - Host Cities of Summer Olympics

Input 2 - Scaffold

Requirements
  • Input Data
  • Convert the Roman numerals to numerical values
  • Determine the Year of the Olympics 
    • Each Roman numeral is four years apart even if the games wasn't held
  • Create a Start Date for the Games (in DD/MM/YYYY format)
  • Create an End Date for the Games (in DD/MM/YYYY format)
  • Remove unnecessary fields
  • Remove the null dates
  • Output the Data

Output

One File:
  • 7 Fields 
    • Start Date
    • End Date
    • Games
    • Host 
    • Nations
    • Sports
    • Events
  • 34 Rows - One Row per Olympics (based on the Roman numeral)

The full output can be found here for comparisons.

After you finish the challenge make sure to fill in the participation tracker, then share your solution on Twitter using #PreppinData and tagging @Datajedininja@JennyMartinDS14@JonathanAllenby @TomProwse1

You can also post your solution on the brand new Tableau Forum where we have a Preppin Data community page. Post your solutions and ask questions if you need any help! 

Popular posts from this blog

2023: Week 1 The Data Source Bank

2023: Week 2 - International Bank Account Numbers

How to...Handle Free Text