2021: Week 1

Hello World...

Welcome to 2021 Preppin' Data. 

Since Jonathan (@JonathanAllenby) and I chatted by a coffee machine in Feb 2019, we along with Jenny Martin (@JennyMartinDS14) and Tom Prowse (@TomProwse1) have produced 99 challenges to date, completed by 305 different people, who have solved the challenges 2,424 times (that we know of!). 

The overall aim of Preppin' Data is to give people a place to learn the power of being able to prepare your own data sets (rather than relying on others) and the capabilities of Tableau Prep. If you want to know more, check out our first post.

For those of you who are new to Preppin' Data, here are some basics:

  • We release a new challenge each Wednesday
  • We post a written and video solution the following Tuesday
  • We build our challenges in Tableau Prep but welcome solutions from any other tool too. 
One change this year is Jonathan is taking a step back from the challenges but will come up with a tricky problem every now and then I am sure. This means for our regulars you don't need to tag him on Twitter anymore. 

New Year, New Challenges

Each January people all over the world might make resolutions to start a regular challenge to practice their skills so we wanted to facilitate that as best we could for a newer crowd to Prep Builder. 

The aim of January's challenges are to give you an introduction to Tableau Prep Builder. By going through these challenges you will be able to build fundamental skills with the tool, as well as data preparation skills applicable to all data work. 

Learning Resources

- Tableau Videos - how I learnt Desktop originally and works well for getting the basics on Prep Builder too. There are 12 short videos but you won't need to watch them all at once. You will need to sign up for a free Tableau account to watch the videos. 

- Jenny & Tom's excellent free online training series - currently 31 short videos going into more precision on the features of Prep. Did I mention free?

- Tableau Prep Up & Running - if you prefer your learning through words on a page, I wrote this book for beginner and intermediate users for Tableau Prep. 

Once you complete the challenge, you can post on our tracker, share an image (annotated if you want bonus points) on Twitter or the Tableau Forums. 

Week 1's Challenge

Challenge by Carl Allchin

This week we are going to be focusing on cleaning data ready to answer some questions from our stakeholders. In the requirements I will be adding some links to some useful resources if you get stuck on a particular requirement. 

Input

The input is a csv file (Text File input type on prep Builder). The file looks like this (just with a lot more rows).


There are 1,000 orders.

Requirements

Here's what we need you to do:
  • Connect and load the csv file (help)
  • Split the 'Store-Bike' field into 'Store' and 'Bike' (help)
  • Clean up the 'Bike' field to leave just three values in the 'Bike' field (Mountain, Gravel, Road) (help)
  • Create two different cuts of the date field: 'quarter' and 'day of month' (help)
  • Remove the first 10 orders as they are test values (help)
  • Output the data as a csv (help)

Output

  • 8 Data Fields
    • Quarter
    • Day of Month
    • Store
    • Bike
    • Order ID
    • Customer Age
    • Bike Value
    • Existing Customer?
  • 990 Rows (991 including Column Headers)
Here's our full output for comparison. Remember we don't care about the order of the columns or rows as Tableau Desktop will import them in whatever order they come in!

After you finish the challenge make sure to fill in the participation tracker, then share your solution on Twitter using #PreppinData and tagging @Datajedininja@JennyMartinDS14 & @TomProwse1

You can also post your solution on the Tableau Forum where we have a Preppin' Data community page. Post your solutions and ask questions if you need any help! 

Bonus task

We are conscious this shouldn't be too much of a stretch for our Preppin' regulars so we've set you a Desktop task too. 

Our stakeholder wants to know the average monthly bike value sold by each day in the month. Yeah, they even want the running total to see where they should be in terms of sales by that point. The stakeholder knows each quarter is significantly different so the running totals should be separated by quarter. 

Build this view using the output if you fancy taking on the extra task. 


Popular posts from this blog

2023: Week 1 The Data Source Bank

2023: Week 2 - International Bank Account Numbers

How to...Handle Free Text