2021: Week 6 - Comparing Prize Money for Professional Golfers

Challenge By: Jenny Martin

What's one of the benefits of preparing your own data?
Being able to start your analysis sooner!

Sometimes I can find opening Tableau Desktop to explore my data gets a little distracting by trying to visualise it before I've decided on the story. Starting my analysis of the dataset in Tableau Prep helps me, personally, to stay more focused! It's clear where the outliers are, what the distribution of the dataset is and therefore what the story should be.

For this week's challenge we're looking at a dataset that was used in December 2020 for Sports Viz Sunday (thanks to Kate Brown for sharing!) This dataset comes from the PGA and LPGA 2019 Golf tours and lists the total prize money for the top 100 players. For those of us who aren't too familiar with golf, the PGA is the men's tour, whilst the LPGA is the women's tour.

Input

We have one input this week:

Requirements

  • Input the data
  • Answer these questions:
    • What's the Total Prize Money earned by players for each tour? (help)
    • How many players are in this dataset for each tour?
    • How many events in total did players participate in for each tour?
    • How much do players win per event? What's the average of this for each tour? (help)
    • How do players rank by prize money for each tour? What about overall? What is the average difference between where they are ranked within their tour compared to the overall rankings where both tours are combined? (help)
      • Here we would like the difference to be positive as you would presume combining the tours would cause a player's ranking to increase
  • Combine the answers to these questions into one dataset (help)
  • Pivot the data so that we have a column for each tour, with each row representing an answer to the above questions (help)
  • Clean up the Measure field and create a new column showing the difference between the tours for each measure
    • We're looking at the difference between the LPGA from the PGA, so in most instances this number will be negative
  • Output the data

Output


  • 4 fields
    • Measure
    • PGA
    • LPGA
    • Difference between tours
  • 5 rows (6 including headers)

The full output can be downloaded here.

After you finish the challenge make sure to fill in the participation tracker, then share your solution on Twitter using #PreppinData and tagging @Datajedininja@JennyMartinDS14 & @TomProwse1

You can also post your solution on the Tableau Forum where we have a Preppin' Data community page. Post your solutions and ask questions if you need any help! 





Popular posts from this blog

2024: Week 1 - Prep Air's Flow Card

2023: Week 1 The Data Source Bank

How to...Handle Free Text