2021: Week 38 - Trilogy

 Challenge By: Jenny Martin

Recently, I've been playing with data about the best movie trilogies of all time, according to IMDb. So I thought I'd create a Preppin' Data challenge to allow you all to do the same!


Inputs

There are 2 inputs for this challenge:
  1. Top 30 Trilogies 

  2. Films 

Requirements

  • Input the data
  • Split out the Number in Series field into Film Order and Total Films in Series
  • Work out the average rating for each trilogy
  • Work out the highest ranking for each trilogy
  • Rank the trilogies based on the average rating and use the highest ranking metric to break ties (make sure you haven't rounded the numeric fields yet!)
    • We have noticed a slight error in the way that Tableau Prep is calculating this rank, so don't worry if your output is different to ours, we are investigating!
  • Remove the word trilogy from the Trilogy field
  • Bring the 2 datasets together by the ranking fields
  • Output the data

Output


  • 7 fields
    • Trilogy Ranking
    • Trilogy
    • Trilogy Average
    • Film Order
    • Title
    • Rating
    • Total Films in Series
  • 90 rows (91 including headers)
You can download the full output here

After you finish the challenge make sure to fill in the participation tracker, then share your solution on Twitter using #PreppinData and tagging @Datajedininja@JennyMartinDS14 & @TomProwse1

You can also post your solution on the Tableau Forum where we have a Preppin' Data community page. Post your solutions and ask questions if you need any help! 


Popular posts from this blog

2023: Week 1 The Data Source Bank

2023: Week 2 - International Bank Account Numbers

How to...Handle Free Text