2022: Week 26 - Making Spotify Data Spotless

Challenge By: Algirdas Grajauskas

It's always great when a Data Schooler in training comes up with an idea for a Preppin' Data Challenge! Al is part of DS31 and has been in training for 3 months now. For this challenge we're going to clean up and analyse his Spotify data. 

Input

Unclean Spotify data – from Spotify 

Requirements

  • Input the data
  • Create a new field which would break down milliseconds into seconds and minutes
    • e.g. 208,168 turned into minutes would be 3.47min
  • Extract the year from the timestamp field
  • Rank the artists by total minutes played overall
  • For each year, find the ranking of the artists by total minutes played
  • Reshape the data so we can compare how artist position changes year to year
  • Filter to the overall top 100 artists
  • Output the data

Output


  • 10 fields
    • Overall Rank
    • Artist Name
    • 2015
    • 2016
    • 2017
    • 2018
    • 2019
    • 2020
    • 2021
    • 2022
  • 100 rows (101 including headers)
You can download the full output here

After you finish the challenge make sure to fill in the participation tracker, then share your solution on Twitter using #PreppinData and tagging @Datajedininja@JennyMartinDS14 & @TomProwse1

You can also post your solution on the Tableau Forum where we have a Preppin' Data community page. Post your solutions and ask questions if you need any help! 

Popular posts from this blog

2023: Week 1 The Data Source Bank

2023: Week 2 - International Bank Account Numbers

How to...Handle Free Text