2020: Week 26

This week's challenge is inspired by some real life consulting work, replacing a very manual task with an automated process! Unique IDs are a great way to label your data, but issues can arise when trying to match internal IDs to 3rd Party IDs, as there could have been changes made to the naming convention by any number of people.

So how do we go about making some educated guesses and getting together a list of IDs that we can take back to the 3rd Party to complain about?

Input

1 input this week, with 2 sheets:



Requirements

  • Input the data.
  • Find the IDs that match perfectly and label them as such.
  • For the remaining unmatched Internal and 3rd Party IDs, create all the possible matching IDs for each Scent.
  • For each 3rd Party ID, find the Internal ID with the lowest sales difference. 
  • For each Internal ID, find the 3rd Party ID with the lowest sales difference.
    • You should now have no duplicated IDs.
  • Classify these IDs as "Matched on Scent".
  • Join these IDs to the "Matched IDs".
  • Classify IDs from Internal and 3rd Party Data which have not been matched and join these to create the output.
  • Output the data. 

Output



  • 6 fields
    • Status
    • ID
    • 3rd Party ID
    • Scent
    • Sales
    • 3rd Party Sales
  • 49 rows (50 including headers)


The full output can be found here for comparison.

Make sure to fill in the participation tracker, share using #PreppinData on Twitter and post you solutions onto our Tableau Forums community page so that we can compare our workflows! 

Popular posts from this blog

2024: Week 1 - Prep Air's Flow Card

2023: Week 1 The Data Source Bank

How to...Handle Free Text