2022: Week 27 - Solution


Solution by Tom Prowse and you can download the workflow here.


This week was the first of our basic & intermediate challenges throughout July & August. The challenge is aimed at anyone just starting out and covers some of the fundamentals that you'll need to get started. Let's see how we can solve the challenge.

Step 1 -  Separate Product Name

First we need to connect to our data source and bring the single table into our workflow. Once we have the data, we can go about splitting the Product Name field into two separate fields (Product Type & Quantity).

The Product Name contains a common structure of Product Type - Quantity, therefore we can use a Custom Split on the '-' to separate either side into different fields: 


As a result of the split we now have 2 additional fields that we can rename to be Product Type and Quantity. The table should look like this: 


Step 2 - Liquid & Bar Path

We can now split our workflow into two separate path/branches. To do this we want to create two new clean steps (one for Liquid, one for Bars) so now we should have two separate branches that we can rename to be Bar and Liquid: 


Step 3 - Bar Path

As there are slightly different steps required for the different paths, lets focus on just the Bars first.

In the Bars clean step, we can use the filter to 'Keep Only' Bars from the Product Type field. Then once we have filtered for Bars, we can then remove any Letters from the Quantity field so we are left with just the numeric values.



Now we've filtered and cleaned the table, we need to calculate the total sales and number of orders for each Store, Region, and Quantity combination. To do this we can use an aggregate tool with the following step up:


Group by: Quantity, Store Name, and Region
Aggregate: Sum Sale Value, and CountD OrderID

After the aggregation we can rename the Order ID field to Present in N Orders and we are ready to output our first table: 


Step 4- Liquid Path

Now we can go back to our Liquid branch and make some changes to clean this table. 

First we need to filter so that we only have the Liquid values in Product Type and again we need to remove the letters from the quantity field.

In addition, we also need to convert the quantity into millilitres using the following calculation:

Quantity
IF INT([Quantity]) = 1 THEN INT([Quantity])*1000
ELSE INT([Quantity])
END

Make sure you are using the INT function to convert the Quantity field into a number.

Once we have made these changes to again need to aggregate to calculate the total sales and orders. The aggregation should look like this:


Group by: Quantity, Store Name, and Region
Aggregate: Sum Sale Value, and CountD Order ID

After renaming the Order ID field to Present in N Orders we are now ready to output our second table: 


You can download the full output here

After you finish the challenge make sure to fill in the participation tracker, then share your solution on Twitter using #PreppinData and tagging @Datajedininja@JennyMartinDS14 & @TomProwse1

You can also post your solution on the Tableau Forum where we have a Preppin' Data community page. Post your solutions and ask questions if you need any help! 

Popular posts from this blog

2023: Week 1 The Data Source Bank

2023: Week 2 - International Bank Account Numbers

How to...Handle Free Text