2020: Week 46

 Challenge by Tom Prowse.

At Prep Air, we have decided to do some research into the risks of running an airline. We want to complete some analysis on some historic aviation incident reports so we can try to identify potential areas where we can make our airline safer.

We have taken a selection of reports from the AeroInside website, who document various incident reports from around the world. Each report contains information about the incident, but is a free text field so doesn't really have a structure.

In this challenge, we want to parse out the key information from the string, and then see how many incidents occur that are related to our key categories. 

Inputs

Incident List

Category List

Requirements

  • Input the Data
  • Parse out the following information from the incident string: 
    • Aircraft - eg, American B738
    • Location - Amsterdam
    • Date - Apr 21st 2016
    • Incident Description - details about the incident
  • Convert date field from string to a date
  • Combine similar incident types. Eg, Attendant & Attendants or Pressure & Pressurize. (Tableau Prep's Grouping functionality will help here!)
  • Create a total for how many incidents happened within each category

Outputs


Incident List - 4 Fields, 100 Rows (101 including headers)


Category Total - 2 Fields, 10 Rows (11 including headers)


The full outputs can be downloaded here.

After you finish the challenge make sure to fill in the participation tracker, then share your solution on Twitter using #PreppinData and tagging @Datajedininja@JennyMartinDS14@JonathanAllenby & @TomProwse1

You can also post your solution on the Tableau Forum where we have a Preppin' Data community page. Post your solutions and ask questions if you need any help! 

Popular posts from this blog

2023: Week 1 The Data Source Bank

2023: Week 2 - International Bank Account Numbers

How to...Handle Free Text