Datasets
These are the datasets you will use during the course, including music, travel, and medical datasets.
We'll cover the following
Core dataset (Music)
For the purpose of the course, the code examples will use a dummy dataset of music artists, including information such as:
- Artist Name
- Country
- Number of plays (how many times the artist has been listened to in a specific period)
- Genre (the type of music the artist plays).
Shown below is a subset of the music dataset you’ll be using in the exercises.
Artist | Country | Plays | Genre |
---|---|---|---|
The Beatles | UK | 150 | rock |
Pink Floyd | UK | 10000 | rock |
Metallica | US | 500 | metal |
Cairokee | Egypt | 200 | rock |
ACDC | US | 250 | rock |
The Doors | US | 1000 | rock |
Poets of The Fall | Finland | 250 | rock |
In some lesson categories, there will be additional columns/datasets to allow for more advanced challenges. These include date information and continents of countries, among other columns.
Other datasets
In the Introduction lessons, there will be references to two other datasets: Travel and Medical. There are no specific columns or attributes associated with each; think of them simply as tools to introduce you to the concepts and possible interview questions.
Travel
This dataset includes information about travelers, their nationalities, and their choice of destination.
Medical
This involves patients and their symptoms, as well as their demographic information.