Deal with Mislabeled and Imbalanced Machine Learning Datasets/

...

Overview of AI, ML, DL, and Supervised/Unsupervised Learning

Learn about the different concepts in artificial intelligence, including machine, deep, supervised, and unsupervised learning.

We'll cover the following...

Artificial intelligence
Machine learning
Machine learning process
- Types of machine learning
- Machine learning applications
Deep learning
Supervised learning
How supervised learning works
- Regression
- Classification
Unsupervised learning
Summary
References

Artificial intelligence

Artificial intelligence (AI) is a field of science and engineering that aims to develop intelligent machines that think and act like humans. These machines can mimic human behavior and perform tasks through learning and problem-solving.

The most popular definitions of AI are as follows:

“A computer system able to perform tasks that normally require human intelligence, such as visual perception, speech recognition, decision-making, and translation between languages [1].”
“The capability of machines to imitate intelligent human behavior [2].”

Press + to interact

Deep learning

Deep learning (DL) is a subset of ML that employs algorithms inspired by the human brain’s structure and function. In particular, DL is based on neural networks, whose architecture is also inspired by the human brain. The word deep implies that several layers are used in the neural networks. DL uses massive structured and unstructured data to train models to detect or predict the desired results. One key advantage of DL is its ability to extract features from data automatically.

DL models have numerous applications, such as image classification, text classification, recommendation systems, self-driving cars, voice recognition, and more.

The basic structure of deep learning

The basic architecture of a DL model is as follows:

Input layer: This layer accepts input data and passes it to the next layer for processing.
Hidden layer: Lying between the input and output layer, each of these layers is made up of many nodes, which are also known as neurons. These nodes receive input from the previous layer and then process it in order to generate output to deliver to the next layer.
Output layer: The output layer shows the predicted output. The number of nodes in the output layer depends on the number of classes in the data.

Each layer in a DL model is typically made up of a large number of neurons, which are interconnected and use mathematical operations to process data. The neurons in one layer are connected to the neurons in the next layer, and the weights of these connections are learned during training using optimization techniques.

Types of deep learning

The figure below represents the most commonly used types of DL:

Press + to interact

The machine will learn features with the corresponding labels, as described below:

The machine will give an object the label “banana” if the shape of the object is long with a curved cylinder and a green-yellow color.
The machine will give an object the label "apple" if the shape of the object is round with depression at the top and has a red color.

After training, the model is validated using test data to check its performance.

Supervised learning is used to solve two problems: regression and classification.

Regression

Regression predicts continuous output in a value, such as in “house price” or “temperature,” for example. In particular, it identifies the relationship between a dependent variable (the target variable) and one or more independent variables (the features). Regression is achieved using the techniques given in the figure below:

Press + to interact

For example, if the provided dataset contains different fruits that are never seen by the machine, then the machines cannot classify different types of fruits directly because they do not know their labels. However, the fruits can still be classified into several groups (clusters) based on their features.

Summary

It should be noted that the terms artificial intelligence (AI), machine learning (ML), and deep learning (DL) are frequently used interchangeably and can be confusing. In short, AI is the general concept responsible for creating human intelligence in machines. Meanwhile, ML enables machines to learn automatically from data-identifying patterns without being explicitly programmed, and DL depends on massive data and several layers in the neural network.

Press + to interact

Supervised and unsupervised learning are two different approaches to machine learning, each with their own strengths and weaknesses. In supervised learning, the model is trained using labeled data in which the desired output is known. This approach is useful in classification and regression problems, where the goal is to predict output based on input features. In contrast, unsupervised learning is used when the data is not labeled, in which case the model finds patterns from the data and divides it into groups based on features.

While supervised and unsupervised learning have unique advantages and are suitable for different tasks, we will solely focus on supervised learning in this course. The choice between supervised and unsupervised learning depends on the task and type of data available. Depending on your objectives, select the appropriate approach by properly understanding the difference between these fields, which will help you achieve the desired outcomes for your projects.

References

[1] Russell, Stuart and Norvig Peter, Artificial Intelligence: A Modern Approach (Prentice Hall, 2009).

[2] Poole, D., and Mackworth, A., Artificial Intelligence: Foundations of Computational AgentsCambridge University Press, 2017).

Shape	Size	Color	Label/Target
Round in shape with a depression at the top	Big	Green-red	Apple
Round shape	Big	Orange	Orange
Long curved cylinder	Big	Yellow	Banana

Introduction to the Course

Getting Started

Understanding Noisy Data, Label Noise, and Its Types

Introduction to Convolutional Neural Network (CNN)

Cats vs Dogs Classification with Convolutional Neural Networks

Performance Comparison of Mislabeled and Clean Dataset

Dealing with Imbalance Dataset

Gauge the Impact of Imbalanced and Mislabeled Datasets

Comprehensive Quiz

Wrap Up

Appendix

Dealing With Small Datasets In ML

Overview of AI, ML, DL, and Supervised/Unsupervised Learning

Artificial intelligence

Machine learning

Machine learning process

Types of machine learning

Machine learning applications

Deep learning

The basic structure of deep learning

Types of deep learning

Applications of deep learning

Supervised learning

How supervised learning works

Fruit Dataset

Regression

Classification

Unsupervised learning

Summary

References