Business Machine Learning/

...

Morale Function and Model Error

Learn about the true (morale) function and the total (model) error.

We'll cover the following...

True function
Total error

Press + to interact

Python 3.8

# try this, the trend is the same as in the next plot! but with missing points, right?
# plt.plot(week_points, morale_points);
# At this stage, this code below should be self-explanatory! 
fig1 , (ax1, ax2) = plt.subplots(ncols=2,figsize=(16,6), sharey=True)
# Available data
ax1.plot(week_points, morale_points, lw=5.0, c='r', alpha=0.7, label='true function')
ax1.scatter(week_points, morale_points, s = 100)
# Interpolated data
ax2.plot(days, morale_true, lw=5.0, c='r', alpha=0.7, label='true function')
ax2.scatter(days, morale_true, s = 100)
# Setting title, labels ...... etc!
ax1.set_title('\nMorale over time (available data)\n')
ax2.set_title('\nMorale over time (interpolated data)\n')
ax1.set_xlabel('days\n')
ax2.set_xlabel('days\n')
ax1.set_ylabel('morale\n')
ax1.legend(loc=2) # 'upper left'
ax2.legend(loc=2)#'upper left'
plt.tight_layout();

Our true function for morale can have the following interpretations.

With no measurement error:
- All students may have the same morale at every time point (day), and the function represents no measurement error in the morale at the given time or day for any student. This is a situation where our measurement tool or survey was perfect, and we measured the same morale for every student at each time point.
- What if there were some unavoidable issues in the instrument that randomly added some noise in each observation?
- What if some external parameter (weather) affected the measurement tool someday and added an error (unavoidable and irreducible)?
With no individual variance:
- We can interpret that our true function is the baseline morale for each time point, and all students vary around this function to some degree (±). A student’s morale at any given time point is baseline ± deviation. However, there is no individual variance in a student (for a particular student, the morale line is just an offset ± from the baseline). This might mean the variance is biased to the baseline.
- Just a heads-up, while generating the data for an individual student, we’ll add some random noise in the true function to create some individual variance.
Average or mean across infinite students:
- Our measurements of morale vary at each time point for an individual student. Still, if we had an infinite number of students and averaged all their morale measurements across all time points, we would have the true function of morale. We might need to factor in high variance or not being able to quantify the relationship.

In the situations above, we are trying to interpret morale as a function of time with no error. However, each situation is a different source of error.

Irreducible error: Occurs from an imperfect ability to measure morale because of some unavoidable reasons.
Bias error: Occurs from an imperfect relationship between time and morale.
Variance error: Occurs from an insufficient amount of good data that can correctly quantify the relationship(s).

These sources of errors combine, resulting in the final error in our trained model.

Note: We always have errors in our models. However, it depends on how much and what proportion of each type. We can play with bias and variance to find the sweet spot. However, we can’t do anything about the irreducible error.

Total error

There are three sources of error in a model:

Course Introduction

Linear Regression

Regularization

Bias-Variance Trade-off

Categorical Features

Logistic Regression

Logistic Regression: Titanic Data

Sentiment Analysis Using Multinomial Logistic Regression

Multiclass Classification and Handling Imbalanced Classes

Project: Predicting Chronic Kidney Disease

K-Nearest Neighbors

Implementation of K-Nearest Neighbors

Logistic Regression vs. KNN

Decision Tree Learning

Implement the Decision Tree Classifier from Scratch

Bootstrapping and Confidence Interval

Support Vector Machine

Practice and Comparisons

What's Next?

Appendix

Morale Function and Model Error

True function

Total error