Data Classification

Learn how to split the provided samples into training and testing samples and ponder about data classification.

Splitting the data

In effect, splitting the data into two subsets can be defined around some filter functions. We’ll avoid Python for a moment and focus on the conceptual math to make sure we have the logic completely correct before diving into code. Conceptually, we have a pair of functions, e(sis_i) and r(sis_i), that decide if a sample, (sis_i), is for testing, ee, or training, rr ...