Understanding the Bias in Model Development
Learn about the various biases that can occur in the AI solution development process.
Bias in the data preparation phase
Bias may enter the system even before we have started the model-building steps.
During problem formation itself, we need to validate if AI is an ethical solution to this problem.
Data collection and preparation is the stage when a lot of biases creep into the solutions. We need to validate if the data is free of any selection bias and is representative of different segments; there should be no biases in data labels and features getting created.
Let’s explore a few of these using some practical examples:
-
Selection bias: This refers to a type of bias that occurs when the process of selecting data for analysis or study results in a nonrandom or unrepresentative sample. It arises when certain characteristics or factors influence the inclusion or exclusion of data points, leading to a skewed or biased representation of the population of interest. Imagine a model that is developed to predict the success of a new smartphone based on surveys conducted with customers who purchased that specific phone. However, the surveys only include feedback from customers who bought that particular phone and do not include any input from customers who chose to purchase a different brand or model. As a result, the model’s predictions may be biased toward positive outcomes because it lacks information about the experiences and preferences of customers who opted for a different phone.
-
Biased ...