Joint Probability

In this lesson, you will learn about joint probability with the help of some examples.

Random variable

A random variable is a function that assigns a real number to each outcome in the sample space of a random experiment, such as tossing a coin or rolling a die.

Example 1

When we flip a coin twice, there are four possible outcomes: both heads, heads then tails, tails then heads, or both tails. The random variable x below gives the number of heads in each of these four outcomes:

Sample Space x
HH 2
HT 1
TH 1
TT 0
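
The table above can be reproduced by enumerating the sample space. This is a minimal sketch; the names sample_space and x are chosen here for illustration.

```python
from itertools import product

# Enumerate the sample space of two coin flips: HH, HT, TH, TT.
sample_space = ["".join(flips) for flips in product("HT", repeat=2)]

# The random variable x maps each outcome to its number of heads.
x = {outcome: outcome.count("H") for outcome in sample_space}

for outcome, heads in x.items():
    print(outcome, heads)
```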

Example 2

The random variable Y represents the number of times we roll a “5” when rolling a six-sided die six times. The possible values of Y therefore range from 0 (we never roll a “5”) to 6 (we roll a “5” every time).
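
One value of Y can be obtained by simulating the six rolls. The sketch below assumes a fair die; the helper count_fives and the fixed seed are illustrative choices, not part of the lesson.

```python
import random

def count_fives(rolls):
    """Return Y, the number of rolls that show a 5."""
    return sum(1 for r in rolls if r == 5)

rng = random.Random(0)  # fixed seed so the run is repeatable
rolls = [rng.randint(1, 6) for _ in range(6)]  # six rolls of a fair die
y = count_fives(rolls)
print(rolls, y)
```

Whatever the rolls turn out to be, y always lies between 0 and 6.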

Joint probability

Joint probability is the probability that two or more events occur together. It is widely used in data science, especially with text data, where we often need the probability that certain words occur together. The joint probability of two events A and B is written as:


P(A and B) = P(A \cap B)


Here A \cap B denotes the event that both A and B occur.
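
For the text-data case, a joint probability can be estimated as the fraction of sentences in which two words appear together. The sketch below uses a made-up toy corpus and a hypothetical helper joint_probability; it only illustrates the idea.

```python
from fractions import Fraction

# Toy corpus, made up for illustration only.
sentences = [
    "data science is fun",
    "data is everywhere",
    "science needs data",
    "probability is useful",
]

def joint_probability(word_a, word_b, docs):
    """Fraction of documents that contain both words."""
    both = sum(1 for d in docs if word_a in d.split() and word_b in d.split())
    return Fraction(both, len(docs))

# A = sentence contains "data", B = sentence contains "science".
p_joint = joint_probability("data", "science", sentences)
print(p_joint)  # 1/2
```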

Probability for independent events

If the occurrence of one event does not affect the probability of the other, the events are said to be independent, and their joint probability is the product of the individual probabilities:


P(A and B) = P(A, B) = P(A \cap B) = P(A) * P(B)


where P(A) and P(B) are the probabilities of events A and B, respectively.
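
The product rule can be checked by enumeration. The sketch below assumes two fair dice and picks two illustrative events: A (the first die shows 5) and B (the second die is even). These events are not from the lesson; they are chosen because they are independent.

```python
from fractions import Fraction
from itertools import product

# All 36 equally likely outcomes of rolling two dice.
outcomes = list(product(range(1, 7), repeat=2))

def prob(event):
    """Probability of an event, given as a predicate on an outcome."""
    return Fraction(sum(1 for o in outcomes if event(o)), len(outcomes))

p_a = prob(lambda o: o[0] == 5)                       # first die shows 5
p_b = prob(lambda o: o[1] % 2 == 0)                   # second die is even
p_ab = prob(lambda o: o[0] == 5 and o[1] % 2 == 0)    # both happen

print(p_a, p_b, p_ab, p_ab == p_a * p_b)  # 1/6 1/2 1/12 True
```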

Example

A jar contains three red, five green, two blue, and six yellow marbles. A marble is chosen from the jar at random. After replacing it, a second marble is chosen. What is the probability of choosing a green and then a yellow marble?

Solution

From the question, we can deduce the following:

  • Number of Red Marbles = 3
  • Number of Green Marbles = 5
  • Number of Blue Marbles = 2
  • Number of Yellow Marbles = 6
  • Total Marbles = 16
  • Let A be the event that a green marble is drawn
  • Let B be the event that a yellow marble is drawn
  • Let C be the event that a blue marble is drawn
  • Let D be the event that a red marble is drawn
  • Because the first marble is replaced before the second is drawn, the two draws are independent of each other, so we can proceed as follows.

P(A) = \frac{5}{16}

P(B) = \frac{6}{16} ...
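
The two probabilities can be combined with exact fractions. This is a minimal sketch, assuming the draws are independent because the first marble is replaced; the variable names are illustrative.

```python
from fractions import Fraction

# Marble counts taken from the problem statement.
marbles = {"red": 3, "green": 5, "blue": 2, "yellow": 6}
total = sum(marbles.values())  # 16

p_green = Fraction(marbles["green"], total)    # P(A) = 5/16
p_yellow = Fraction(marbles["yellow"], total)  # P(B) = 6/16, stored reduced as 3/8

# With replacement the draws are independent, so the probabilities multiply.
p_green_then_yellow = p_green * p_yellow
print(p_green_then_yellow)  # 15/128
```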