Comparison with Confidence Intervals
Learn about the comparison of hypothesis testing with confidence intervals.
We'll cover the following...
One of the great things about the infer
package is that we can jump seamlessly between conducting hypothesis tests and constructing confidence intervals with minimal changes! Recall the code from the previous section that creates the null distribution, which in turn is needed to compute the
null_distribution <- promotions %>%specify(formula = decision ~ gender, success = "promoted") %>% hypothesize(null = "independence") %>%generate(reps = 1000, type = "permute") %>%calculate(stat = "diff in props", order = c("male", "female"))
To create the corresponding bootstrap distribution needed to construct a 95% confidence interval hypothesize()
step because weβre no longer assuming a null hypothesis hypothesize()
line of code. Second, we switch the type
of resampling in the generate()
step to bootstrap instead of permute.
bootstrap_distribution <- promotions %>%specify(formula = decision ~ gender, success = "promoted") %>%# Change 1 - Remove hypothesize():# hypothesize(null = "independence") %>%# Change 2 - Switch type from "permute" to "bootstrap":generate(reps = 1000, type = "bootstrap") %>%calculate(stat = "diff in props", order = c("male", "female"))
Using this bootstrap_distribution
, letβs first compute the percentile-based confidence intervals:
percentile_ci <- bootstrap_distribution %>% get_confidence_interval(level = 0.95, type = "percentile")percentile_ci
Using our shorthand interpretation for 95% confidence intervals, weβre 95% confident that the true difference in population proportions