Data-Centric Statistical Inference Using R and Tidyverse/

...

Two Numerical Explanatory Variables

Learn about two numerical explanatory variables in multiple regression.

We'll cover the following...

Exploratory data analysis

Let’s now consider multiple regression models where, instead of one numerical and one categorical explanatory variable, we have two numerical explanatory variables. The dataset we’ll use is from the textbook, An Introduction to Statistical Learning with Applications in R (James et al., 2017). Its accompanying ISLR R package contains the datasets to which the authors apply various machine-learning methods.

One frequently used dataset in this course is the Credit dataset, where the outcome variable of interest is the credit card debt of 400 individuals. Other variables like income, credit limit, credit rating, and age are included as well. Note that the Credit data isn’t based on real individuals’ financial information, but rather is a simulated dataset used for educational purposes.

In this lesson, we’ll fit a regression model where we have:

A numerical outcome variable y, the cardholder’s credit card debt
Two explanatory variables:
- One numerical explanatory variable x₁, which is the cardholder’s credit limit
- Another numerical explanatory variable x₂, which is the cardholder’s income (in thousands of dollars) ...

Getting Started with Data in R

Data Visualization

Data Wrangling

Data Importing and “Tidy” Data

Basic Regression

Multiple Regression

Statistical Inference with the infer Package

Bootstrapping and Confidence Intervals

Hypothesis Testing

Inference for Regression

Price Prediction With Regression Analysis in R

Tell a Story with Data

Appendix

Uber Data Analysis Using the R Language

Two Numerical Explanatory Variables

Exploratory data analysis