Tidyverse Overview
Explore the tidyverse and its core principles to enhance data manipulation and visualization in R. Understand tidy data standards, how the tidyverse ensures consistency across packages, and how it reduces time spent on data cleaning so you can concentrate on analysis.
Let’s go over what the tidyverse is.
tidyverse overview
The tidyverse is a set of R packages designed to work together seamlessly so data manipulation and visualization are consistent, logical, and easily readable. The tidyverse isn’t the only way to do things in R. Many R users don’t use the tidyverse at all; they stick with base R and layer other packages on top of it. Indeed, we already have all the skills needed to do many analyses in R. However, in a data science context, there are some significant benefits to moving into the tidyverse.
One thing we quickly discover outside of the tidyverse is that a considerable portion of our code goes to converting the output of one function into something usable as the input to another function. The tidyverse is all about ensuring consistency to address that need, so we can focus on analysis rather than on reshaping data between steps.
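To make the point about converting outputs concrete, here is a minimal sketch. It assumes the dplyr package (part of the core tidyverse) is installed and uses R's built-in `mtcars` dataset; the dataset, the column choices, and the variable names `base_tbl` and `summary_tbl` are illustrative assumptions, not part of this lesson's material.

```r
library(dplyr)

# Base R: tapply() returns a named array, not a data frame,
# so we convert it by hand before we can keep working with it as a table.
base_means <- tapply(mtcars$mpg, mtcars$cyl, mean)
base_tbl <- data.frame(
  cyl = as.numeric(names(base_means)),
  mean_mpg = as.vector(base_means)
)

# dplyr: every verb takes a data frame and returns a data frame,
# so the steps chain together without any conversion code.
summary_tbl <- mtcars %>%
  filter(cyl %in% c(4, 6)) %>%          # keep 4- and 6-cylinder cars
  group_by(cyl) %>%                     # one group per cylinder count
  summarise(mean_mpg = mean(mpg),       # average miles per gallon
            n_cars = n()) %>%           # number of cars in each group
  arrange(desc(mean_mpg))               # highest average first

print(summary_tbl)
```

Both approaches arrive at the same averages. The difference is that the base R version needs an explicit conversion step, while each dplyr step hands a data frame straight to the next one.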
A good rule of thumb for data scientists is that about 80% of our code goes to cleaning our data, a.k.a. tidying it up. The tidyverse will help reduce that 80% so we can stay focused on analysis. The standardization we see in the tidyverse also shows up in how we call and use functions within its packages. ...