Summary and Quiz

Get a refresher on the Analytics section, and take a short quiz to test your knowledge.

We'll cover the following

In this lesson, we’ll summarize what we have learned about the analytics services provided by AWS. Also, you’ll test our knowledge through a quiz.

Summary

Here is a summary of key takeaways from the analytics section:

  • Amazon Athena: Amazon Athena is a serverless, interactive service that allows us to query data in Amazon S3 using standard SQL queries. Athena can work with various data formats stored in an S3 bucket, including CSV, JSON, ORC, Parquet, and more.

  • Amazon EMR: Amazon EMR is a fully managed service that helps us process and analyze large amounts of data. It simplifies running big data frameworks like Hadoop and Spark on AWS for data processing and analysis. Each EMR cluster has a primary, task, and core node.

  • Amazon Redshift: It is a fully managed data warehousing service. It supports a multi-query engine with a massively parallel architecture, which allows us to perform complex queries on large amounts of data in less time. Each Redshift cluster consists of a leader node responsible for managing the queries and results and multiple compute nodes to perform parallel queries on the data.

  • AWS Glue: AWS Glue is a serverless data integration service that facilitates the easy discovery, preparation, movement, and integration of data from multiple sources, aiding analytics, machine learning, and application development. It automates data discovery through a data crawler, organizes data on a data catalog, and creates ETL jobs to extract, transform, and load data.

Get hands-on with 1400+ tech skills courses.