AWS EMR (Elastic MapReduce) is a platform that enables users to create clusters. It can be used to execute big data frameworks like Spark and Hadoop. Moreover, it also provides a serverless option in which a user can execute any task without configuring a cluster.
In this Cloud Lab, you’ll learn to configure the size and type of systems, set up a framework for performing tasks, and create an EMR cluster. Next, you’ll learn how to write and read data in the S3 bucket with the EMR cluster and run an application on it. After this, you’ll learn how to execute the same task on EMR Serverless, which automatically configures and manages a cluster according to the processing needs of the tasks performed on it.
The following is the high-level architecture diagram of the infrastructure you’ll create in this Cloud Lab: