Amazon Athena is an interactive query service that simplifies the process of querying data stored in different locations. The queries in Athena are based on metadata, which is data about the underlying data in a database. Athena supports federated queries, allowing it to work with distributed data sources. This is done by creating connectors in AWS Lambda.
In this Cloud Lab, you’ll provision an EC2 instance and an Aurora DB cluster with a MySQL-compatible database. You’ll also create a spill bucket in S3 and an IAM role. After that, you’ll deploy a connector using a Lambda application and then use your Aurora DB cluster as a data source in Athena. Finally, you’ll execute queries in a query editor provided by Athena to fetch data from your database.
After completing this Cloud Lab, you’ll be able to work with Amazon RDS and Athena to create database instances and execute federated queries using the query editor.
The following is the high-level architecture diagram of the infrastructure you’ll create in this Cloud Lab: