Link Azure Storage to Azure Data Factory
Learn how to connect storage accounts to ADF, which is crucial for storing raw source and sink files processed by data factory operations.
Microsoft Azure offers Azure Storage, a highly scalable and secure cloud storage service. It provides a wide variety of storage services, including
For creating and linking storage instances to the data factory, ensure that the following prerequisites are completed in the Azure environment:
An active Azure subscription
An active resource group
An Azure Data Factory (ADF) instance
We have detailed the instructions for creating an Azure account and a data factory instance in previous lessons; be sure to complete them!
Creating and linking storage instances in Azure
Creating and linking storage instances in Azure is a fundamental step in setting up a data storage solution. The various storage options within Azure Storage are available across different regions and are designed to meet different performance, scalability, and cost requirements. To create a storage instance in Azure, specify the storage type, storage account name, replication type, and storage region.
Azure Data Factory’s linked services create links between the platform and outside data sources or destinations, such as cloud services, file systems, and databases. They include the authentication information, connection string, and pertinent characteristics required to access the data. Linked services function as a bridge, enabling easy data transfer between Azure Data Factory and the linked data sources or destinations. They are essential for creating and maintaining the links necessary for effective data integration procedures.
Linked services offer the advantage of connection reuse across pipelines, saving time and effort. In larger organizations, using a single Linked service avoids creating duplicate connections for the same data source or destination, enhancing efficiency. They provide a straightforward and efficient way to connect to external data sources or destinations in Azure Data Factory. By defining necessary properties and authentication details, Linked services enable seamless integration and simplify connection management. This promotes connection reuse across pipelines, optimizing efficiency and minimizing duplication.
Step 1: Create an Azure Storage instance
In the earlier lessons, we created an Azure Data Factory instance and an Azure Storage instance. In this lesson, we will link the two together, so we have a storage layer for the data processing. Azure Storage will ensure the is a location where all raw, processed, and output files can stay. Let’s start by recapping the creation of an Azure Data Factory instance:
Note: The names of the Azure Data Factory (ADF) and Azure Storage instances will have to be a globally unique namestherefore the user will be unable to use the name used below.
Get hands-on with 1300+ tech skills courses.