Monitoring Pipeline Performance and Quality

Learn how to monitor pipeline performance and quality in ADF using Azure Monitor.

We'll cover the following...

Azure Monitor service
Resource alert and health monitoring
- Monitoring Azure Data Factory performance and health
  - Monitoring using Azure portal UI
  - Monitoring using Azure CLI
Create monitors and alerts for ADF

Monitoring pipeline performance is essential to confirm that pipelines run smoothly and deliver the desired results. In this lesson, we’ll discuss how to monitor pipeline performance using Azure Monitor.

Azure Monitor service

Azure Monitor is Azure’s built-in monitoring solution, designed to collect and analyze data from Azure services in near real time. It acts as a centralized monitoring platform that gathers logs and metrics from multiple Azure services, giving users insight into the health and performance of their cloud resources.

At its core, Azure Monitor relies on a distributed architecture that combines data collectors, log analytics, and alerting mechanisms:

Data collectors, such as Azure diagnostic logs, collect data from Azure services and transmit it to Azure Monitor. This data is then stored in a centralized repository for further analysis and visualization.
The log analytics feature lets users query and visualize this data using Kusto Query Language (KQL) to uncover patterns, detect anomalies, and troubleshoot issues efficiently.
Azure Monitor’s alerting capabilities let users set up custom alerts based on defined conditions, which supports proactive monitoring and a quick response to critical events.
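To make the log analytics step concrete, here is a minimal Python sketch that assembles a KQL query counting failed pipeline runs per pipeline. It assumes the factory's diagnostic logs are routed to a Log Analytics workspace in resource-specific mode, so that a table named ADFPipelineRun with TimeGenerated, Status, and PipelineName columns is available. The helper name build_failed_runs_query and its parameters are hypothetical and exist only for this illustration.

```python
from datetime import timedelta


def build_failed_runs_query(lookback: timedelta, top: int = 10) -> str:
    """Return a KQL query that counts failed ADF pipeline runs per pipeline.

    Assumption: ADF logs land in the ADFPipelineRun table of a
    Log Analytics workspace (resource-specific destination mode).
    """
    if lookback <= timedelta(0):
        raise ValueError("lookback must be positive")
    if top < 1:
        raise ValueError("top must be at least 1")

    # KQL's ago() accepts timespan literals such as 24h; round down to whole hours.
    hours = max(1, int(lookback.total_seconds() // 3600))

    return "\n".join([
        "ADFPipelineRun",
        f"| where TimeGenerated > ago({hours}h)",
        '| where Status == "Failed"',
        "| summarize FailedRuns = count() by PipelineName",
        f"| top {top} by FailedRuns desc",
    ])


# Build a query covering the last day and show it.
query = build_failed_runs_query(timedelta(days=1))
print(query)
```

The resulting text can be pasted into the Logs blade of the workspace or passed to whichever query client is in use. Keeping the query construction in a small function makes the lookback window and result size easy to adjust.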

Resource alert and health monitoring

As seen earlier, Azure Monitor can track the health of resources across an Azure subscription. Along with its pre-built metrics, it also supports custom metrics. Below are some commonly used Azure Monitor metrics:

CPU percentage: Monitors the CPU utilization of virtual machines, providing insights into resource consumption and performance bottlenecks.
Memory usage: Tracks the memory consumption of virtual machines, ensuring optimal memory allocation and preventing potential memory-related issues.
Network in/out: Measures the incoming and outgoing network traffic for virtual machines, helping to identify network performance issues and bandwidth requirements.

Monitoring and alerting generally go hand in hand. Once a resource is being monitored, custom alerts can be defined on top of the collected data to keep the health of that resource in check. Here are some commonly used alerts that can be set up using Azure Monitor:

High CPU usage: Sets up an alert to trigger when CPU usage exceeds a defined threshold, enabling proactive investigation of resource-intensive processes or applications.
Low memory availability: Creates an alert when available memory drops below a specified level, allowing ...
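The two alert conditions above share one pattern: aggregate a metric over a window and compare the result against a threshold. The sketch below illustrates that pattern in plain Python. It is not how Azure Monitor is implemented or configured; the ThresholdRule class, the sample values, and the thresholds (80 percent CPU, 1 GiB of available memory) are all assumptions chosen for illustration.

```python
import operator
from dataclasses import dataclass
from statistics import mean
from typing import Callable, Sequence


@dataclass(frozen=True)
class ThresholdRule:
    """Hypothetical static-threshold rule: fires when the window average
    satisfies compare(average, threshold)."""
    name: str
    compare: Callable[[float, float], bool]
    threshold: float

    def fires(self, samples: Sequence[float]) -> bool:
        # With no data in the window there is nothing to evaluate.
        if not samples:
            return False
        return self.compare(mean(samples), self.threshold)


# Illustrative rules; the thresholds are assumptions, not recommendations.
high_cpu = ThresholdRule("High CPU usage", operator.gt, 80.0)            # percent
low_memory = ThresholdRule("Low memory availability", operator.lt, 1.0)  # GiB available

# Made-up samples for one evaluation window.
cpu_window = [72.0, 88.5, 91.0, 86.5]
memory_window = [2.5, 2.1, 1.8, 1.6]

for rule, window in ((high_cpu, cpu_window), (low_memory, memory_window)):
    state = "FIRED" if rule.fires(window) else "ok"
    print(f"{rule.name}: {state}")
```

Averaging over a window rather than reacting to single samples keeps one short spike from triggering an alert, which is why the rule takes a sequence instead of a single value.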
