Monitor Azure Data Factory - Azure Data Factory (2024)

  • Article

This article describes:

  • The types of monitoring data you can collect for this service.
  • Ways to analyze that data.

Note

If you're already familiar with this service and/or Azure Monitor and just want to know how to analyze monitoring data, see the Analyze section near the end of this article.

When you have critical applications and business processes that rely on Azure resources, you need to monitor and get alerts for your system. The Azure Monitor service collects and aggregates metrics and logs from every component of your system. Azure Monitor provides you with a view of availability, performance, and resilience, and notifies you of issues. You can use the Azure portal, PowerShell, Azure CLI, REST API, or client libraries to set up and view monitoring data.

  • For more information on Azure Monitor, see the Azure Monitor overview.
  • For more information on how to monitor Azure resources in general, see Monitor Azure resources with Azure Monitor.

Monitoring methods

There are several ways to monitor Azure Data Factory.

Azure Data Factory Studio

You can monitor all of your Data Factory pipeline runs natively in Azure Data Factory Studio. To open the monitoring experience, select Launch Studio from your Data Factory page in the Azure portal, and in Azure Data Factory Studio, select Monitor from the left menu.

For more information about monitoring in Azure Data Factory Studio, see the following articles:

  • Visually monitor Azure Data Factory
  • Data flow monitoring
  • Monitor copy activity
  • Session log in a Copy activity

Azure portal

You can also monitor Azure Data Factory directly from the Azure portal. Several metrics graphs appear on the Azure portal Overview page for your Data Factory. On the left sidebar menu, you can access the Azure Activity log, or select Alerts, Metrics, Diagnostic settings, or Logs from the Monitoring section.

Monitor programmatically

You can monitor Data Factory pipelines programmatically by using .NET, PowerShell, Python, or the REST API. For more information, see the following articles:

  • Programmatically monitor Azure Data Factory
  • Set up diagnostics logs via the Azure Monitor REST API

Resource types

Azure uses the concept of resource types and IDs to identify everything in a subscription. Azure Monitor similarly organizes core monitoring data into metrics and logs based on resource types, also called namespaces. Different metrics and logs are available for different resource types. Your service might be associated with more than one resource type.

Resource types are also part of the resource IDs for every resource running in Azure. For example, one resource type for a virtual machine is Microsoft.Compute/virtualMachines. For a list of services and their associated resource types, see Resource providers.

For more information about the resource types for Azure Data Factory, see Data Factory monitoring data reference.

Data storage

For Azure Monitor:

  • Metrics data is stored in the Azure Monitor metrics database.
  • Log data is stored in the Azure Monitor logs store. Log Analytics is a tool in the Azure portal that can query this store.
  • The Azure activity log is a separate store with its own interface in the Azure portal.

You can optionally route metric and activity log data to the Azure Monitor logs store. You can then use Log Analytics to query the data and correlate it with other log data.

Many services can use diagnostic settings to send metric and log data to other storage locations outside Azure Monitor. Examples include Azure Storage, hosted partner systems, and non-Azure partner systems, by using Event Hubs.

For detailed information on how Azure Monitor stores data, see Azure Monitor data platform.

Store Data Factory pipeline run data

Data Factory stores pipeline run data for only 45 days. Use Azure Monitor to route diagnostic logs if you want to keep the data longer.

Route data to Log Analytics if you want to analyze it with complex queries, create custom alerts, or monitor across data factories. You can route data from multiple data factories to a single Log Analytics workspace.

You can use a storage account or event hub namespace that isn't in the subscription of the resource that emits logs. The user who configures the setting must have appropriate Azure role-based access control (Azure RBAC) access to both subscriptions.

Azure Monitor platform metrics

Azure Monitor provides platform metrics for most services. These metrics are:

  • Individually defined for each namespace.
  • Stored in the Azure Monitor time-series metrics database.
  • Lightweight and capable of supporting near real-time alerting.
  • Used to track the performance of a resource over time.

Collection: Azure Monitor collects platform metrics automatically. No configuration is required.

Routing: You can also usually route platform metrics to Azure Monitor Logs / Log Analytics so you can query them with other log data. For more information, see the Metrics diagnostic setting. For how to configure diagnostic settings for a service, see Create diagnostic settings in Azure Monitor.

For a list of all metrics it's possible to gather for all resources in Azure Monitor, see Supported metrics in Azure Monitor.

For a list of available metrics for Data Factory, see Data Factory monitoring data reference.

Azure Monitor resource logs

Resource logs provide insight into operations that were done by an Azure resource. Logs are generated automatically, but you must route them to Azure Monitor logs to save or query them. Logs are organized in categories. A given namespace might have multiple resource log categories.

Collection: Resource logs aren't collected and stored until you create a diagnostic setting and route the logs to one or more locations. When you create a diagnostic setting, you specify which categories of logs to collect. There are multiple ways to create and maintain diagnostic settings, including the Azure portal, programmatically, and though Azure Policy.

Routing: The suggested default is to route resource logs to Azure Monitor Logs so you can query them with other log data. Other locations such as Azure Storage, Azure Event Hubs, and certain Microsoft monitoring partners are also available. For more information, see Azure resource logs and Resource log destinations.

For detailed information about collecting, storing, and routing resource logs, see Diagnostic settings in Azure Monitor.

For a list of all available resource log categories in Azure Monitor, see Supported resource logs in Azure Monitor.

All resource logs in Azure Monitor have the same header fields, followed by service-specific fields. The common schema is outlined in Azure Monitor resource log schema.

  • For the available Data Factory resource log categories, their associated Log Analytics tables, and the logs schemas, see Data Factory monitoring data reference.

  • To configure diagnostic settings and a Log Analytics workspace to monitor Data Factory, see Configure diagnostic settings and a workspace.

Azure activity log

The activity log contains subscription-level events that track operations for each Azure resource as seen from outside that resource; for example, creating a new resource or starting a virtual machine.

Collection: Activity log events are automatically generated and collected in a separate store for viewing in the Azure portal.

Routing: You can send activity log data to Azure Monitor Logs so you can analyze it alongside other log data. Other locations such as Azure Storage, Azure Event Hubs, and certain Microsoft monitoring partners are also available. For more information on how to route the activity log, see Overview of the Azure activity log.

Monitor integration runtimes

Integration runtime is the compute infrastructure Data Factory uses to provide data integration capabilities across different network environments. Data Factory offers several types of integration runtimes:

  • Azure integration runtime
  • Self-hosted integration runtime
  • Azure-SQL Server Integration Services (SSIS) integration runtime
  • Apache Airflow integration runtime

Azure Monitor collects metrics and diagnostics logs for all types of integration runtimes. For detailed instructions on monitoring integration runtimes, see the following articles:

  • Monitor an integration runtime in Azure Data Factory
  • Monitor an integration runtime within a managed virtual network
  • Monitor self-hosted integration runtime in Azure
  • Configure self-hosted integration runtime for log analytics collection
  • Monitor SSIS operations with Azure Monitor
  • Diagnostics logs and metrics for Apache Airflow

Analyze monitoring data

There are many tools for analyzing monitoring data.

Azure Monitor tools

Azure Monitor supports the following basic tools:

  • Metrics explorer, a tool in the Azure portal that allows you to view and analyze metrics for Azure resources. For more information, see Analyze metrics with Azure Monitor metrics explorer.

  • Log Analytics, a tool in the Azure portal that allows you to query and analyze log data by using the Kusto query language (KQL). For more information, see Get started with log queries in Azure Monitor.

  • The activity log, which has a user interface in the Azure portal for viewing and basic searches. To do more in-depth analysis, you have to route the data to Azure Monitor logs and run more complex queries in Log Analytics.

Tools that allow more complex visualization include:

  • Dashboards that let you combine different kinds of data into a single pane in the Azure portal.
  • Workbooks, customizable reports that you can create in the Azure portal. Workbooks can include text, metrics, and log queries.
  • Grafana, an open platform tool that excels in operational dashboards. You can use Grafana to create dashboards that include data from multiple sources other than Azure Monitor.
  • Power BI, a business analytics service that provides interactive visualizations across various data sources. You can configure Power BI to automatically import log data from Azure Monitor to take advantage of these visualizations.

Azure Monitor export tools

You can get data out of Azure Monitor into other tools by using the following methods:

  • Metrics: Use the REST API for metrics to extract metric data from the Azure Monitor metrics database. The API supports filter expressions to refine the data retrieved. For more information, see Azure Monitor REST API reference.

  • Logs: Use the REST API or the associated client libraries.

  • Another option is the workspace data export.

To get started with the REST API for Azure Monitor, see Azure monitoring REST API walkthrough.

For detailed instructions on configuring diagnostic logs by using the REST API, see Set up diagnostic logs via the Azure Monitor REST API.

Kusto queries

You can analyze monitoring data in the Azure Monitor Logs / Log Analytics store by using the Kusto query language (KQL).

Important

When you select Logs from the service's menu in the portal, Log Analytics opens with the query scope set to the current service. This scope means that log queries will only include data from that type of resource. If you want to run a query that includes data from other Azure services, select Logs from the Azure Monitor menu. See Log query scope and time range in Azure Monitor Log Analytics for details.

For a list of common queries for any service, see the Log Analytics queries interface.

For example queries, select Logs under Monitoring in the left navigation of your Data Factory page in the Azure portal, and then select the Queries tab. Here are some example queries:

PipelineRuns availability: Gives the availability of the pipeline runs.

ADFPipelineRun| where Status != 'InProgress' and Status != 'Queued'| where FailureType != 'UserError'| summarize availability = 100.00 - (100.00*countif(Status != 'Succeeded') / count()) by bin(TimeGenerated, 1h)), _ResourceId| order by TimeGenerated asc| render timechart

Activity runs Top 5 failures: Returns top five activities failing with system errors.

ADFActivityRun | where TimeGenerated >= ago(24h)| where Status != 'InProgress' and Status != 'Queued'| where FailureType != 'UserError'| where ActivityName in (name)| summarize failureCount = countif(Status != 'Succeeded') by bin(TimeGenerated, 1h), ActivityName| top 5 by failureCount desc nulls last| order by TimeGenerated asc| render timechart

Pipeline runs latest status: Returns latest status of pipeline runs.

ADFPipelineRun| summarize argmax(TimeGenerated, * ) by RunId, Status, _ResourceId

Alerts

Azure Monitor alerts proactively notify you when specific conditions are found in your monitoring data. Alerts allow you to identify and address issues in your system before your customers notice them. For more information, see Azure Monitor alerts.

There are many sources of common alerts for Azure resources. For examples of common alerts for Azure resources, see Sample log alert queries. The Azure Monitor Baseline Alerts (AMBA) site provides a semi-automated method of implementing important platform metric alerts, dashboards, and guidelines. The site applies to a continually expanding subset of Azure services, including all services that are part of the Azure Landing Zone (ALZ).

The common alert schema standardizes the consumption of Azure Monitor alert notifications. For more information, see Common alert schema.

Types of alerts

You can alert on any metric or log data source in the Azure Monitor data platform. There are many different types of alerts depending on the services you're monitoring and the monitoring data you're collecting. Different types of alerts have various benefits and drawbacks. For more information, see Choose the right monitoring alert type.

The following list describes the types of Azure Monitor alerts you can create:

  • Metric alerts evaluate resource metrics at regular intervals. Metrics can be platform metrics, custom metrics, logs from Azure Monitor converted to metrics, or Application Insights metrics. Metric alerts can also apply multiple conditions and dynamic thresholds.
  • Log alerts allow users to use a Log Analytics query to evaluate resource logs at a predefined frequency.
  • Activity log alerts trigger when a new activity log event occurs that matches defined conditions. Resource Health alerts and Service Health alerts are activity log alerts that report on your service and resource health.

Some Azure services also support smart detection alerts, Prometheus alerts, or recommended alert rules.

For some services, you can monitor at scale by applying the same metric alert rule to multiple resources of the same type that exist in the same Azure region. Individual notifications are sent for each monitored resource. For supported Azure services and clouds, see Monitor multiple resources with one alert rule.

Data Factory alert rules

To create and manage alerts, select Alerts under Monitoring in the left navigation of your Data Factory page in the Azure portal.

The following table lists popular alert rules for Data Factory. This is just a recommended list. You can set alerts for any metric, log entry, or activity log entry that's listed in the Data Factory monitoring data reference.

Alert typeConditionDescription
MetricFailed pipeline runs metricsWhenever the total Failed pipeline runs metrics is greater than 0
MetricTotal entities countWhenever the maximum Total entities count is greater than 1700000
MetricMaximum allowed entities countWhenever the maximum Total factory size (GB unit) is greater than 6

Notifications provide proactive alerting during or after execution of a pipeline.

  • Send an email with an Azure Data Factory pipeline shows how to configure email notifications from pipeline alerts.
  • Send notifications to a Microsoft Teams channel from an Azure Data Factory pipeline shows how to configure notifications from pipeline alerts into Microsoft Teams.

Advisor recommendations

For some services, if critical conditions or imminent changes occur during resource operations, an alert displays on the service Overview page in the portal. You can find more information and recommended fixes for the alert in Advisor recommendations under Monitoring in the left menu. During normal operations, no advisor recommendations display.

For more information on Azure Advisor, see Azure Advisor overview.

Related content

  • See Data Factory monitoring data reference for a reference of the metrics, logs, and other important values created for Data Factory.
  • See Monitoring Azure resources with Azure Monitor for general details on monitoring Azure resources.
Monitor Azure Data Factory - Azure Data Factory (2024)

FAQs

How to monitor Azure Data Factory jobs? ›

Monitoring Job Details
  1. Go to the list of tasks in the Process Monitoring perspective.
  2. Find the Azure Data Factory task.
  3. Select the task and select the Details button. The Details pane opens on the right hand side and shows a summary of the execution of the selected task.

What is monitor in Azure Data Factory? ›

You can monitor all of your pipeline runs natively in the Azure Data Factory user experience. To open the monitoring experience, select the Monitor & Manage tile in the data factory blade of the Azure portal. If you're already in the ADF UX, click on the Monitor icon on the left sidebar.

How to check Azure Data Factory? ›

You can also monitor Azure Data Factory directly from the Azure portal. Several metrics graphs appear on the Azure portal Overview page for your Data Factory. On the left sidebar menu, you can access the Azure Activity log, or select Alerts, Metrics, Diagnostic settings, or Logs from the Monitoring section.

How long is monitoring data stored in a data factory? ›

Data Factory stores pipeline-run data for only 45 days. Use Monitor if you want to keep that data for a longer time. With Monitor, you can route diagnostic logs for analysis. You can also keep them in a storage account so that you have factory information for your chosen duration.

How do I improve my Azure Data Factory performance? ›

Use modular #pipelines to divide large, complex workflows into smaller, more manageable ones. By dividing big #datasets into smaller pieces and processing them simultaneously, you may make use of parallelism. This method minimizes execution time overall and makes the best use of available #resources.

What types of data does Azure Monitor collect? ›

Azure Monitor Logs collects logs and performance data where they can be retrieved and analyzed in different ways by using log queries. You must create a Log Analytics workspace to collect log data. Use Log Analytics to analyze data from Azure Monitor Logs.

What are the three main functions of Azure monitor? ›

Azure Monitor's core consumption methods include tools to provide insights, visualize, and analyze data. The visualization tools build on the analysis tools and the insights build on top of both the visualization and analysis tools.

What are the two main kinds of data Azure monitor works with? ›

All data collected by Azure Monitor fits into one of two fundamental types: metrics and logs. Metrics are numerical values that describe some aspect of a system at a particular point in time. They are lightweight and capable of supporting near real-time scenarios.

How do I access Azure monitor? ›

Accessing Azure Monitor

In the Azure portal, Access all Azure Monitor features and data from the Monitor menu. Use the Monitoring section in the menu of various Azure services to access the Azure Monitor tools with data filtered to a particular resource.

What is behind Azure Data Factory? ›

Data Factory provides a data integration and transformation layer that works across your digital transformation initiatives. Enable citizen integrators and data engineers to drive business and IT-led Analytics/BI. Prepare data, construct ETL and ELT processes, and orchestrate and monitor pipelines code-free.

What is Azure Data Factory step by step? ›

The first step collects all the data from a different source and then move the data to a centralized location for subsequent processing. We can use the Copy Activity in a data pipeline to move data from both cloud source and on-premises data stores to a centralized data store in the cloud.

How do I practice Azure Data Factory? ›

Try your first demo with one click
  1. A data factory account.
  2. A pipeline within the data factory with one copy activity.
  3. An Azure blob storage with moviesDB2. csv uploaded into an input folder as source.
  4. A linked service to connect the data factory to the Azure blob storage.
May 15, 2024

What are the steps in data monitoring? ›

How to Build an Effective Data Monitoring System
  1. Choose the Right Data Metrics. ...
  2. Set Appropriate Thresholds for Data Metrics. ...
  3. Integrate with Every Step of Your Data Pipeline. ...
  4. Detect Issues Before They Become Problems. ...
  5. Create an Interactive Dashboard. ...
  6. Ensure Scalability.
Jan 22, 2024

How is monitoring data used? ›

Data quality monitoring helps you reveal problem areas where the most inaccuracies are observed, track unusual or abnormal behaviors, and identify where you should focus your data quality initiatives.

Why collect monitoring data? ›

The Basics of Data Monitoring

A company can monitor different areas of its business, such as customer behaviors, product performance, operational efficiency and more. The information found using data insights can help identify patterns, trends and opportunities for businesses to leverage.

How do I check my Azure backup jobs? ›

Sign in to the Azure portal. Open the vault dashboard. On the Backup Items tile, select Azure Virtual Machine. On the Backup Items pane, you can view the list of protected VMs and last backup status with latest restore points time.

What does lookup activity do in ADF? ›

Lookup activity can retrieve a dataset from any of the data sources supported by data factory and Synapse pipelines. You can use it to dynamically determine which objects to operate on in a subsequent activity, instead of hard coding the object name.

How do I monitor database performance in Azure? ›

Monitoring Azure SQL Database can be done effectively using built-in Azure tools like Azure Monitor, Azure SQL Database Auditing, and Azure Log Analytics. These tools give you insights into your database's health, query performance, and security.

How do I monitor Azure infrastructure? ›

You can connect your existing System Center Operations Manager management group to Azure Monitor to collect data from agents into Azure Monitor Logs. This capability allows you to use log queries and solutions to analyze data collected from agents.

Top Articles
Can Microsoft Teams Be Used to Spy on You? - Technipages
ICICI Bank
Tiny Tina Deadshot Build
Ohio Houses With Land for Sale - 1,591 Properties
Best Pizza Novato
Jennifer Hart Facebook
Best Transmission Service Margate
Visustella Battle Core
Craigslist Estate Sales Tucson
Call Follower Osrs
Best Pawn Shops Near Me
South Bend Tribune Online
Mid90S Common Sense Media
Craigslist Pikeville Tn
Restaurants Near Paramount Theater Cedar Rapids
6813472639
Truck Trader Pennsylvania
Cashtapp Atm Near Me
Everything We Know About Gladiator 2
1v1.LOL - Play Free Online | Spatial
Recap: Noah Syndergaard earns his first L.A. win as Dodgers sweep Cardinals
Kamzz Llc
Dallas Craigslist Org Dallas
Wbiw Weather Watchers
Bòlèt Florida Midi 30
Olivia Maeday
Sessional Dates U Of T
Roanoke Skipthegames Com
Pacman Video Guatemala
Gunsmoke Tv Series Wiki
MethStreams Live | BoxingStreams
Human Unitec International Inc (HMNU) Stock Price History Chart & Technical Analysis Graph - TipRanks.com
Hattie Bartons Brownie Recipe
Tendermeetup Login
Clark County Ky Busted Newspaper
Carespot Ocoee Photos
How to Destroy Rule 34
Wattengel Funeral Home Meadow Drive
Tugboat Information
Überblick zum Barotrauma - Überblick zum Barotrauma - MSD Manual Profi-Ausgabe
Oppenheimer Showtimes Near B&B Theatres Liberty Cinema 12
Conan Exiles Armor Flexibility Kit
Bustednewspaper.com Rockbridge County Va
Canvas Elms Umd
Nurses May Be Entitled to Overtime Despite Yearly Salary
De boeken van Val McDermid op volgorde
Sleep Outfitters Springhurst
Read Love in Orbit - Chapter 2 - Page 974 | MangaBuddy
Unit 4 + 2 - Concrete and Clay: The Complete Recordings 1964-1969 - Album Review
Mike De Beer Twitter
Die 10 wichtigsten Sehenswürdigkeiten in NYC, die Sie kennen sollten
Cbs Scores Mlb
Latest Posts
Article information

Author: Velia Krajcik

Last Updated:

Views: 5579

Rating: 4.3 / 5 (74 voted)

Reviews: 81% of readers found this page helpful

Author information

Name: Velia Krajcik

Birthday: 1996-07-27

Address: 520 Balistreri Mount, South Armand, OR 60528

Phone: +466880739437

Job: Future Retail Associate

Hobby: Polo, Scouting, Worldbuilding, Cosplaying, Photography, Rowing, Nordic skating

Introduction: My name is Velia Krajcik, I am a handsome, clean, lucky, gleaming, magnificent, proud, glorious person who loves writing and wants to share my knowledge and understanding with you.