ADF Interview Questions and Answers
Introduction to Azure Data Factory
Integration Runtime (IR)
- What is an Integration Runtime in ADF, and what are the types of IR?
- Which IR will you use if you need to read data from an on-premises network to Azure Cloud?
- If your Self-Hosted IR is running slowly, what steps will you take in Azure Data Factory?
- How will you check if SHIR is not working?
- Can a single Self-Hosted IR be used in more than one Data Factory?
- For load balancing and failover scenarios, what is the maximum number of nodes we can add to a SHIR?
- Is it okay to create multiple Integration Runtimes in a single Data Factory, and why?
Pipelines and Activities
- What are Pipelines in Azure Data Factory?
- What is an Activity in Azure Data Factory?
- How will you control the flow of activities in an Azure Data Factory Pipeline?
- Which activity will you use to delete all files in Azure Data Factory?
- Explain a few activities in an Azure Data Factory Pipeline that can run a stored procedure.
- How can you complete a task that cannot be done with built-in activities in Azure Data Factory?
- Can the same linked service be used in multiple Pipelines in Azure Data Factory?
- To get a list of all files from Azure Blob Storage, which activity will you use?
Triggers
Linked Services and Datasets
SSIS Integration
Data Operations
- Does Copy Activity support UPSERT operations?
- How do you remove duplicate records in Azure Data Factory?
- If you need to add an extra column or derive extra columns from source columns, can you use Copy Activity?
- You are given hundreds of files, and you need to load them into separate tables and create the tables on the fly. How would you do this?
- Multiple files are loaded to a single table. How will you identify the file names for each record?
Error Handling and Recovery
Security and Networking
Parameters and Variables
Monitoring and Logging
- Where will you see the Pipeline Runs and Triggers?
- How will you keep the execution log history of ADF Pipelines for more than 3 months?
- How will you get the last 2 months' execution details of pipelines if your pipelines have been failing?
- How do you find out which Pipeline is billed the most?
- How will you check if a file exists or does not exist in Blob storage in Azure Data Factory?
Data Flows
CI/CD and Source Control
Notifications
Miscellaneous
- In which modes can an Azure Data Factory Pipeline run?
- The Microsoft.EventGrid resource provider is not registered in the subscription. How do you resolve this?
- What steps will you take if your pipelines are stuck in a queue or in progress status?
- How often do you upgrade or update your Self-Hosted Integration Runtime?