The DP - 600T00 : Microsoft Fabric Analytics Engineer course is designed to provide learners with comprehensive knowledge and hands-on experience in end-to-end analytics using Microsoft Fabric. The course covers a wide range of topics, from introduction to analytics to administering Microsoft Fabric, data ingestion, and data management. Learners will gain practical skills in using Dataflows Gen2, Spark, Data Factory pipelines, and managing Lakehouses within Microsoft Fabric. Throughout the modules, participants will learn about organizing data using the Medallion architecture design, working with Apache Spark, Delta Lake tables, and securing their data environments. The course also delves into data warehousing, teaching how to load, query, monitor, optimize, and model data warehouses. Moreover, it addresses scalability in Power BI, creating model relationships, performance optimization tools, and enforcing model security. This extensive training is crucial for those looking to excel in managing and analyzing data with Microsoft's analytics tools.
Audience Profile:
The primary audience for this course is:
-
Data professionals with experience in data modeling, extraction, and analytics.
-
Professionals who want to use Microsoft Fabric to create and deploy enterprise-scale data analytics solutions.
Prerequisites:
Course Outline:
Module 1: Ingest Data with Dataflows Gen2 in Microsoft Fabric
-
Describe Dataflow (Gen2) capabilities in Microsoft Fabric.
-
Create Dataflow (Gen2) solutions to ingest and transform data.
-
Include a Dataflow (Gen2) in a pipeline.
Module 2: Ingest Data with Spark and Microsoft Fabric Notebooks
-
Ingest external data to Fabric lakehouses using Spark.
-
Configure external source authentication and optimization.
-
Load data into lakehouse as files or as Delta tables.
Module 3: Use Data Factory Pipelines in Microsoft Fabric
-
Describe pipeline capabilities in Microsoft Fabric.
-
Use the Copy Data activity in a pipeline.
-
Create pipelines based on predefined templates.
-
Run and monitor pipelines.
Module 4: Get Started with Lakehouses in Microsoft Fabric
-
Describe core features and capabilities of lakehouses in Microsoft Fabric.
-
Create a lakehouse.
-
Ingest data into files and tables in a lakehouse.
-
Query lakehouse tables with SQL.
Module 5: Organize a Fabric Lakehouse Using Medallion Architecture Design
-
Describe the principles of using the medallion architecture in data management.
-
Apply the medallion architecture framework within Microsoft Fabric.
-
Analyze data stored in the lakehouse using DirectLake in Power BI.
-
Describe best practices for ensuring the security and governance of data stored in the medallion architecture.
Module 6: Use Apache Spark in Microsoft Fabric
-
Configure Spark in a Microsoft Fabric workspace.
-
Identify suitable scenarios for Spark notebooks and Spark jobs.
-
Use Spark dataframes to analyze and transform data.
-
Use Spark SQL to query data in tables and views.
-
Visualize data in a Spark notebook.
Module 7: Work with Delta Lake Tables in Microsoft Fabric
-
Understand Delta Lake and delta tables in Microsoft Fabric.
-
Create and manage delta tables using Spark.
-
Use Spark to query and transform data in delta tables.
-
Use delta tables with Spark structured streaming.
Module 8: Get Started with Data Warehouses in Microsoft Fabric
-
Describe data warehouses in Fabric.
-
Understand a data warehouse vs a data lakehouse.
-
Work with data warehouses in Fabric.
-
Create and manage datasets within a data warehouse.
Module 9: Load Data into a Microsoft Fabric Data Warehouse
-
Learn different strategies to load data into a data warehouse in Microsoft Fabric.
-
Learn how to build a data pipeline to load a warehouse in Microsoft Fabric.
-
Learn how to load data in a warehouse using T-SQL.
-
Learn how to load and transform data with dataflow (Gen 2).
Module 10: Monitor a Microsoft Fabric Data Warehouse
-
Monitor capacity unit usage with the Microsoft Fabric Capacity Metrics app.
-
Monitor current activity in the data warehouse with dynamic management views.
-
Monitor querying trends with query insights views.
Module 11: Understand Scalability in Power BI
-
Describe the importance of building scalable data models.
-
Implement Power BI data modeling best practices.
-
Use the Power BI large dataset storage format.
Module 12: Create Power BI Model Relationships
-
Understand how model relationships work.
-
Set up relationships.
-
Use DAX relationship functions.
-
Understand relationship evaluation.
Module 13: Use Tools to Optimize Power BI Performance
-
Optimize queries using performance analyzer.
-
Troubleshoot DAX performance using DAX Studio.
-
Optimize a data model using Tabular Editor.
Module 14: Enforce Power BI Model Security