Data Integration with Cloud Data Fusion

WGAC-GGL-DICDF

Google Cloud Training Courses Certification

Schedule

See all training events for this courseSee all CoursesProfessional Services & Support
Virtual Classroom
Open
Google

Data Integration with Cloud Data Fusion

WGAC-GGL-DICDF

Virtual ILT

Language: English

GMT UTC+00:00

Start date:11 May 2023 09:00
End date:12 May 2023 16:30
Duration:2 days

$ 1,210

Virtual Classroom
Open
Google

Data Integration with Cloud Data Fusion

WGAC-GGL-DICDF

Virtual ILT

Language: English

GMT UTC+00:00

Start date:17 Aug 2023 09:00
End date:18 Aug 2023 16:30
Duration:2 days

$ 1,210

Virtual Classroom
Open
Google

Data Integration with Cloud Data Fusion

WGAC-GGL-DICDF

Virtual ILT

Language: English

GMT UTC+00:00

Start date:16 Nov 2023 09:00
End date:17 Nov 2023 16:30
Duration:2 days

$ 1,210

Description

Module 00 - Introduction

(in English)

Module 01 - Introduction to Data Integration and Cloud Data Fusion
  • Data integration: what, why, challenges
  • Data integration tools used in the industry
  • User personas
  • Introduction to cloud-based data fusion
  • Critical Data Integration Capabilities
  • Cloud Data Fusion user interface components
Module 02 - Building Pipelines
  • Cloud Data Fusion architecture
  • Basic concepts
  • Data pipelines and directed acyclic graphs (DAG)
  • Pipeline Life Cycle
  • Designing pipelines in Pipeline Studio
Module 03 - Designing Complex Pipelines
  • Branches, merges and joins
  • Actions and Notifications
  • Error handling and macros Pipeline configurations, scheduling, import and export
Module 04 - Pipeline Execution Environment
  • Scheduling and triggers
  • Runtime environment: Compute profile and provisioners
  • Pipeline Monitoring
Module 05 - Building transformations and preparing data with Wrangler
  • Wrangler
  • Guidelines
  • User-defined directives
Module 06 - Stream Connectors and Pipelines
  • Understand the data integration architecture.
  • List the different connectors.
  • Use the Cloud Data Loss Prevention (DLP) API.
  • Understand the streaming pipeline reference architecture.
  • Build and run a streaming pipeline

.

Module 07 - Metadata and Data Lineage
  • Metadata
  • Data lineage
Module 08 - Summary
  • Course summary

Complete "Fundamentals of Big Data and Machine Learning."

  • Identify the need for data integration,
  • Understand the capabilities of Cloud Data Fusion as a data integration platform,
  • Identify use cases for possible implementation with Cloud Data Fusion,
  • List the major components of Cloud Data Fusion,
  • [Design and execute batch and real-time data processing pipelines,
  • Work with Wrangler to build data transformations.
  • Use connectors to integrate data from different sources and formats,
  • Configure the runtime environment; monitor and troubleshoot pipeline execution,
  • Understand the relationship between metadata and data lineage

.

The description for this course is currently being updated.

Complete "Fundamentals of Big Data and Machine Learning."