Cloudera - CCP Data Engineer

Varighed

Varighed:

Kun 3 dage

Metode

Metode:

Klasseværelse / Online / Hybrid

Næste dato

Næste dato:

24/6/2024 (Mandag)

Overview

Learn how to build big data applications to solve real-world problems using Apache Hadoop and associated tools, in just 3-days.

On this accelerated CCP Data Engineer course, you’ll get the knowledge to build and design solutions that can ingest data, determine the appropriate file format for storage, process stored data, and present the results to the end-user. 

You’ll be immersed in your accelerated course with Firebrand’s Lecture | Lab | Review methodology. Get CCP Data Engineer certified - in just 3 days - and join an elite group of data engineers. You’ll also learn how to:

  • Convert data between file formats
  • Purge bad data
  • Filter, sort, join, aggregate and transform complex data sets
  • Create linear and branching workflows that include Hadoop/Hive/Pig jobs

You’ll follow the curriculum for Cloudera’s Designing and Building Big Data Applications course. This includes additional Firebrand material to prepare you for the CCP Data Engineer exam (DE575), which you’ll take as part of your accelerated course. This exam is covered by your Certification Guarantee.

Curriculum

Introduction

Application architecture

  • Scenario explanation
  • Understanding development

Environment

  • Identifying and collecting input data
  • Selecting tools for data processing and analysis
  • Presenting results to the user

Defining and using data sets

  • Metadata management
  • What is Apache Avro?
  • Avro schemas
  • Avro schema evolution
  • Selecting a file format
  • Performance considerations

Using the Kite SDK data module

  • What is the Kite SDK?
  • Fundamental data module concepts
  • Creating new data sets using the Kite SDK
  • Loading, accessing and deleting a data set

Importing relational data with Apache Sqoop

  • What is Apache Sqoop?
  • Basic imports
  • Limiting results
  • Improving Sqoop’s performance
  • Sqoop 2

Capturing data with Apache Flume

  • What is Apache Flume?
  • Basic Flume architecture
  • Flume sources
  • Flume sinks
  • Flume configuration
  • Logging application events to Hadoop

Developing custom Flume components

  • Flume data flow and common extension points
  • Custom Flume sources
  • Developing a flume pollable source
  • Developing a Flume event-driven source
  • Custom Flume interceptors
  • Developing a header-modifying Flume interceptor
  • Developing a filtering flume interceptor
  • Writing Avro objects with a custom Flume interceptor

Managing workflows with Apache Oozie

  • The need for workflow management
  • What is Apache Oozie?
  • Defining an Oozie workflow
  • Validation, packaging and deployment
  • Running and tracking workflows using the CLI
  • Hue UI for Oozie

Processing data pipelines with Apache Crunch

  • What is Apache Crunch?
  • Understanding the runch Pipeline
  • Comparing Crunch to Java MapReduce
  • Working with Crunch Projects
  • Reading and writing Data in Crunch
  • Data collection API
  • Functions
  • Utility classes in the Crunch API

Working with tables in Apache Hive

  • What is Apache Hive?
  • Accessing Hive
  • Basic query syntax
  • Creating and populating Hive Tables
  • How Hive reads data
  • Using the RegexSerDe in Hive

Developing user-defined functions

  • What are user-defined functions?
  • Implementing a user-defined function
  • Deploying custom libraries in hive
  • Registering a user-defined function in Hive

Executing interactive queries with Impala

  • What is Impala?
  • Comparing Hive to Impala
  • Running queries in Impala
  • Support for user-defined functions
  • Data and metadata management

Understanding Cloudera Search

  • What is Cloudera Search?
  • Search architecture
  • Supported document formats

Indexing data with Cloudera Search

  • Collection and schema management
  • Morphlines
  • Indexing data in batch mode
  • Indexing data in near real time

Presenting results to users

  • Solr query syntax
  • Building a search UI with Hue
  • Accessing Impala through JDBC
  • Powering a custom web application with Impala and Search

Exam Track

You'll sit the following exam at the Firebrand Training Centre, covered by your Certification Guarantee:

  • CCP Data Engineer Exam (DE575)

You will be provided with five to eight customer problems, each with a large, unique data set and a CDH cluster. You will then have four hours in which to implement a technical solution to each problem that meets all functional requirements.

Additional information:

  • This is a hands-on practical exam using Cloudera technologies
  • You’ll get your own pre-loaded CDH cluster that includes:
    • Spark
    • Impala
    • Crunch
    • Hive
    • Pig
    • Sqoop
    • Kafka
    • Flume
    • Kite
    • Hue
    • Oozie
    • DataFu
  • Your CCP certification is valid for three years

What's Included

Det hele er inkluderet! Du får en alt-inklusiv kursuspakke, som er målrettet til dine behov. Vi tager os af enhver detalje, så det eneste du skal fokusere på er dine lærings- og certificeringsmål.

  • Transport til/fra specifikke afhentningssteder
  • Overnatninger, samtlige måltider samt adgang til forfriskninger, snacks, kaffe og the.
  • Intensiv Hands-on uddannelse med vores unikke (Lecture | Lab | Review)TM metode
  • Omfattende kursusmaterialer og labmanualer
  • Et helt igennem instruktørstyret program
  • 24 timers adgang til både undervisningslokale og instruktøren
  • Samtlige måltider samt adgang til forfriskninger, snacks, kaffe og the.
  • Certificeringsgaranti

Prerequisites

You should possess in-depth experience developing data engineer solutions and a high-level working knowledge of data analysis. 

Er du klar til dit Firebrand Kursus?

Vi interviewer alle potentielle deltagere angående deres baggrund, uddannelser, certificeringer og personlig indstilling. Hvis du kommer igennem denne screeningsprocedure, betyder det, at du har rigtig gode chancer for at bestå.

Firebrand Training tilbyder et ambitiøst uddannelsesmiljø, som forudsætter at du dedikerer dig til kurset. Ovenstående forkundskaber er vejledende; mange deltagere med mindre erfaring, men med en anden baggrund eller færdigheder, har haft succes med accelereret uddannelse hos Firebrand Training.

Hvis du funderer på hvorvidt du opfylder de anbefalede forkundskaber, er du meget velkommen til at ringe til os på 89 88 66 05 og tale med en af vores uddannelsesrådgivere, som kan hjælpe dig.

Kundereferencer

Her er Firebrand Training review afsnit. Siden 2001 har vi trænet præcist 134.561 studerende og professionelle og bedt dem alle om at gennemgå vores Accelerated Learning. Lige nu har 96,41% sagt, at Firebrand har overgået deres forventninger.

Læs anmeldelser fra de seneste accelererede kurser nedenfor, eller besøg Firebrand Stories for skriftlige og videointerviews med vores alumner.


"I had a great time. The instructor is very knowledgeable and his training is on point. Two exams in 2 days is a lot. But he makes it doable."
Juan van Gom, Ministry of Defense Netherlands. (19/1/2024 (Fredag) til 21/1/2024 (Søndag))

"Detailed and thorough training in a great environment."
Anonymous, SYP. (8/1/2024 (Mandag) til 12/1/2024 (Fredag))

"I attended the Cyber Crime Foundation course with very little knowledge.. The instructor was fantastic, explained subjects in depth and allowed a good amount of time for questions. I came away with lots of knowledge ready to use what I've learnt in my day-to-day role."
JS. (8/1/2024 (Mandag) til 12/1/2024 (Fredag))

"I very much enjoyed the training! It is a must for police officers new to Cyber Crime Units and is delivered at a good pace. Very good trainer with exceptional knowledge and enthusiasm."
Steve Lloyd, West Mercia Police. (8/1/2024 (Mandag) til 12/1/2024 (Fredag))

"I have really enjoyed the course, although I have been a self-professed nerd for many years, there was still some knowledge I took from this course. The tutorials on practical applications is something I will take away and use."
Joseph Ingram-Gettins, West Midlands Police. (8/1/2024 (Mandag) til 12/1/2024 (Fredag))

Kursusdatoer

Starter

Slutter

Tilgængelighed

Sted

Tilmelding

19/2/2024 (Mandag)

21/2/2024 (Onsdag)

Afsluttet - Giv feedback

-

 

24/6/2024 (Mandag)

26/6/2024 (Onsdag)

Venteliste

Landsdækkende

 

5/8/2024 (Mandag)

7/8/2024 (Onsdag)

Begrænsede pladser

Landsdækkende

 

16/9/2024 (Mandag)

18/9/2024 (Onsdag)

Tilgængelige pladser

Landsdækkende

 

28/10/2024 (Mandag)

30/10/2024 (Onsdag)

Tilgængelige pladser

Landsdækkende

 

9/12/2024 (Mandag)

11/12/2024 (Onsdag)

Tilgængelige pladser

Landsdækkende

 

Seneste anmeldelser fra vores studerende