Cloudera - CCP Data Engineer

Varighet

Varighet:

Bare 3 dager

Metode

Metode:

klasserommet / på nett / Hybrid

Neste dato

Neste dato:

24/6/2024 (Mandag)

Overview

Learn how to build big data applications to solve real-world problems using Apache Hadoop and associated tools, in just 3-days.

On this accelerated CCP Data Engineer course, you’ll get the knowledge to build and design solutions that can ingest data, determine the appropriate file format for storage, process stored data, and present the results to the end-user. 

You’ll be immersed in your accelerated course with Firebrand’s Lecture | Lab | Review methodology. Get CCP Data Engineer certified - in just 3 days - and join an elite group of data engineers. You’ll also learn how to:

  • Convert data between file formats
  • Purge bad data
  • Filter, sort, join, aggregate and transform complex data sets
  • Create linear and branching workflows that include Hadoop/Hive/Pig jobs

You’ll follow the curriculum for Cloudera’s Designing and Building Big Data Applications course. This includes additional Firebrand material to prepare you for the CCP Data Engineer exam (DE575), which you’ll take as part of your accelerated course. This exam is covered by your Certification Guarantee.

Her er 8 grunner til hvorfor du skal gjennomføre ditt CCP Data Engineer hos Firebrand Training:

  1. Du blir utdannet og sertifisert på bare 3 dager. Hos oss får du din utdanning og sertifisering på rekordtid, en sertifisering du også gjennomfører der og da som en integrert del av den intensive, akselererte utdanningen.
  2. Alt er inkludert. Et engangsbeløp dekker alt kursmaterial, eksamen, kost og losji og tilbyr den mest kostnadseffektive måten å gjennomføre ditt CCP Data Engineer kurs og sertifisering på. Og dette uten noen uannonserte ytterligere kostnader.
  3. Du klarer sertifiseringen første gangen eller kan gå kurset om igjen kostnadsfritt. Det er vår garanti. Vi er sikre på at du vil klare din CCP Data Engineer sertifisering første gangen. Men skulle du mot formodning ikke gjøre det kan du innen et år komme tilbake og kun betale for eventuelle overnattinger og din eksamen. Alt annet er gratis.
  4. Du lærer deg mer.Tradisjonelle utdanningsdager varer fra kl. 09.00 til 16.00 med lange lunsj- og kaffepauser. Hos Firebrand Training får du minst 12 timers effektiv og fokusert kvalitetsutdanning hver dag sammen med din instruktør, uten private eller arbeidsrelaterte, forstyrrende momenter.
  5. Du lærer deg CCP Data Engineer raskere. Vi kombinerer de tre innlæringsmetodene (Presentasjon |Øving| Diskusjon) slik at vi gjennomfører kurset på en måte som sikrer at du lærer deg raskere og lettere.
  6. Du er i sikre hender.Vi har utdannet og sertifisert 134.561 personer, vi er partner med alle de store navn i bransjen og vi har vunnet atskillige utmerkelser, bla. a. "Årets Learning Partner 2010, 2011, 2012, 2013 og 2015” fra Microsoft Danmark og med en vekst på 1430 % siden 2009 er vi årets Gazelle prisvinner på Sjælland, Danmark.
  7. Du lærer deg ikke bare teorien. Vi har videreutviklet CCP Data Engineer kursen og tilbyr flere praktiske øvelser og sikrer på den måten, at du kan bruke dine ferdigheter for å løse daglige praktiske problemstillinger.
  8. Du lærer av de beste. Våre instruktører på CCP Data Engineer er de beste i bransjen og tilbyr en helt unik blanding av kunnskap, praktisk erfaring og pasjon for å lære bort.

Curriculum

Introduction

Application architecture

  • Scenario explanation
  • Understanding development

Environment

  • Identifying and collecting input data
  • Selecting tools for data processing and analysis
  • Presenting results to the user

Defining and using data sets

  • Metadata management
  • What is Apache Avro?
  • Avro schemas
  • Avro schema evolution
  • Selecting a file format
  • Performance considerations

Using the Kite SDK data module

  • What is the Kite SDK?
  • Fundamental data module concepts
  • Creating new data sets using the Kite SDK
  • Loading, accessing and deleting a data set

Importing relational data with Apache Sqoop

  • What is Apache Sqoop?
  • Basic imports
  • Limiting results
  • Improving Sqoop’s performance
  • Sqoop 2

Capturing data with Apache Flume

  • What is Apache Flume?
  • Basic Flume architecture
  • Flume sources
  • Flume sinks
  • Flume configuration
  • Logging application events to Hadoop

Developing custom Flume components

  • Flume data flow and common extension points
  • Custom Flume sources
  • Developing a flume pollable source
  • Developing a Flume event-driven source
  • Custom Flume interceptors
  • Developing a header-modifying Flume interceptor
  • Developing a filtering flume interceptor
  • Writing Avro objects with a custom Flume interceptor

Managing workflows with Apache Oozie

  • The need for workflow management
  • What is Apache Oozie?
  • Defining an Oozie workflow
  • Validation, packaging and deployment
  • Running and tracking workflows using the CLI
  • Hue UI for Oozie

Processing data pipelines with Apache Crunch

  • What is Apache Crunch?
  • Understanding the runch Pipeline
  • Comparing Crunch to Java MapReduce
  • Working with Crunch Projects
  • Reading and writing Data in Crunch
  • Data collection API
  • Functions
  • Utility classes in the Crunch API

Working with tables in Apache Hive

  • What is Apache Hive?
  • Accessing Hive
  • Basic query syntax
  • Creating and populating Hive Tables
  • How Hive reads data
  • Using the RegexSerDe in Hive

Developing user-defined functions

  • What are user-defined functions?
  • Implementing a user-defined function
  • Deploying custom libraries in hive
  • Registering a user-defined function in Hive

Executing interactive queries with Impala

  • What is Impala?
  • Comparing Hive to Impala
  • Running queries in Impala
  • Support for user-defined functions
  • Data and metadata management

Understanding Cloudera Search

  • What is Cloudera Search?
  • Search architecture
  • Supported document formats

Indexing data with Cloudera Search

  • Collection and schema management
  • Morphlines
  • Indexing data in batch mode
  • Indexing data in near real time

Presenting results to users

  • Solr query syntax
  • Building a search UI with Hue
  • Accessing Impala through JDBC
  • Powering a custom web application with Impala and Search

Exam Track

You'll sit the following exam at the Firebrand Training Centre, covered by your Certification Guarantee:

  • CCP Data Engineer Exam (DE575)

You will be provided with five to eight customer problems, each with a large, unique data set and a CDH cluster. You will then have four hours in which to implement a technical solution to each problem that meets all functional requirements.

Additional information:

  • This is a hands-on practical exam using Cloudera technologies
  • You’ll get your own pre-loaded CDH cluster that includes:
    • Spark
    • Impala
    • Crunch
    • Hive
    • Pig
    • Sqoop
    • Kafka
    • Flume
    • Kite
    • Hue
    • Oozie
    • DataFu
  • Your CCP certification is valid for three years

What's Included

Prerequisites

You should possess in-depth experience developing data engineer solutions and a high-level working knowledge of data analysis. 

Anmeldelser

Vi har lært opp 134.561 personer på 12 år. Vi ba dem om å anmelde vår akselererte opplæring. Akkurat nå har 96,41% angitt at Firebrand overgikk forventningene:

"Very good Resources within the E-Book, Mixed with the Lessons, it is also has good Labs to get hands on Practice and put the theory somewhere practical to get full learning experience, with very high standard explanation throughout the lectures. 10/10!"
Rhys Thomas, Nokia. (7/5/2024 (Tirsdag) til 10/5/2024 (Fredag))

"All the staff are very attentive. A fantastic instructor that was able to answer all questions whilst keeping the sessions informative."
Ash Petch, Cepac. (7/5/2024 (Tirsdag) til 10/5/2024 (Fredag))

"My experience at Firebrand was brilliant. The people here are nice and approachable among the other apprentices of whom also are very nice and made great friends here too! I am looking forward to my next 2 trips here!"
Lewis Vick, Computacenter. (7/5/2024 (Tirsdag) til 10/5/2024 (Fredag))

"Good course lecturer. It was good to go over the whole content of Server+ and hear relevant examples."
Anonymous. (7/5/2024 (Tirsdag) til 10/5/2024 (Fredag))

"It was lovely being here as there was lots of new learning"
Imran Riaz, Watford Boys Grammar School. (7/5/2024 (Tirsdag) til 10/5/2024 (Fredag))

Kursdatoer

Start

Slutt

Kapasitet

Plass

Registrer deg

19/2/2024 (Mandag)

21/2/2024 (Onsdag)

Ferdig - Gi tilbakemelding

-

 

24/6/2024 (Mandag)

26/6/2024 (Onsdag)

Venteliste

Landsdekkende

 

5/8/2024 (Mandag)

7/8/2024 (Onsdag)

Begrenset kapasitet

Landsdekkende

 

16/9/2024 (Mandag)

18/9/2024 (Onsdag)

Ledige plasser

Landsdekkende

 

28/10/2024 (Mandag)

30/10/2024 (Onsdag)

Ledige plasser

Landsdekkende

 

9/12/2024 (Mandag)

11/12/2024 (Onsdag)

Ledige plasser

Landsdekkende

 

Siste anmeldelser fra studenten vår