Cloudera - CCP Data Engineer

Duration: Just 3 days

Method: Classroom / Online / Hybrid

Next date: 24/6/2024 (Monday)

Overview

Learn how to build big data applications to solve real-world problems using Apache Hadoop and associated tools, in just 3 days.

On this accelerated CCP Data Engineer course, you’ll gain the knowledge to design and build solutions that can ingest data, determine the appropriate file format for storage, process stored data, and present the results to the end user.

You’ll be immersed in your accelerated course with Firebrand’s Lecture | Lab | Review methodology. Get CCP Data Engineer certified in just 3 days and join an elite group of data engineers. You’ll also learn how to:

  • Convert data between file formats
  • Purge bad data
  • Filter, sort, join, aggregate and transform complex data sets
  • Create linear and branching workflows that include Hadoop/Hive/Pig jobs

You’ll follow the curriculum for Cloudera’s Designing and Building Big Data Applications course. This includes additional Firebrand material to prepare you for the CCP Data Engineer exam (DE575), which you’ll take as part of your accelerated course. This exam is covered by your Certification Guarantee.

Four reasons why you should sit your CCP Data Engineer course with Firebrand Training

  1. You'll be CCP Data Engineer trained and certified faster. Learn more on this 3-day accelerated course. You'll get at least 12 hours a day of quality learning time in a distraction-free environment
  2. Your CCP Data Engineer course is all-inclusive. One simple price covers all course materials, exams, accommodation and meals – so you can focus on learning
  3. Pass CCP Data Engineer first time or train again for free. Your expert instructor will deliver our unique accelerated learning methods, allowing you to learn faster and be in the best possible position to pass first time. In the unlikely event that you don't, it's covered by your Certification Guarantee
  4. Study CCP Data Engineer with an award-winning training provider. We've won the Learning and Performance Institute's "Training Company of the Year" three times. Firebrand is your fastest way to learn, with 134,561 students saving more than one million hours since 2001

Curriculum

Introduction

Application architecture

  • Scenario explanation
  • Understanding the development environment
  • Identifying and collecting input data
  • Selecting tools for data processing and analysis
  • Presenting results to the user

Defining and using data sets

  • Metadata management
  • What is Apache Avro?
  • Avro schemas
  • Avro schema evolution
  • Selecting a file format
  • Performance considerations

Using the Kite SDK data module

  • What is the Kite SDK?
  • Fundamental data module concepts
  • Creating new data sets using the Kite SDK
  • Loading, accessing and deleting a data set

Importing relational data with Apache Sqoop

  • What is Apache Sqoop?
  • Basic imports
  • Limiting results
  • Improving Sqoop’s performance
  • Sqoop 2

Capturing data with Apache Flume

  • What is Apache Flume?
  • Basic Flume architecture
  • Flume sources
  • Flume sinks
  • Flume configuration
  • Logging application events to Hadoop

Developing custom Flume components

  • Flume data flow and common extension points
  • Custom Flume sources
  • Developing a Flume pollable source
  • Developing a Flume event-driven source
  • Custom Flume interceptors
  • Developing a header-modifying Flume interceptor
  • Developing a filtering Flume interceptor
  • Writing Avro objects with a custom Flume interceptor

Managing workflows with Apache Oozie

  • The need for workflow management
  • What is Apache Oozie?
  • Defining an Oozie workflow
  • Validation, packaging and deployment
  • Running and tracking workflows using the CLI
  • Hue UI for Oozie

Processing data pipelines with Apache Crunch

  • What is Apache Crunch?
  • Understanding the Crunch pipeline
  • Comparing Crunch to Java MapReduce
  • Working with Crunch projects
  • Reading and writing data in Crunch
  • Data collection API
  • Functions
  • Utility classes in the Crunch API

Working with tables in Apache Hive

  • What is Apache Hive?
  • Accessing Hive
  • Basic query syntax
  • Creating and populating Hive tables
  • How Hive reads data
  • Using the RegexSerDe in Hive

Developing user-defined functions

  • What are user-defined functions?
  • Implementing a user-defined function
  • Deploying custom libraries in Hive
  • Registering a user-defined function in Hive

Executing interactive queries with Impala

  • What is Impala?
  • Comparing Hive to Impala
  • Running queries in Impala
  • Support for user-defined functions
  • Data and metadata management

Understanding Cloudera Search

  • What is Cloudera Search?
  • Search architecture
  • Supported document formats

Indexing data with Cloudera Search

  • Collection and schema management
  • Morphlines
  • Indexing data in batch mode
  • Indexing data in near real time

Presenting results to users

  • Solr query syntax
  • Building a search UI with Hue
  • Accessing Impala through JDBC
  • Powering a custom web application with Impala and Search

Exam Track

You'll sit the following exam at the Firebrand Training Centre, covered by your Certification Guarantee:

  • CCP Data Engineer Exam (DE575)

You will be provided with five to eight customer problems, each with a large, unique data set and a CDH cluster. You will then have four hours in which to implement a technical solution to each problem that meets all functional requirements.

Additional information:

  • This is a hands-on practical exam using Cloudera technologies
  • You’ll get your own pre-loaded CDH cluster that includes:
    • Spark
    • Impala
    • Crunch
    • Hive
    • Pig
    • Sqoop
    • Kafka
    • Flume
    • Kite
    • Hue
    • Oozie
    • DataFu
  • Your CCP certification is valid for three years

What's Included

Your accelerated course includes:

  • Accommodation *
  • Meals, unlimited snacks, beverages, tea and coffee *
  • On-site exams **
  • Exam vouchers **
  • Practice tests **
  • Certification Guarantee ***
  • Courseware
  • Up to 12 hours of instructor-led training each day
  • 24-hour lab access
  • Digital courseware **
  * For residential training only. Accommodation is included from the night before the course starts. This doesn't apply to online courses.
  ** Some exceptions apply. Please refer to the Exam Track or speak with our experts.
  *** Pass first time or train again free as many times as it takes, unlimited for 1 year. Just pay for accommodation, exams, and incidental costs.

Prerequisites

You should possess in-depth experience developing data engineering solutions and a high-level working knowledge of data analysis.

Are you ready to get certified in record time?

We interview all applicants for the course on their technical background, degrees and certifications held, and general suitability. If you get through this screening process, it means you stand a great chance of passing.

Firebrand Training is an immersive training environment. You must be committed to the course. The above prerequisites are guidelines, but many students with less experience have other backgrounds or traits that have enabled their success in accelerated training through Firebrand Training.

If you have any doubts as to whether you meet the prerequisites, please call 21 96 61 82 and speak to one of our enrolment consultants, who can help you with a training plan.

Reviews

We've currently trained 134,561 students in 12 years. We asked them all to review our Accelerated Learning. Currently, 96.41% have said Firebrand exceeded their expectations:

"I had a great time. The instructor is very knowledgeable and his training is on point. Two exams in 2 days is a lot. But he makes it doable."
Juan van Gom, Ministry of Defense Netherlands. (19/1/2024 (Friday) to 21/1/2024 (Sunday))

"Detailed and thorough training in a great environment."
Anonymous, SYP. (8/1/2024 (Monday) to 12/1/2024 (Friday))

"I attended the Cyber Crime Foundation course with very little knowledge. The instructor was fantastic, explained subjects in depth and allowed a good amount of time for questions. I came away with lots of knowledge ready to use what I've learnt in my day-to-day role."
JS. (8/1/2024 (Monday) to 12/1/2024 (Friday))

"I very much enjoyed the training! It is a must for police officers new to Cyber Crime Units and is delivered at a good pace. Very good trainer with exceptional knowledge and enthusiasm."
Steve Lloyd, West Mercia Police. (8/1/2024 (Monday) to 12/1/2024 (Friday))

"I have really enjoyed the course, although I have been a self-professed nerd for many years, there was still some knowledge I took from this course. The tutorials on practical applications is something I will take away and use."
Joseph Ingram-Gettins, West Midlands Police. (8/1/2024 (Monday) to 12/1/2024 (Friday))

Course Dates

Start                Finish                  Status                Location
19/2/2024 (Monday)   21/2/2024 (Wednesday)   Finished              -
24/6/2024 (Monday)   26/6/2024 (Wednesday)   Wait list             Nationwide
5/8/2024 (Monday)    7/8/2024 (Wednesday)    Limited availability  Nationwide
16/9/2024 (Monday)   18/9/2024 (Wednesday)   Open                  Nationwide
28/10/2024 (Monday)  30/10/2024 (Wednesday)  Open                  Nationwide
9/12/2024 (Monday)   11/12/2024 (Wednesday)  Open                  Nationwide
