Cloudera - CCA Data Analyst

Duration

Duration:

Only 3 Days

Method

Method:

Classroom / Online / Hybrid

Next date

Next date:

24.6.2024 (Monday)

Overview

On this accelerated 3-day Cloudera CCA Data Analyst course, you'll get the skills you need to apply traditional data analytics and business intelligence skills to big data.

Your expert instructor will introduce you to the tools and techniques you need to access, manipulate, transform, and analyse complex data sets using SQL and familiar scripting languages.

You'll learn topics such as:

  • The features that Pig, Hive, and Impala offer for data acquisition, storage, and analysis
  • The fundamentals of Apache Hadoop and data ETL (extract, transform, load), ingestion, and processing with Hadoop
  • How Pig, Hive, and Impala improve productivity for typical analysis tasks
  • Joining diverse datasets to gain valuable business insight
  • Performing real-time, complex queries on datasets

Access to 24/7 labs means that you can test your hands-on skills in navigating the Hadoop ecosystem whenever you like. Through our unique Lecture | Lab | Review technique, you'll gain Apache Hadoop skills faster.

On this course, you'll prepare for and sit the CCA Data Analyst exam, covered by your Certification Gurantee.

If you're a data analyst, business intelligence specialist, developer, system architect or database administrator, this course is ideal for you.

Four reasons why you should sit your CCA Data Analyst course with Firebrand Training

  1. You'll be CCA Data Analyst trained and certified faster. Learn more on this 3-day accelerated course. You'll get at least 12 hours a day of quality learning time in a distraction-free environment
  2. Your CCA Data Analyst course is all-inclusive. One simple price covers all course materials, exams, accommodation and meals – so you can focus on learning
  3. Pass CCA Data Analyst first time or train again for free. Your expert instructor will deliver our unique accelerated learning methods, allowing you to learn faster and be in the best possible position to pass first time. In the unlikely event that you don't, it's covered by your Certification Guarantee
  4. Study CCA Data Analyst with an award-winning training provider. We've won the Learning and Performance Institute's "Training Company of the Year" three times. Firebrand is your fastest way to learn, with 134561 students saving more than one million hours since 2001

Benefits

Seven reasons why you should sit your course with Firebrand Training

  1. Two options of training. Choose between residential classroom-based, or online courses
  2. You'll be certified fast. With us, you’ll be trained in record time
  3. Our course is all-inclusive. A one-off fee covers all course materials, exams**, accommodation* and meals*. No hidden extras.
  4. Pass the first time or train again for free. This is our guarantee. We’re confident you’ll pass your course the first time. But if not, come back within a year and only pay for accommodation, exams and incidental costs
  5. You’ll learn more. A day with a traditional training provider generally runs from 9 am – 5 pm, with a nice long break for lunch. With Firebrand Training you’ll get at least 12 hours/day of quality learning time, with your instructor
  6. You’ll learn faster. Chances are, you’ll have a different learning style to those around you. We combine visual, auditory and tactile styles to deliver the material in a way that ensures you will learn faster and more easily
  7. You’ll be studying with the best. We’ve been named in the Training Industry’s “Top 20 IT Training Companies of the Year” every year since 2010. As well as winning many more awards, we’ve trained and certified over 135,000 professionals
  • * For residential training only. Doesn't apply for online courses
  • ** Some exceptions apply. Please refer to the Exam Track or speak with our experts

Curriculum

Introduction Apache Hadoop Fundamentals

  • The Motivation for Hadoop
  • Hadoop Overview
  • Data Storage: HDFS
  • Distributed Data Processing: YARN, MapReduce, and Spark
  • Data Processing and Analysis: Pig, Hive, and Impala
  • Database Integration: Sqoop
  • Other Hadoop Data Tools
  • Exercise Scenarios

Introduction to Apache Pig

  • What is Pig?
  • Pig's Features
  • Pig Use Cases
  • Interacting with Pig

Basic Data Analysis with Apache Pig

  • Pig Latin Syntax
  • Loading Data
  • Simple Data Types
  • Field Definitions
  • Data Output
  • Viewing the Schema
  • Filtering and Sorting Data
  • Commonly Used Functions

Processing Complex Data with Apache Pig

  • Storage Formats
  • Complex/Nested Data Types
  • Grouping
  • Built-In Functions for Complex Data
  • Iterating Grouped Data

Multi-Dataset Operations with Apache Pig

  • Techniques for Combining Datasets
  • Joining Datasets in Pig
  • Set Operations
  • Splitting Datasets

Apache Pig Troubleshooting and Optimisation

  • Troubleshooting Pig
  • Logging
  • Using Hadoop's Web UI
  • Data Sampling and Debugging
  • Performance Overview
  • Understanding the Execution Plan
  • Tips for Improving the Performance of Pig Jobs

Introduction to Apache Hive and Impala

  • What is Hive?
  • What is Impala?
  • Why Use Hive and Impala?
  • Schema and Data Storage
  • Comparing Hive and Impala to Traditional Databases
  • Use Cases

Querying with Apache Hive and Impala

  • Databases and Tables
  • Basic Hive and Impala Query Language Syntax
  • Data Types
  • Using Hue to Execute Queries
  • Using Beeline (Hive's Shell)
  • Using the Impala Shell

Apache Hive and Impala Data Management

  • Data Storage
  • Creating Databases and Tables
  • Loading Data
  • Altering Databases and Tables
  • Simplifying Queries with Views
  • Storing Query Results

Data Storage and Performance

  • Partitioning Tables
  • Loading Data into Partitioned Tables
  • When to Use Partitioning
  • Choosing a File Format
  • Using Avro and Parquet File Formats

Relational Data Analysis with Apache Hive and Impala

  • Joining Datasets
  • Common Built-In Functions
  • Aggregation and Windowing

Complex Data with Apache Hive and Impala

  • Complex Data with Hive
  • Complex Data with Impala

Analysing Text with Apache Hive and Impala

  • Using Regular Expressions with
  • Hive and Impala
  • Processing Text Data with SerDes in Hive
  • Sentiment Analysis and n-grams in Hive

Apache Hive Optimisation

  • Understanding Query Performance
  • Bucketing
  • Indexing Data
  • Hive on Spark

Apache Impala Optimisation

  • How Impala Executes Queries
  • Improving Impala Performance

Extending Apache Hive and Impala

  • Custom SerDes and File Formats in Hive
  • Data Transformation with
    • Custom Scripts in Hive
    • User-Defined Functions
    • Parameterised Queries

Choosing the Best Tool for the Job

  • Comparing Pig, Hive, Impala, and Relational Databases

Exam Track

On this course, you'll prepare for and take the following exam at the Firebrand Training centre, covered by your Certification Guarantee.

CCA Data Analyst Exam (CCA159)

  • Number of questions: 8-12
  • Format: performance-based
  • Duration: 120 minutes
  • Passing Score: 70%

What's Included

On this course, you'll receive:

  • Official Cloudera Data Analyst courseware

Your accelerated course includes:

  • Accommodation *
  • Meals, unlimited snacks, beverages, tea and coffee *
  • On-site exams **
  • Exam vouchers **
  • Practice tests **
  • Certification Guarantee ***
  • Courseware
  • Up-to 12 hours of instructor-led training each day
  • 24-hour lab access
  • Digital courseware **
  • * For residential training only. Accommodation is included from the night before the course starts. This doesn't apply for online courses.
  • ** Some exceptions apply. Please refer to the Exam Track or speak with our experts
  • *** Pass first time or train again free as many times as it takes, unlimited for 1 year. Just pay for accommodation, exams, and incidental costs.

Prerequisites

Before attending this course, you should have knowledge of:

  • SQL
  • Linux command line
  • At least one scripting language (e.g., Bash scripting, Perl, Python, Ruby).

You don't need to have experience in Apache Hadoop.

Unsure whether you meet the prerequisites? Don’t worry. Your training consultant will discuss your background with you to understand if this course is right for you.

Reviews

Here's the Firebrand Training review section. Since 2001 we've trained exactly 134561 students and asked them all to review our Accelerated Learning. Currently, 96.41% have said Firebrand exceeded their expectations.

Read reviews from recent accelerated courses below or visit Firebrand Stories for written and video interviews from our alumni.


"Course material is very comprehensive. Instructor was very friendly and gave multiple views on the topics being discussed and included a good amount of examples."
D. E. . (30.10.2023 (Monday) to 1.11.2023 (Wednesday))

"Course material is very comprehensive. Instructor was very friendly and gave multiple views on the topics being discussed and included a good amount of examples."
D. E. . (30.10.2023 (Monday) to 1.11.2023 (Wednesday))

"Great trainer who has plenty of experience that he shared with us! Valuable insights and practical examples were provided that will help me with my daily tasks! Thank you!"
K. S. . (30.10.2023 (Monday) to 1.11.2023 (Wednesday))

"Great trainer who has plenty of experience that he shared with us! Valuable insights and practical examples were provided that will help me with my daily tasks! Thank you!"
K. S. . (30.10.2023 (Monday) to 1.11.2023 (Wednesday))

"I was worried about doing an OIL course but i was pleasantly surprised at how easy and effective it was. I normally struggle with concentration but the instructor was engaging the whole way through and made learning very easy."
Rachel Honnor. (30.10.2023 (Monday) to 1.11.2023 (Wednesday))

Course Dates

Start

Finish

Status

Location

Book now

19.2.2024 (Monday)

21.2.2024 (Wednesday)

Finished - Leave feedback

-

 

24.6.2024 (Monday)

26.6.2024 (Wednesday)

Wait list

Nationwide

 

5.8.2024 (Monday)

7.8.2024 (Wednesday)

Limited availability

Nationwide

 

16.9.2024 (Monday)

18.9.2024 (Wednesday)

Open

Nationwide

 

28.10.2024 (Monday)

30.10.2024 (Wednesday)

Open

Nationwide

 

9.12.2024 (Monday)

11.12.2024 (Wednesday)

Open

Nationwide

 

Latest Reviews from our students