Are you interested in this course? Please let us know.
 Book nowWaitinglist
Prices are displayed without VAT by default..
  • Training info
  • Category Big Data
  • Price (excl. VAT)
  • Language {{course.language}}
  • Duration 4 days
  • Time 09:00 - 17:00
  • Lunch Included

Cloudera Data Analyst

Learn the tools data professionals need to access, manipulate, and analyze complex data sets using SQL and familiar scripting languages. This four-day training focuses on Apache Pig, Apache Hive, and Apache Impala (incubating) and alternates between instructional sessions and hands-on labs. Learn how to perform traditional data analytics, ETL (extract-transform-load), and apply business intelligence skills to big data. Xebia Academy is an official training partner of Cloudera, the leader in Apache Hadoop-based software and services.
"Good overview of what is possible with Hadoop, Hive and Pig."  - Elmar Reizen, Data Scientist

Audience Profile: Cloudera Data Analyst

You will benefit from Cloudera Data Analyst training if:

  • You work as a: data analyst, business intelligence specialists, developer, system architect, or database administrator 
  • You have some knowledge of SQL 
  • You have familiarity with basic Linux command-line 
  • You have knowledge of a least one scripting language (such as Bash scripting, Perl, Python, or Ruby)- helpful, but not required
  • No prior knowledge of Hadoop is required

Achievements Upon Completion

This Cloudera Data Analyst training gives you knowledge, experience and skills in the following areas:

You will learn: 

  • Pig, Hive, and Impala features for data acquisition, storage, and analysis
  • The fundamentals of data ETL (extract - transform - load), ingestion, and processing with Hadoop tools
  • How Pig, Hive, and Impala improve productivity for typical analysis tasks
  • How to join diverse datasets to gain valuable business insight
  • Perform real-time, complex queries on datasets

You will gain experience in:

  • Join multiple data sets and analyzing disparate data with Pig
  • Organize data into tables, perform transformations, and simplify complex queries with Hive
  • Make multi-structures data accessible with Hive

You will develop the skills to:

  • Perform real-time interactive analyses on massive datasets stored in HDFS or HBase using SQL with Impala
  • Pick the best analysis tool for any given task in Hadoop
  • Enable real-time interactive analysis of the data stored in Hadoop via a native SQL environment with Cloudera Impala

Additional Information


  • You need your own laptop for this training

Xebia Academy (based in Hilversum, Amsterdam area) is an official training partner of Cloudera, the leader in Apache Hadoop-based software and services.