Cloudera Data Analyst

Learn the tools data professionals need to access, manipulate, and analyze complex data sets using SQL and familiar scripting languages. This four-day training focuses on Apache Pig, Apache Hive, and Apache Impala (incubating) and alternates between instructional sessions and hands-on labs. Learn how to perform traditional data analytics and ETL (extract-transform-load), and how to apply business intelligence skills to big data. Xebia Academy is an official training partner of Cloudera, the leader in Apache Hadoop-based software and services.

"Good overview of what is possible with Hadoop, Hive and Pig."  - Elmar Reizen, Data Scientist

Audience Profile: Cloudera Data Analyst

You will benefit from Cloudera Data Analyst training if:

  • You work as a data analyst, business intelligence specialist, developer, system architect, or database administrator
  • You have some knowledge of SQL
  • You are familiar with the basic Linux command line
  • You have knowledge of at least one scripting language, such as Bash, Perl, Python, or Ruby (helpful, but not required)
  • No prior knowledge of Hadoop is required

Achievements Upon Completion

This Cloudera Data Analyst training gives you knowledge, experience and skills in the following areas:

You will learn: 

  • Pig, Hive, and Impala features for data acquisition, storage, and analysis
  • The fundamentals of data ETL (extract-transform-load), ingestion, and processing with Hadoop tools
  • How Pig, Hive, and Impala improve productivity for typical analysis tasks
  • How to join diverse datasets to gain valuable business insight
  • How to perform real-time, complex queries on datasets

You will gain experience in:

  • Joining multiple data sets and analyzing disparate data with Pig
  • Organizing data into tables, performing transformations, and simplifying complex queries with Hive
  • Making multi-structured data accessible with Hive

You will develop the skills to:

  • Perform real-time interactive analyses on massive datasets stored in HDFS or HBase using SQL with Impala
  • Pick the best analysis tool for any given task in Hadoop
  • Enable real-time interactive analysis of the data stored in Hadoop via a native SQL environment with Cloudera Impala

ADDITIONAL INFORMATION

Requirements

  • You need your own laptop for this training

Xebia Academy (based in Hilversum, Amsterdam area) is an official training partner of Cloudera, the leader in Apache Hadoop-based software and services.

http://training.xebia.com/big-data/cloudera-data-analyst-training