Cloudera Developer Training for Spark & Hadoop

Learn how Apache Spark integrates with the entire Hadoop ecosystem. This four-day training taught in English gives you the skills you need to ingest data on a Hadoop cluster and process it with Spark, Hive, Flume, Sqoop, Impala, and other Hadoop ecosystem tools. Through instructor-led discussions and interactive, hands-on exercises, you learn to identify the right tool(s) for any situation, and how to use them. You’ll walk away from this training with the practical knowledge you need to tackle the real-world challenges Hadoop developers face every day.
“The training was interesting, and the trainers were very knowledgeable.” - Data Scientist

Q: Is Cloudera Developer for Spark & Hadoop training right for me?

  • Yes - if you work as a developer or engineer
  • Yes - if you have programming experience
  • Yes - if you can program in Scala and/or Python (Apache Spark examples and hands-on exercises are presented in those languages)
  • Yes - if you have basic familiarity with the Linux command line
  • Yes - if you have some knowledge of SQL
  • Prior knowledge of Hadoop is NOT required

Q: What will I achieve by completing this training?

Cloudera Developer for Spark & Hadoop gives you skills, knowledge, tools and training in the following areas:

You will learn:

  • How data is distributed, stored, and processed in a Hadoop cluster
  • How to use Sqoop and Flume to ingest data
  • How to process distributed data with Apache Spark
  • How to model structured data as tables in Impala and Hive
  • How to choose the best data storage format for different data usage patterns
  • Best practices for data storage

You will gain experience in:

  • Know the right tool(s) for any situation, and how to use them
  • Best practices for data storage
  • Work with RDDs in Spark

You will develop the skills to:

  • Process distributed data with Apache Spark
  • Choose the best data storage format for different data usage patterns

Q: What else should I know?

Requirements

  • Please bring your own laptop

Certification

  • Xebia Academy (based in Hilversum, Amsterdam) is an official training partner of Cloudera, the leader in Apache Hadoop-based software and services.

Tell us what you need

Interested in this training, but looking for a customized, in-company course that fits your business best? We are here to help you succeed.

Or call Xebia Academy at +31 35 538 1921
Sales Team