Book nowWaitinglist
Prices are displayed without VAT by default.

Cloudera Data Science at Scale using Spark and Hadoop

Learn how data scientists use Spark and Hadoop to help companies reduce cost, increase profit, improve products, retain customers, and identify new opportunities. This three-day training taught in English alternates between instructional sessions and interactive, hands-on learning labs to teach you how to use data science to achieve impactful results. Xebia Academy (based in Hilversum, Amsterdam area) is an official training partner of Cloudera, the leader in Apache Hadoop-based software and services.

Audience Profile: Cloudera Data Science at Scale using Spark and Hadoop training

You will benefit most from this training if:

  • you are a developer, data analyst or statistician
  • you have basic knowledge of Apache Hadoop: HDFS, MapReduce, Hadoop Streaming, and Apache Hive
  • you have proficiency in a scripting language.

Experience with Linux environments and Python is strongly preferred, but familiarity with Perl or Ruby is sufficient.

Achievements Upon Completion

Through instructor-led discussion and interactive, hands-on exercises, participants will achieve the following outcomes:

You'll learn:

  • how to identify the potential business use cases where data science can provide impactful results
  • how to obtain, clean and combine disparate data sources to create a coherent picture for analysis
  • which statistical methods to leverage for data exploration that provides critical insight
  • where and when to leverage Hadoop streaming and Apache Spark for data science pipelines
  • how to choose the  machine learning techniques to use for particular data science projects
  • the pitfalls of deploying new analytics projects to production at scale
  • machine learning fundamentals and breakthroughs, the importance of algorithms, and data as a platform

You will gain hands-on experience and you will have the skills to:

  • applying data science methods to real-world challenges in different industries
  • implement and manage recommenders using Apache MLlib
  • set up and evaluate data experiments

Additional Information

Prerequisites

!Please note, that you need to bring your own laptop for this training.

Xebia Academy (based in Hilversum, Amsterdam area) is an official training partner of Cloudera, the leader in Apache Hadoop-based software and services.

https://training.xebia.com/data-science/cloudera-data-science-at-scale-using-spark-and-hadoop