© 2019 by the PA CONSULTING GROUP

  • Black Twitter Icon
  • Black LinkedIn Icon

Data Driven Organisation website

AI & Analytics

Spark / Scala

Anyone who want to learn how to program in Spark / Scala. Typically data engineers/scientists and software engineers.

Aimed at:

Delivery method:

Interactive classroom training combining theory with practical exercises

Prerequisites:

Required preparation:

Basic programming knowledge, some idea of machine learning concepts

None

3-4 hours

Practitioner

Duration:

Skill level:

Prefered group size:

+/-10 participants per trainer

Course description

The spark-Scala training is a practical introduction on working with spark using Scala. It focuses on spark 1.6.2 and for data processing we focus on data frames. Topics that are covered are: the Spark framework (what is it, why do we need it, where does it sit in relation to hadoop), Spark components, transformations and actions, Scala data frames and Scala basics, and Machine Learning pipelines in Spark (Spark ML). In the practical part of the session you'll be working with Spark and Scala in Zeppelin.

Learning objective

Upon completion of this training, participants will have some hands-on experience in working with Spark / Scala to further build on.