Apache Spark 3 – Databricks Certified Associate Developer

Learn Apache Spark 3 With Scala & Earn the Databricks Associate Certification to prove your skills as a data professional

Do you want to learn how to handle massive amounts of data at scale?

What you’ll learn

  • How to prepare for the Databricks Certified Associate Developer For Apache Spark 3 Certification Exam.
  • The Architecture of an Apache Spark Application.
  • Learn how Apache Spark runs on a cluster of computers.
  • Learn the Execution Hierarchy of Apache Spark.
  • Create DataFrames from files and Scala collections.
  • Spark DataFrame API and SQL functions.
  • Learn the different techniques to select the columns of a DataFrame.
  • How to define the schema of a DataFrame and set the data types of the columns.
  • Apply various methods to manipulate the columns of a DataFrame.
  • How to filter your DataFrame based on specific rules.
  • Learn how to sort data in a specific order.
  • Learn how to sort rows of a DataFrame in a specific order.
  • How to arrange the rows of a DataFrame into groups.
  • How to handle NULL Values in a DataFrame.
  • How to use JOIN or UNION to combine two data sets.
  • How you can save the result of complex data transformations to an external storage system.
  • The different deployment modes of an Apache Spark Application.
  • Working with UDFs and Spark SQL functions.
  • How to use Databricks Community Edition to write Apache Spark Code.

Course Content

  • Apache Spark Architecture: Distributed Processing –> 4 lectures • 21min.
  • Apache Spark Architecture: Distributed Data –> 3 lectures • 23min.
  • DataFrame Transformations –> 23 lectures • 2hr 54min.
  • Apache Spark Architecture: Execution –> 4 lectures • 42min.
  • Exam Logistics –> 1 lecture • 12min.

Requirements

  • Basic Scala Knowledge.
  • Basic data skills.
  • No previous Spark knowledge required.

Learn Apache Spark 3 and pass the Databricks Certified Associate Developer for Apache Spark 3.0 exam.

Hi, my name is Wadson, and I’m a Databricks Certified Associate Developer for Apache Spark 3.0.

In today’s data-driven world, Apache Spark has become a de facto standard framework for processing big data on clusters.

Apache Spark is used for Data Engineering, Data Science, and Machine Learning.

I will teach you everything you need to know about getting started with Apache Spark.

You will learn the architecture of Apache Spark and use its core APIs to manipulate complex data.
You will write queries to perform transformations such as Join, Union, GroupBy, and more.

This course is for beginners.
You do not need previous knowledge of Apache Spark.

There are Notebooks available to download so that you can follow along with me in the videos.
The notebooks contain all the source code I use in the course.
There are also Quizzes to help you assess your understanding of the topics.