IKH

Session Overview

Welcome to the session on ‘Optimising Spark Clusters’.

In the next video, you will get an overview of the topics that will be covered in this session.

In this session

  • You will understand some of the basic concepts of Optimising Spark Clusters to improve cluster utilisation by your Spark jobs.
  • You will first understand the need for optimising cluster utilisation for Spark jobs. You will also learn about some of the common mistakes that lead to underutilisation and some best practices for optimising cluster utilisation.
  • Next, you will learn about the different job deployment modes available in Spark. In the next segment, you will learn about the cost-performance trade-offs that you need to consider while deciding the optimal cluster configuration.
  • You will also learn how Apache Spark jobs are implemented in the production environment. In the final segment, you will learn about some of the best practices that you should follow while working with Apache Spark

People you will hear from in this session

Subject Matter Expert

Vishwa Mohan

Senior Software Engineer, LinkedIn

Vishwa is currently working as a senior software engineer at LinkedIn, an online employment-oriented platform. He has over nine years of experience in the IT industry and has worked in various companies, including Amazon, Walmart, Oracle and others. He has deep knowledge of various tools and technologies that are used in the industry today.

Report an error