IKH

Session Introduction

Welcome to the third session on ‘Analytics using PySpark’. In the first session, you learnt how to perform the basic EDA on a data set. The second session was all about linear regression and the basic model building techniques using PySpark. 

So, let’s watch the upcoming video in which our SME, Ajay outlines the concepts that will be covered in this session.

In this session

As explained by Ajay, in this session, you will first see a quick recap of the logistic regression algorithm, which was covered in great detail in a previous course. Then, you would move on to the implementation part where you will get to see a very interesting and industry-relevant case study of Click-Through Rate (CTR) prediction wherein you will learn to build an end-to-end code to implement the case study.

The python notebook used in this session is as follows. 

Note:

The PowerPoint Presentation used in this session is available in the ‘Session Summary’ segment.

People you will hear from in this session

Subject Matter Expert

Ajay Shukla

Data Science Lead – Myntra

Sajan completed his undergraduate and postgraduate in Computer Science Engineering from IIT, BHU. He heads the pricing team at Myntra, actively working on Data Science, Big Data, Spark and Machine learning. Presently, his work mainly involves the development of discounting strategies for all the products offered by Myntra.

Report an error