Welcome to the third session on ‘Analytics using PySpark’. In the first session, you learnt how to perform the basic EDA on a data set. The second session was all about linear regression and the basic model building techniques using PySpark.
So, let’s watch the upcoming video in which our SME, Ajay outlines the concepts that will be covered in this session.
In this session
As explained by Ajay, in this session, you will first see a quick recap of the logistic regression algorithm, which was covered in great detail in a previous course. Then, you would move on to the implementation part where you will get to see a very interesting and industry-relevant case study of Click-Through Rate (CTR) prediction wherein you will learn to build an end-to-end code to implement the case study.
The python notebook used in this session is as follows.
Note:
The PowerPoint Presentation used in this session is available in the ‘Session Summary’ segment.
People you will hear from in this session
Subject Matter Expert
Ajay Shukla
Data Science Lead – Myntra
Sajan completed his undergraduate and postgraduate in Computer Science Engineering from IIT, BHU. He heads the pricing team at Myntra, actively working on Data Science, Big Data, Spark and Machine learning. Presently, his work mainly involves the development of discounting strategies for all the products offered by Myntra.
Report an error