IKH

Module Introduction

Welcome to this module on ”Analytics Using PySpark’‘. Let’s hear from Sajan about the broad topics you will be covering in this module.

As you know the size of data is increasing exponentially every day. So in order to manipulate this large amount of data, Spark ML library is used. So in this module, you will be covering the basic machine learning algorithms using the Spark ML library. The machine learning algorithms covered in this module are:

  1. Linear Regression
  2. Logistic Regression
  3. K-Means Clustering

As explained in the video, this module is more geared towards the hands-on coding aspect of the algorithms and relatively lesser towards the theoretical aspects of the same.

You will also go through an advanced demonstration after this module, where you will go through an end-to_end case study on the workings of a recommendation system.

Guidelines for this module

This module is more inclined towards the implementation of the basic machine learning algorithms using PySpark. Each session starts with a quick recap of the topics followed by the implementation part.

Please also try writing/running the codes with the experts. The presentation used in the sessions will be provided in the corresponding ‘Session Summary’ segment.

Guidelines for In-segment and Graded Questions

There will be a separate session for graded questions. The other sessions will contain non-graded question. Each graded question in this module will have 10 marks for a correct response and o for an incorrect response. Each graded question will have only one attempt, while each non-graded question will have one or two attempts depending upon the type of question and the number of options.

People you will hear from in this module

Adjunct Faculty

Ajay Shukla

Data Science Lead- Myntra

Ajay has completed his undergraduate and postgraduate in Computer Science Engineering from IIT, BHU. He heads the pricing team at Myntra, where he actively works on technologies like Data Science, Big Data, Spark and Machine learning. Presently, his work mainly involves the development of discounting strategies for all the products offered by Myntra.

Adjunct Faculty

Ajay Shukla

Senior Data Scientist at Gramener

With over 10 years of experience in data science and predictive analysis, Jaidev has worked in multiple firms such as Springboard, iDataLabs and cube26. He has completed his bachelor’s degree in Electrical and Electronics Engineering from Vishwakarma Institute of Technology, Pune. He is currently working as Senior Data Scientist at Gramener, a leading data science consulting company that advises clients on Data-Driven Leadership.

Adjunct Faculty

Ajay Shukla

AI-COE, IKH-Royal

Ajay has over 12 years of experience in machine learning and AI across various domains such as banking and financial services, e-commerce and telecom. He has worked with Amazon, Snapdeal and Citigroup.

He is expert in the application of ML, in marketing and risk. He has worked with organisations across multiple geographies and developed and implemented data science solutions targeting different stages in the customer life cycle. He has worked extensively on building ML models and has experience in advanced techniques such as neural networks, CBMs and SVMs.

Adjunct Faculty

Ajay Shukla

Senior Data Engineer

Kautuk is currently working as a senior data engineer. He has over 9 years of experience in the IT industry IT industry and has worked for several companies. He has deep knowledge of the various tools and technologies that are in use today.

Report an error