IKH

Module Summary

In the next video, let’s hear from our SME as he explains the key takeaways from this module on Redshift.

The key takeaways from this module are summarised below.

  • Amazon Redshift is based on PostgreSQL with additional capabilities to support OLAP operations.
  • Redshift offers up too 3x performance than other DW and you can store PBs of data without impacting the performance.
  • Redshift does not enforce constraints and does not support traditional indexes; instead, it has the sort key and distribution keys.
  • Using WLM, you can define separate queues for separate workloads and assign a portion of cluster memory to each queue.
  • Most maintenance activities, like VACUUM and ANALYZE in Redshift are automated.
  • Using Redshift Spectrum, you can query the data on S3 and join it with data on Redshift.
  • The new RA3 nodes decouple storage from compute and comes with AQUA, which provides up to 10x performance.
  • Using COPY command, you can load in parallel into Redshift.
  • Redshift offers automated snapshots which can also be scheduled.
  • With node reservations, you can save up to 60% costs over on-demand pricing.

Some links that could be useful are given below:

  • Amazon Redshift Utilities: Github Repo
  • Redshift Advanced Monitoring: Github Repo
  • Amazon Redshift UDFs: Github Repo

Some useful utilities are mentioned in the table below.

Admin ScriptsCollection of utilities for running diagnostics on your cluster.
Admin ViewsCollection of utilities for managing your cluster, generating a schema DDL, and so on.
Analyse Vacuum UtilityA Utility that can be scheduled to vacuum and analyse the tables within your Amazon Redshift cluster.
Column Encoding UtilityA utility that can apply optimal column encoding to an established schema with data already.

The lecture notes for this module are provided below.

Important Note:

Amazon Redshift is a costly service of AWS. Hence, to avoid burning up your monthly AWS budget, please make sure to Pause your Redshift cluster during the module week(when you’re learning/practicing Redshift. However, you can also choose to terminate it if there is any kind of break while you are going through the module).

Please note that clicking on the Stop button in AWS Academy dashboard does not affect the Redshift cluster and therefore, the cluster has to be terminated from inside the AWS management console page.

Additional Reading

Amazon Redshift: Offcial Documentation

Amazon Redshift: Database Developer Guide

Amazon Redshift System Overview

AWS Redshift Monitoring: The Complete Guide

Report an error