In the next video, let’s hear from our SME as he explains the key takeaways from this module on Redshift.
The key takeaways from this module are summarised below.
- Amazon Redshift is based on PostgreSQL with additional capabilities to support OLAP operations.
- Redshift offers up too 3x performance than other DW and you can store PBs of data without impacting the performance.
- Redshift does not enforce constraints and does not support traditional indexes; instead, it has the sort key and distribution keys.
- Using WLM, you can define separate queues for separate workloads and assign a portion of cluster memory to each queue.
- Most maintenance activities, like VACUUM and ANALYZE in Redshift are automated.
- Using Redshift Spectrum, you can query the data on S3 and join it with data on Redshift.
- The new RA3 nodes decouple storage from compute and comes with AQUA, which provides up to 10x performance.
- Using COPY command, you can load in parallel into Redshift.
- Redshift offers automated snapshots which can also be scheduled.
- With node reservations, you can save up to 60% costs over on-demand pricing.
Some links that could be useful are given below:
- Amazon Redshift Utilities: Github Repo
- Redshift Advanced Monitoring: Github Repo
- Amazon Redshift UDFs: Github Repo
Some useful utilities are mentioned in the table below.
| Admin Scripts | Collection of utilities for running diagnostics on your cluster. |
| Admin Views | Collection of utilities for managing your cluster, generating a schema DDL, and so on. |
| Analyse Vacuum Utility | A Utility that can be scheduled to vacuum and analyse the tables within your Amazon Redshift cluster. |
| Column Encoding Utility | A utility that can apply optimal column encoding to an established schema with data already. |
The lecture notes for this module are provided below.
Important Note:
Amazon Redshift is a costly service of AWS. Hence, to avoid burning up your monthly AWS budget, please make sure to Pause your Redshift cluster during the module week(when you’re learning/practicing Redshift. However, you can also choose to terminate it if there is any kind of break while you are going through the module).
Please note that clicking on the Stop button in AWS Academy dashboard does not affect the Redshift cluster and therefore, the cluster has to be terminated from inside the AWS management console page.
Additional Reading
Amazon Redshift: Offcial Documentation
Amazon Redshift: Database Developer Guide
Amazon Redshift System Overview
AWS Redshift Monitoring: The Complete Guide
Report an error