Premium Only Content

Reddit Data Pipeline | AWS End to End Data Engineering
🚀 In this video, we walk you through the integration of Reddit, Airflow, Celery, Postgres, S3, AWS Glue, Athena, and Redshift to create a seamless ETL process. 📊🔍
What You Will Learn 📝:
🌐 How to extract data from Reddit using its API.
🔄 Setting up and orchestrating ETL processes with Apache Airflow and Celery.
📦 Storing efficiently with Amazon S3 using Airflow.
🧠 Leveraging AWS Glue for data cataloging and ETL jobs.
📜 Querying and transforming data with Amazon Athena.
🏢 Setting up Redshift Cluster and Best practices for loading data into Amazon Redshift for analytics.
⏰ Timestamps:
0:00 Introduction
1:27 Setting up Apache airflow with Celery Backend and Postgres
9:20 Reddit Data Pipeline with airflow
41:00 Cleaning and Transforming Reddit Data
50:00 Connecting to AWS from Airflow
1:11:17 AWS Glue data transformation
1:22:13 Querying Data with Athena
1:24:47 Setting up Redshift Data Warehouse
1:27:26 Redshift Data Warehouse Query Tool
1:29:00 Loading Data into Data Warehouse
1:32:25 Charting with Redshift Data Warehouse
🔗 Useful Links:
Reddit API Documentation: https://www.reddit.com/wiki/api/
Apache Airflow Official Site: https://airflow.apache.org/docs/
AWS Glue Documentation: https://docs.aws.amazon.com/glue/latest/dg/catalog-and-crawler.html
💬 Let us know in the comments if you have any questions or if there's another topic you'd like us to cover next!
🌟 Don't forget to like, share, and subscribe for more data tutorials! 🌟
-
LIVE
Mally_Mouse
2 hours ago📣Telescreen Talks - LIVE!
135 watching -
1:57:29
DeVory Darkins
17 hours ago $34.27 earnedDemocrats drop SHOCKING Update regarding ICE Agents - Myron Gaines
117K59 -
21:24
Professor Nez
2 hours ago🚨WOW! Trump got EMOTIONAL when RFK Jr. Said THIS!
11.5K15 -
LIVE
Jeff Ahern
1 hour agoNever woke Wednesday with Jeff Ahern
68 watching -
1:06:21
Timcast
4 hours agoLiberals DEFEND Nazi Tattoo On Communist Democrat Senate Candidate, ITS A CULT
136K150 -
LIVE
Side Scrollers Podcast
2 days ago🔴FIRST EVER RUMBLE SUB-A-THON🔴DAY 3🔴100% REVENUE HELPS CHANGE CULTURE!
1,240 watching -
25:57
The Kevin Trudeau Show Limitless
6 hours agoThe Sound Of Control: This Is How They Program You
13.6K8 -
LIVE
Dr Disrespect
5 hours ago🔴LIVE - DR DISRESPECT - BATTLEFIELD 6 KILL CHALLENGE - VS VISS
1,192 watching -
11:32
Sponsored By Jesus Podcast
3 days agoWhat “Speaking the Truth in Love” REALLY Means | Tension of Grace and Truth
19.5K6 -
29:40
Paul Barron Network
2 days ago $1.02 earnedCrypto ETFs Launching... Even With Government SHUT DOWN?! 🤯 Grayscale INTERVIEW
21.4K1