Building a Data Lake on AWS with AWS Glue, Glue Studio, Amazon Athena, and S3

  Рет қаралды 21,111

Gary Stafford

Gary Stafford

2 жыл бұрын

Build a simple Data Lake on AWS using a combination of services, including AWS Glue Data Catalog, AWS Glue Crawlers, AWS Glue Jobs, AWS Glue Studio, Amazon Athena, and Amazon S3.
All open-source files on GitHub: github.com/garystafford/ticki....
This video represents my own viewpoints and not of my employer, Amazon Web Services (AWS). All product names, logos, and brands are the property of their respective owners.
📣 Please subscribe to my KZfaq channel for future videos.

Пікірлер: 17
@donaldmahaya2689
@donaldmahaya2689 Жыл бұрын
Excellent hands on video covering building a datalake in AWS.
@hareepjoshi
@hareepjoshi Жыл бұрын
Hey Gary, found you through your medium articles and now I'm watching your youtube videos. Excellent content!
@mohammedgt8102
@mohammedgt8102 Жыл бұрын
Gary, that was just perfect! Fast and straight to the point. 👏👏. Thank you!
@DarioRomeroDeveloper
@DarioRomeroDeveloper Жыл бұрын
I always enjoyed the 'down to earth' practical business cases from @Gary Stafford. This one is really good. Thanks for sharing. I've learned a lot with this tutorial.
@rixonmathew
@rixonmathew 2 жыл бұрын
Great video. Many complex concepts have been explained using simple language and examples.
@gatorpika
@gatorpika Жыл бұрын
Really great presentation, thanks for that.
@_truthful_q_
@_truthful_q_ 2 жыл бұрын
This was excellent 👏 Top marks my man!
@swapnilbops1486
@swapnilbops1486 10 ай бұрын
Very Useful 🌟
@RashaadFontenot
@RashaadFontenot 2 жыл бұрын
Great video
@freakinmonkey85
@freakinmonkey85 2 жыл бұрын
Thanks a lot! This video deserves more views. It’s the first concise to the point video I’ve found where actual data and actual results are shown end to end. I have a question I hope you could answer. How would you handle data that changes. E.g. in a couple of days a customer cancels a ticket with id 1234, concert id 321. Now the calculations needs to take this into account, no?
@GaryStafford
@GaryStafford 2 жыл бұрын
kzfaq.info/get/bejne/aJuDp8Sk0qm6g6s.html
@ericpho4060
@ericpho4060 2 жыл бұрын
Hi, thanks for this video. Very interesting ! Would be curious to know how would you handle incremental updates of this aggregated tables through Athena SQL queries ? with that architecture, would you run full calculation for entire set of data all over at each execution ?
@GaryStafford
@GaryStafford 2 жыл бұрын
Here is the documentation on the current level of integration possible with Amazon Athena (docs.aws.amazon.com/athena/latest/ug/querying-hudi.html): "Currently, Athena supports snapshot queries and read optimized queries, but not incremental queries. On MoR tables, all data exposed to read optimized queries are compacted. This provides good performance but does not include the latest delta commits. Snapshot queries contain the freshest data but incur some computational overhead, which makes these queries less performant."
@profbiyi
@profbiyi 2 жыл бұрын
Hi, Thanks so very much for this. Is it possible to do an incremental load into s3 from RDS with glue?
@ArchitectureBytes
@ArchitectureBytes Жыл бұрын
What's the use case?
@sampyism
@sampyism Жыл бұрын
How much did all of this cost for you for a month?
Deep Dive Into AWS Lake Formation - Level 300 (United States)
28:27
How Many Balloons Does It Take To Fly?
00:18
MrBeast
Рет қаралды 158 МЛН
КАК ДУМАЕТЕ КТО ВЫЙГРАЕТ😂
00:29
МЯТНАЯ ФАНТА
Рет қаралды 8 МЛН
Who has won ?? 😀 #shortvideo #lizzyisaeva
00:24
Lizzy Isaeva
Рет қаралды 64 МЛН
AWS re:Invent 2021 - Building a data lake on Amazon S3
54:52
AWS Events
Рет қаралды 30 М.
7 Best Practices for Implementing Apache Iceberg
57:01
Tabular
Рет қаралды 4,7 М.
Database vs Data Warehouse vs Data Lake | What is the Difference?
5:22
Alex The Analyst
Рет қаралды 747 М.
Look, this is the 97th generation of the phone?
0:13
Edcers
Рет қаралды 4 МЛН
Телефон-електрошокер
0:43
RICARDO 2.0
Рет қаралды 1,3 МЛН