Build a poor man’s data lake from scratch with DuckDB

  Рет қаралды 25,045

Dagster

Dagster

Күн бұрын

DuckDB is so hot right now. Could it replace our cloud data warehouses or data lakes?
Pete Hunt builds a data lake from scratch with DuckDB and Dagster. Follow the tutorial here: dagster.io/blog/duckdb-data-lake
Combined with Dagster, S3, and Apache Parquet, DuckDB can become a powerful, multiplayer data lake that can serve the needs of many organizations with very little effort. Think of it as a free, simple SQLite.
We can see the rise of DuckDB for subsets of workloads that don’t have massive scale and can take advantage of its simplicity and fast performance.
Give it a go!
Try Dagster for free for 30 days: dagster.io/lp/dagster-cloud-t...

Пікірлер: 10
@vikramtatke5930
@vikramtatke5930 2 ай бұрын
As a person with just 2 years of experience my mind was blown watching this. I am a single person writing code in my department so I don't have any seniors to learn from but I'm leading a data engineering project that deals with terabytes of data and each request is multiple times larger than the server's RAM and multiple such requests need to be processed in parallel to complete stuff in time. Also, we have the tiniest possible budget to aggregate 25 to 30 columns and billions of rows every day. Also, we need to cut down on costs. This was super helpful.
@michaelayoub2211
@michaelayoub2211 Жыл бұрын
Great video, thanks!
@marcosoliveira8731
@marcosoliveira8731 7 ай бұрын
Really good stuff! A lot of good ideas.
@tobiaspucher9597
@tobiaspucher9597 26 күн бұрын
Awesome!!! Please more!
@ImperialTerrain
@ImperialTerrain Жыл бұрын
thank you pete
@gw1284
@gw1284 Жыл бұрын
Thanks for this demo. Can you comment on what role polars may play in this?
@hwy9nightkid
@hwy9nightkid Жыл бұрын
polars is akin to pandas or spark dataframes.. a way to organize your tables of data , if im not mistaken
@marcosoliveira8731
@marcosoliveira8731 7 ай бұрын
As pandas alternative.
@kalidsherefuddin
@kalidsherefuddin 7 ай бұрын
Thanks for
@gauravlotekar660
@gauravlotekar660 Жыл бұрын
aawwwseome.
Big Data is Dead | MotherDuck
25:58
Data Council
Рет қаралды 11 М.
БОЛЬШОЙ ПЕТУШОК #shorts
00:21
Паша Осадчий
Рет қаралды 8 МЛН
The day of the sea 🌊 🤣❤️ #demariki
00:22
Demariki
Рет қаралды 98 МЛН
Heartwarming: Stranger Saves Puppy from Hot Car #shorts
00:22
Fabiosa Best Lifehacks
Рет қаралды 12 МЛН
Converting an ETL script to Software-Defined Assets
26:16
Dagster
Рет қаралды 6 М.
Data Warehouses are Gilded Cages  What Comes Next | Motherduck
39:41
Data Council
Рет қаралды 3,5 М.
15 futuristic databases you’ve never heard of
8:42
Fireship
Рет қаралды 651 М.
DuckDB vs Pandas vs Polars For Python devs
12:05
MotherDuck
Рет қаралды 14 М.
DuckDB: Supercharging Your Data Crunching  by Richard Wesley
30:45
Why should you care about DuckDB? ft. Mihai Bojin
14:35
MotherDuck
Рет қаралды 7 М.
Writing My Own Database From Scratch
42:00
Tony Saro
Рет қаралды 149 М.
Where are you from?
0:13
ARGEN
Рет қаралды 4,2 МЛН
Do you like icecream?
0:21
dednahype
Рет қаралды 11 МЛН