No video

In-Process Analytical Data Management with DuckDB - posit::conf(2023)

  Рет қаралды 6,630

Posit PBC

Posit PBC

8 ай бұрын

Presented by Hannes Mühleisen
This talks introduces DuckDB, an in-process analytical data management system that is deeply integrated into the R ecosystem.
DuckDB is an in-process analytical data management system. DuckDB supports complex SQL queries, has no external dependencies, and is deeply integrated into the R ecosystem. For example, DuckDB can run SQL queries directly on R data frames without any data transfer. DuckDB uses state-of-the-art query processing techniques like vectorised execution and automatic parallelism. DuckDB is out-of-core capable, meaning that it is possible to process datasets far bigger than main memory. DuckDB is free and open source software under the MIT license.

In this talk, we will describe the user values of DuckDB, and how it can be used to improve their day-to-day lives through automatic parallelisation, efficient operators, and out-of-core operations.
Materials:
- duckdb.org
- duckdb.org/docs/api/r.html
- github.com/duckdb/duckdb-r
Presented at Posit Conference, between Sept 19-20 2023,
Learn more at posit.co/conference.
--------------------------
Talk Track: Databases for data science with duckdb and dbt.
Session Code: TALK-1099

Пікірлер: 6
@caty863
@caty863 7 ай бұрын
I find it a bit dishonest that you intentianally chose to use *RPostgreSQL* in your demo instead of the faster *RPostgres.* In addition, what's wrong with plain old SQLite? You should have elaborated a little deeper on what exact problem DuckDB is solving here.
@HarmonicaTool
@HarmonicaTool 7 ай бұрын
There are many talks on DuckDB. As much as I could take away: They claim to be faster than SQLite for typical analytical purposes because they store data in columns, not in rows. It is optimized for different purposes. It reads SQLite data directly.
@ravishmahajan9314
@ravishmahajan9314 7 ай бұрын
​@@HarmonicaTool Its basically called Analytical SQLite. So SQLite is a lightweight database of applications from an OLTP perspective. Whereas DuckDB is for OLAP perspective. Means if you have lots and lots of application data and you want to analyze it in seconds, you can do it in duckdb as it is in memory analytics database.
@chalimsupa6603
@chalimsupa6603 Ай бұрын
duckdb is designed for analytical workflows compared to sqlite which is a transactional db.
@chuckbecker4983
@chuckbecker4983 Ай бұрын
Accusing someone of dishonesty should be the last resort, used only after every other reasonable explanation has been identified and investigated. Going to dishonesty first exposes as much about the accuser as the accused.
@chalimsupa6603
@chalimsupa6603 Ай бұрын
@@chuckbecker4983 you are right.... as you said, it reveals more about the accuser. I think he could not find a better word to express his point. "Dishonest" is a very strong word and is not suitable in this context
Running R-Shiny without a Server - posit::conf(2023)
20:10
Posit PBC
Рет қаралды 7 М.
Can A Seed Grow In Your Nose? 🤔
00:33
Zack D. Films
Рет қаралды 31 МЛН
No empty
00:35
Mamasoboliha
Рет қаралды 12 МЛН
Add any Plot to your {gt} table
15:40
R for the Rest of Us
Рет қаралды 3,5 М.
Quarto Dashboards | Charles Teague | Posit
12:16
Posit PBC
Рет қаралды 20 М.
Why should you care about DuckDB? ft. Mihai Bojin
14:35
MotherDuck
Рет қаралды 9 М.
dplyr 1.1.0 Features You Can't Live Without - posit::conf(2023)
5:28
Reproducible Manuscripts with Quarto - posit::conf(2023)
20:26