Dagster+ Launch: Welcome
4:33
21 hours ago
Dagster+ overview
7:26
2 months ago
Dagster+ Insights
6:28
2 months ago
Dagster+ Data Reliability
6:31
2 months ago
Dagster+ Data Catalog
5:14
2 months ago
Introducing Dagster+
29:51
2 months ago
Dagster Demo Recording
35:03
4 months ago
Exploring the new Dagster UI
6:32
Comments
@kosmylo 1 day ago
What is the difference between dagit and dagster-webserver?
@ShaneZarechian 4 days ago
Is FreshnessPolicy being deprecated?
@JohnCF 7 days ago
Came here excited to learn about new features in the latest Dagster version. But it looks like you've decided to widen the feature-gap between the open source offering and the enterprise offering... even though this will be a maintenance burden on your team... causing delays in "backporting" features and bugfixes to the open-source version going forward. Kinda disappointed...
@dagsterio 7 days ago
Thanks for the comment @JohnCF. If you go through the enhancements introduced with this Dagster+ launch, you will see that many of them (in fact, all of them except for Dagster Insights) benefit both the open-source and the commercial offerings. The data cataloging capability is a good example of that. From our perspective, these new additions are moving us forward on both the OSS and the Dagster+ roadmaps. In addition, by providing more value to those organizations that adopt Dagster+ we are able to guarantee the longevity and accelerated development of Dagster Open-Source.
@JohnCF 5 days ago
@dagsterio Does that mean what's mentioned at 7:15 about column lineage is available in open-source too? The phrasing definitely sounded like it's only available for Enterprise users...
@dagsterio 4 days ago
@JohnCF Correct. Column-level lineage is a Dagster+ feature and is not available in Dagster Open-Source.
@jordanfox470 14 days ago
Is there native support for mapping time-based partitions to static partitions defined like "today", "rest of month", "rest of year", "rest of history"? This is a common setup for Power BI datasets, which can be represented as assets in Dagster. Would be nice to take advantage of auto-materialize policies.
@dagsterio 13 days ago
Dagster does not natively support mapping time-based partitions to static partitions like "today," "rest of month," "rest of year," and "rest of history" directly out of the box. However, you can achieve similar functionality by defining custom partitioning schemes and using the appropriate partition mappings. You can define custom partitions using StaticPartitionsDefinition for static and TimeWindowPartitionsDefinition for time-based partitions.
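As a rough illustration of the bucketing logic such a custom mapping would need, here is a minimal plain-Python sketch. It does not use the Dagster API: `bucket_for` and `BUCKETS` are hypothetical names introduced here, the bucket labels are taken from the question, and wiring the function into `StaticPartitionsDefinition`, `TimeWindowPartitionsDefinition`, or a partition mapping is left out.

```python
from datetime import date

# Bucket labels taken from the question above; the names are illustrative only.
BUCKETS = ("today", "rest of month", "rest of year", "rest of history")


def bucket_for(day: date, today: date) -> str:
    """Return the static bucket a daily partition falls into, relative to `today`."""
    if day > today:
        raise ValueError("partition date is in the future")
    if day == today:
        return "today"
    if (day.year, day.month) == (today.year, today.month):
        return "rest of month"
    if day.year == today.year:
        return "rest of year"
    return "rest of history"
```

Because the buckets are relative to the current date, the mapping changes every day, which is probably why it has to be recomputed in custom code rather than declared once.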
@tobiaspucher9597 26 days ago
Awesome!!! Please more!
@flogzer0 28 days ago
I'm fairly sure this sales guy never used Airflow
@AbhishekAgrawal-dv1id 1 month ago
If the requirement is to get the data from S3 files into a BQ table but perform some validations on those files before inserting into the table, how would we do it with Embedded ELT? We are using Dagster OSS heavily and looking to use embedded-elt for getting data from files, tables and APIs..
@tim-at-elementl 1 month ago
Hey Abhishek! In your case, would you be able to represent the S3 files as source assets first, adding asset checks onto those, and running Embedded ELT only if those asset checks pass? Sling currently (afaik) is heavily focused on doing ingestion well, so you can defer to the rest of the Dagster ecosystem (such as asset checks) for validations.
@AbhishekAgrawal-dv1id 1 month ago
@tim-at-elementl Thanks, Tim. How would you rate dlt for my use-case? I see dlt is far more mature..
@tim-at-elementl 1 month ago
@AbhishekAgrawal-dv1id We've found that dlt is a powerful framework for ingesting from APIs and it's definitely mature enough for production settings. I'll also say that neither Sling's nor dlt's integration currently allows for creating asset checks in-flight during ingestion. Instead, have you thought about ingesting the files into a quarantined dataset first using whichever tool you'd like, applying asset checks to that, and then moving that data to your real "analytics-ready" BQ datasets once you've vetted the data? This way, you can easily do ad hoc analysis to understand why the data failed data quality tests, but also keep it isolated from your production analytics.
@AbhishekAgrawal-dv1id 1 month ago
Yeah, I am also leaning towards doing something like this. Thanks for this, Tim. Would you suggest using a similar approach to pull data from a different database? We'd still need to run minor validations on the incoming data, though. Would dlt help here at all?
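The quarantine-then-promote flow discussed in this thread could look roughly like the plain-Python sketch below. It is tool-agnostic and does not use Sling, dlt, or Dagster asset checks; `promote_if_valid`, `not_empty`, `has_required_columns`, and the `id`/`amount` columns are all hypothetical names chosen for illustration, and in-memory lists stand in for the quarantined and production BQ datasets.

```python
from typing import Callable, Dict, List

Row = Dict[str, object]
Check = Callable[[List[Row]], bool]


def not_empty(rows: List[Row]) -> bool:
    """Hypothetical check: the quarantined batch contains at least one row."""
    return len(rows) > 0


def has_required_columns(rows: List[Row]) -> bool:
    """Hypothetical check: every row carries the columns we expect."""
    return all({"id", "amount"}.issubset(row) for row in rows)


def promote_if_valid(
    quarantined: List[Row], checks: List[Check], production: List[Row]
) -> List[str]:
    """Run every check on the quarantined rows; copy them to production only if all pass.

    Returns the names of the failed checks (empty list means the data was promoted).
    """
    failed = [check.__name__ for check in checks if not check(quarantined)]
    if not failed:
        production.extend(quarantined)
    return failed
```

When a check fails, the rows stay in the quarantine area, so they can be inspected without touching production data.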
@vikramtatke5930 2 months ago
As a person with just 2 years of experience my mind was blown watching this. I am a single person writing code in my department so I don't have any seniors to learn from but I'm leading a data engineering project that deals with terabytes of data and each request is multiple times larger than the server's RAM and multiple such requests need to be processed in parallel to complete stuff in time. Also, we have the tiniest possible budget to aggregate 25 to 30 columns and billions of rows every day. Also, we need to cut down on costs. This was super helpful.
@atulverma7783 2 months ago
Anyone notice the Silicon Valley reference in the screenshots?
@dagsterio 2 months ago
Yep. We are big fans. Enjoy the Easter eggs! ;-)
@JohnoScott 2 months ago
Seems to be an alternative to dbt docs and dbt Cloud Explorer?
@user-xh6tx8py1r 2 months ago
For some teams, definitely, although it can be complementary to dbt docs, because it pulls in some of the data via the dbt integration. It essentially becomes a superset of the documentation.
@armanuki38911 2 months ago
You lost me at "cloud".
@JohnoScott 2 months ago
Where is Nick Schrock?
@dagsterio 2 months ago
Behind the camera, helping out with the teleprompter while recovering from an injury.
@JohnoScott 2 months ago
Wishing him a speedy recovery then. We miss him on KZfaq!
@schrockn 2 months ago
👋 Right here! I just happened to be unable to participate in the recording session for this. Team killed it!
@JohnoScott 2 months ago
@schrockn Yes they did. Keen to hear your take on all this, Nick; video from you soon?
@fredguth1315 2 months ago
What does Dagster+ mean for the open source version?
@dagsterio 2 months ago
Many of the enhancements in the 1.7 release benefit all users (Open-source and paid Dagster+ users). In general, the open-source solution gains more capabilities with each release both to support open-source users and to unlock more capabilities in Dagster+ which are built on top of core.
@minimapai 2 months ago
Exciting
@Robay146 2 months ago
Great presentation.
@Amapramaadhy 2 months ago
Please update the gh repo when possible with the data mesh example. Multiple code locations seem super useful. Thanks
@dagsterio 2 months ago
Hi! I had to put it in a different repo to accommodate running multiple code locations without breaking our existing setup for the deep dive projects. The dedicated repo for the data mesh example can be found here: github.com/dagster-io/data-mesh-demo
@Amapramaadhy 2 months ago
@dagsterio Much appreciated
@Jesufemi_O 3 months ago
Hi Dagster team, great stuff here! I really enjoyed watching this!! Is the demo code available on GitHub?
@atkinsonr 3 months ago
nice talk but the slides are hard to follow on here. Would be better if recorded with autofocus off, and white balanced to the projector screen.
@danielbartley516 3 months ago
100%
@maxisqt 3 months ago
This is the coolest tech demo I've ever seen. I have wanted for so long to see an end-to-end analytics stack demo, or tutorial, and never found it. You just did it in 15 minutes, using free, open source tools I can run locally on my laptop. Absolutely incredible!
@dagsterio 3 months ago
Thanks. The Dagster capabilities are expanding with each new release.
@cornstarch4575 3 months ago
At around 8:20 you mention it's vulnerable to SQL injection - could I get more detail on that?
@zuesbenz 3 months ago
fk the learning curve on this shit.
@krtmlry2719 2 months ago
Is it really hard? I'm planning to learn this too. lmao
@quinnherden 3 months ago
That's interesting. Do you expand on this somewhere?
@dagsterio 3 months ago
You might find this blog post by Sandy interesting: dagster.io/blog/dagster-ml-pipelines - or you can listen to the entire podcast featuring Sandy here: datastackshow.com/podcast/machine-learning-pipelines-are-still-data-pipelines-with-sandy-ryza-of-dagster/
@rembautimes8808 3 months ago
I work in a financial institution and there is definitely a need for a reliable and resilient data process. Look forward to finding out more about Dagster. I also agree, no point building something flaky and have it barf 🤢
@rembautimes8808 3 months ago
Yes I’m excited. Thanks
@hungnguyenthanh4101 3 months ago
Link to the repo, please.
@dagsterio 3 months ago
Sorry, one of our redirects got broken - here is the link: github.com/dagster-io/devrel-project-demos/tree/main
@dagsterio 3 months ago
More specifically for this session: github.com/dagster-io/devrel-project-demos/tree/main/dagster-deep-dives/dagster_deep_dives/resources_and_configurations
@jakobullmann7586 3 months ago
I don’t know… this video is one year old, but still uses the legacy DAG syntax from Airflow 1, rather than the TaskFlow API from Airflow 2. So the syntax doesn’t make a difference anymore. Regarding the coupling to environment: Airflow has different executors. The KubernetesPodOperator is not the only way to run on a Kubernetes environment. The rest may or may not be true. Probably there are many things that Dagster does better than Airflow. But I’m disappointed that you would publish such a biased comparison.
@user-yh1hx9pe7u 3 months ago
@dagsterio Do you have the source of the demo available somewhere?
@dagsterio 3 months ago
All the code for the demos from the deep dives is in this repository ( github.com/dagster-io/devrel-project-demos )! This one in particular is in the partitions directory.
@Jahaniam 2 months ago
@dagsterio Unfortunately it is private or the link is broken.
@dagsterio 8 days ago
@Jahaniam Sorry, the final parenthesis got included by KZfaq in the URL - try this: github.com/dagster-io/devrel-project-demos
@shaounakn 4 months ago
I like what you folks have done with this product.
@dagsterio 4 months ago
Thanks - there is a lot more in store coming next month!
@quinnherden 4 months ago
+1 I am rooting for you guys. Thank you for all of your hard work
@dagsterio 4 months ago
We appreciate it - thanks @quinnherden!
@shaounakn 3 months ago
@dagsterio Sure, thanks for making these sessions; they are really helpful.
@hungnguyenthanh4101 4 months ago
Please send me the link to the Git repo from the video.
@dagsterio 4 months ago
Try: github.com/dagster-io/devrel-project-demos
4 months ago
Joining other comments, I'd love to see more step-by-step tutorials and use cases. It took a few videos to grasp the concepts, and this one is a good one to start with. Docs are good, but videos are even better. I would love to see more of duckdb / dagster and ingestion cases.
@user-hs9lo5gh3r 4 months ago
At 7:47 in the video you show using the Launchpad to configure assets... I can't figure out how to access this page?
@colton-dagster 4 months ago
Hi @user-hs9lo5gh3r, the most common way to bring up this menu is to select an asset from the global asset lineage, and then in the top right where it says "Materialize selected...", open the dropdown menu and select "Open launchpad". Hope this helps!
@Amapramaadhy 4 months ago
What’s with these shorts? Feels like a kid got hold of your social account! Stick to real content
@jesperbagge2504 4 months ago
I really want to love Dagster but watching this video reminded me of why I stopped using Dagster for moving data from point A to point B. There are so, so many layers of configuration and plain infrastructure all over the place that kind of just needs to be there that the actual business logic (you know, the valuable part of the code that defines the data product) gets completely buried.
@Amapramaadhy 4 months ago
IMO, one of the most confusing concepts and unnecessarily convoluted items in Dagster (which is otherwise amazing). E.g. what’s with RunConfig that has references to `ops` but then things have to be keyed/named by asset name? You totally glossed over the global config item (e.g. an S3 bucket that is common to everyone); then you have to use an awkward resource that doesn’t really do anything other than hold some fields (ahem, config). I really wish this would get cleaned up.
@colton-dagster 4 months ago
Hey @Amapramaadhy, what you’re expressing is totally valid. The concepts of Assets, Ops, and Jobs and how to compose them can be a bit convoluted - this has become more noticeable as our APIs evolve. We’re aware of this, and it’s on our roadmap to improve. Thanks for taking the time to respond and sharing your thoughts.
@huyhoangnguyen7465 4 months ago
A manual run succeeds, but the scheduled run fails.
@huyhoangnguyen7465 4 months ago
dagster._core.errors.DagsterInvariantViolationError: Cannot access partition_key for a non-partitioned run
@huyhoangnguyen7465 4 months ago
Hi, I'm hitting this bug: dagster._core.errors.DagsterInvariantViolationError: Cannot access partition_key for a non-partitioned run
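The message says the code read `partition_key` during a run that was launched without a partition. One possible cause, which the thread does not confirm, is a schedule that starts the job without supplying a partition key while manual runs pick one in the UI. A defensive sketch, assuming the execution context exposes a `has_partition_key` property alongside `partition_key`: `partition_key_or_default` is a hypothetical helper and `FakeContext` is only a stand-in so the snippet runs without Dagster installed.

```python
from typing import Any, Optional


def partition_key_or_default(context: Any, default: Optional[str] = None) -> Optional[str]:
    """Return context.partition_key for partitioned runs, otherwise `default`."""
    # Assumption: the context has a boolean `has_partition_key` property.
    if getattr(context, "has_partition_key", False):
        return context.partition_key
    return default


class FakeContext:
    """Illustrative stand-in for an execution context; not part of Dagster."""

    def __init__(self, key: Optional[str] = None) -> None:
        self._key = key

    @property
    def has_partition_key(self) -> bool:
        return self._key is not None

    @property
    def partition_key(self) -> str:
        if self._key is None:
            raise RuntimeError("Cannot access partition_key for a non-partitioned run")
        return self._key
```

If the asset really must run per partition, falling back to a default only hides the problem; in that case the schedule itself would need to supply a partition key.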
@xOnelinx 5 months ago
This is such a superficial and disingenuous comparison that I don't even want to write a comment in English 🤦‍♂
@lucasfermo8909 5 months ago
Awesome!!!!
@congtinNguyen 5 months ago
I have struggled with Dagster concepts for some time. Its concepts are numerous and quite overwhelming for beginners.
@dagsterio 5 months ago
No doubt that every new powerful framework takes some investment up front to learn. Have you explored Dagster University? courses.dagster.io/courses/dagster-essentials
@congtinNguyen 5 months ago
@dagsterio Thank you so much. It has helped clarify many things.
@EtienneTremblay 6 months ago
In terms of debugging, being able to run Dagster in debug mode in VS Code, set breakpoints, and inspect variables is a game changer. Here is how to set it up: github.com/dagster-io/dagster/issues/17859#issuecomment-1805916514
@Amapramaadhy 6 months ago
Awesome 👏🏽. Really nice and succinct description of an otherwise tricky feature. Hopefully a future video can cover advanced use cases of how to wire up sensors with partition definitions so that we can programmatically launch/backfill etc. Thanks again for the great content.
@dduran6609 6 months ago
We need access control for the UI in the CLI Dagster installation; Airflow has a good access-control layer for multiple users.
@dagsterio 6 months ago
We explain the logic in a blog post here: arc.net/l/quote/euycfcsm
@JohnoScott 6 months ago
Pulling Dagster apart is a great Dr Manhattan move and will make you stronger with all these new use-cases you have unlocked. A great response to user feedback
@colton.padden 6 months ago
TIL of Sling - thanks Pedram!
@ingenieroriquelmecagardomo4067 6 months ago
dlt is better. Light years beyond Sling imo, and they have a better, more committed and active team. dlt is like the Dagster of the lightweight integration libraries.
@joshi1q2w3e 7 months ago
Does Dagster work on premises as well as the cloud? Are any features on cloud not available on premises? My company is moving back to on premises and doesn’t want to use SSIS.
@dagsterio 6 months ago
Hi Joshi. Dagster is an open-source project and all the capabilities that Pedram describes in this video are available in the open-source solution, which you can self-host locally or on-prem. Dagster Cloud offers additional capabilities, which are detailed at dagster.io/cloud or on the pricing page here: dagster.io/pricing