Intro to Foundry Pipeline Builder

Рет қаралды 5,169

Күн бұрын

Pipeline Builder is the most accessible way to build production-grade data pipelines in Palantir Foundry. This video is an intro tutorial that covers the basics of how to use Pipeline Builder and some essential data engineering concepts. It also showcases the first few AI features of Pipeline Builder.
Ontologize
Founded by Taylor Gregoire-Wright, a former Palantir implementation engineer, Ontologize offers courses & live trainings for Palantir Foundry. Visit ontologize.com or connect on LinkedIn: / tgregoirewright
You can follow along with the same data I used for this tutorial
The data used in this tutorial is notional data from a fictional set of grocery stores in the US. The datasets include:
Transactions - A customer makes a transaction when they buy groceries
Baskets - Baskets represents which items and how many of each were bought in a single transaction
Customers - Each row is information about a single customer
Products - Descriptions about the different products customers can buy, including product name, brand, store department, etc.
Stores - The stores that the parent company owns
Download the data from Ontologize's GitHub: github.com/ontologize/fake-gr...
0:00 Creating a pipeline
1:30 Reference pipeline
3:29 Adding input data
4:40 Cleaning transactions data
7:30 Aggregating data
9:00 Joining data
10:35 Nesting expressions
12:12 Completing the transactions pipeline
15:23 Changing column order
16:15 Editing upstream parts of the pipeline
17:34 AI Feature: auto-naming
17:56 Drag and drop pipeline nodes
18:39 Creating an output dataset
19:29 Saving, Builds, and Jobs
20:38 Saving vs. Deploying
23:20 Organizing with colors
24:16 Creating synthetic primary keys
26:46 Wrapping up the rest of the pipeline
28:59 Reusables: parameters aka variables
30:43 Reusables: functions
32:37 AI Features: auto-generating regex expressions
34:04 Version Control in Pipeline Builder
38:24 Pipeline organization
41:43 Course announcement

Пікірлер: 13

@johndoez6481 Ай бұрын

Hello professor Ontologize, Wondering when Pipeline Builder 102 starts? Thanks again for your work. 🇺🇸

@thiorofall8751 2 ай бұрын

Many thank you! Best tutorial for Data Pipeline!

@dkfresh6656 11 ай бұрын

So glad I found this channel. I recently started learning how to develop apps palantir and just learned about pipeline. Thankful you created this video. Not alot of resources out there! looking forward to more videos related to palantir app development!

@ontologize 11 ай бұрын

Thanks! I'm glad you found it useful. Feel free to make specific requests for the public tutorial series here: www.ontologize.com/contact

@GG-uz8us 3 ай бұрын

Super helpful!!! Thanks for the effort!

@user-iw5th5dj6k 10 ай бұрын

Very helpful tutorial !!! Looking for more on code repositories !!

@Heyvyner 10 ай бұрын

Thnaks for sharing! Really informative

@ontologize 9 ай бұрын

Glad it was helpful!

@nafiulhaque5585 9 ай бұрын

Very good clean and clear instructions. I would love to learn from your self-paced course. I want to know how to create a new column with an 'if statement', and I want to know more about 'Join', and more transformations in at least 1 end to end project, that data from the pipeline I could use for building a dashboard in Tableau. How can I get access to that? Appreciate your response. Thanks

@mattreed656 Жыл бұрын

Great tutorial. Clear, concise and great production quality. How long would this work have taken you if you didn't have access to a platform like Palantir and tool like Pipeline Builder? Thanks!

@ontologize Жыл бұрын

Frankly, I was skeptical when Pipeline Builder first came out because the history of no-code tools for things like data pipelining is, at best, mixed. However, I've been very pleasantly surprised. It gives you the speed of writing SQL with the expressiveness of writing Python (within limits -- e.g. can't manipulate raw files unless you modify dataset schemas). All that said, the real speed of any particular tool in Foundry is more "organizational" than "individual". Meaning, some tools are faster for my particular workflows than external tools; some are slower. But getting something into production and maintaining it when it's there is almost always way, way faster with Foundry.

@mattreed656 Жыл бұрын

@@ontologize very cool. Thanks for the detailed response. Looking forward to future videos. I don’t use Foundry where I work but hope to in the future.