Keynote - Data Mesh by Zhamak Dehghani

  Рет қаралды 50,982

Thoughtworks

Thoughtworks

3 жыл бұрын

This talk will cover an introduction to Data Mesh and the motivations behind it - the failure modes of past paradigms of big data management. Zhamak will compare and contrast data mesh to existing big data management approaches, and introduce the technical components underpinning the architecture
Related materials mentioned by Zhamak in the session:
Original write up on data mesh martinfowler.com/articles/dat...
InfoQ architecture trends of 2020 www.infoq.com/articles/archit...
Martins original write up on data lake martinfowler.com/bliki/DataLa...
www.thoughtworks.com/

Пікірлер: 31
@allampallisandeep7355
@allampallisandeep7355 3 жыл бұрын
Game changer. In reality, people do build data products as a result of that friction in the centralized approach . They just don’t call it that way. However, I loved the phrases interoperable and trustworthy. There needs to be a well defined pattern around bringing consistency to inter-dataproduct interactions.
@sonynavi
@sonynavi 2 жыл бұрын
Absolutely, crisp and clear explaination by just bringing on the abstract of the concept. My perspective, though this is going to be a paradigm shift from lakehouse to data mesh, what is absolutely requried is to understand the traps which the Arch. Strategy team should be careful about.
@Lestca1
@Lestca1 2 жыл бұрын
Fantastic! Working for a steel company that is just dipping its toes into the Big Data platform field, aspects of optimzing resources with respect to data engineering et al. are paramount moving forward.
@kamesh7818
@kamesh7818 3 жыл бұрын
Very useful and thought provoking, thank you for sharing.
@ericstridinger8701
@ericstridinger8701 Жыл бұрын
Great job Zhamak! Was very good information regarding data mesh. Easy to understand and good ideas.
@cbesriram
@cbesriram Жыл бұрын
Fantastic, thought provoking and great guidance.
@jasonbperkins
@jasonbperkins 3 жыл бұрын
Thought provoking session on distributed analytics architecture. Thanks
@sheosamh
@sheosamh 3 жыл бұрын
Absolutely agree!
@somilgupta3004
@somilgupta3004 3 жыл бұрын
Very interesting presentation.. the only risk I foresee is that when data products consume each other instead of from a centralized lake, the errors and bugs will spread out into the network virtually unihibited. In centralized data pipelines, it is still possible to just purge and recreate the data product from immutable logs. But in this case, it will be extremely difficult to prevent the spread and later correct. If one data product goes down, it will impact the whole network. This is a problem with microservices architecture and this will be a problem in Data Mesh as well.
@aurchwd
@aurchwd 2 жыл бұрын
Absolutely true , in lala land this might work but not in real world. when we propose any architecture something we should always think of how seamless can we operationalize it not how can we build it , not just that u have couple it with resourceing challanges .
@desavera
@desavera 3 жыл бұрын
Low coupling and high cohesion ... again :) Very nice presentation !!!
@fabiodesalles2732
@fabiodesalles2732 2 жыл бұрын
Lots of those problems have already been solved a decade or more ago. Most data project failures today (which she did not mention but Gartner ranges at around 70-80% percente) do not stem from lack of technology.
@nfuryboss
@nfuryboss 2 жыл бұрын
data mesh ~ data analog of "microservices" architecture for a mesh of services
@bradk7462
@bradk7462 2 жыл бұрын
Data mesh + Better Value Sooner Safer Happier for the win :-)
@IssamHijazi
@IssamHijazi 3 жыл бұрын
Love this! Hello to the new future of data management :)
@ozlemakpinarsel2674
@ozlemakpinarsel2674 2 жыл бұрын
Hey I'm watching this now and seeing your name :) Small world, how are you?
@IssamHijazi
@IssamHijazi 2 жыл бұрын
@@ozlemakpinarsel2674 hehe small world indeed! All good here, hope you are doing great as well
@andregomesdasilva
@andregomesdasilva 3 жыл бұрын
Very nice I just think it would be great if there were examples of how all of this connects to real world products. I mean, data products are commonly only legacy business products like a ERP or CRM, or MES system. What are the changes that should be made in those products so they can adapt to the domain perspective? How it would impact the current user journey? When I consider this I either conclude that we still have lots of big gaps to cover or I conclude that I didn't understand the concept
@debashisghosh4908
@debashisghosh4908 3 жыл бұрын
Me too in the same wavelength - I believe it started with new concept but ended with standard data governance process. The speaker should include examples how data product roles makes agile the modern data architecture governance .
@vchandm23
@vchandm23 2 жыл бұрын
How i can think of is if you have a client_id in one of the Data Product /Pod team revolving around Claims, the same client_id can have a Policy but Policy will be owned by another Data Pod/Product team. Now technically speaking, pretend you have table_policy in one workspace and table_claim owned by another team in another workspace, then you have to join them when you want to consume for analytics. This is one weave of the data between pods. This is how am envisioning it, more thoughts or ideas better than me would help me as well to learn. I may be wrong here but this how they are trying to move the ownership and splitting the laod now across different Data Pod teams. Then you simply join those information for your analytics when needed in reports. The technical concepts remains same but the accountability and responsibility are drafted to teams. This is how Mesh is created. Straigh away i can see this will create more dependency and costly to implment as well. Just like the front-end teams. I have a feel this will be too expensive to implement that is the only negative i see. Pretend you have 25 applications in an organization. How are you doing to split the ownership of those to the Data Pods ? I have those questions in my mind. Very interesting approach and am also curious as you to work in that space now and learn from it. It is a mindset and journey to embark on. A new paradigm hope this doesnt become a recruiter marketing gimmic. Another problem this can create a hugh dependency between networks of Data Pods. See this is not like K8s like you can bring the app in a flash, this is data with volumns which means it is massive in nature, hence if one Pod has issues then there will be ripple effect on other pods how are conusming that. So this is creating new problems in engineering. Trying to solve problem by creating a new one clearly not a good architecture imho. Sometime sticking to basics and planning properly solves the problem in simple.
@alirezaamedeo
@alirezaamedeo 2 жыл бұрын
Persians are really smart yet friendly and nice. Zhamak is not an exception obviously.
@anuthayaparan5211
@anuthayaparan5211 3 жыл бұрын
Excellent presentation! I would definitely agree that maintaining a single data lake at scale for a large organization is definitely a big challenge. I believe that some organizations built data lakes to avoid data duplication and also to build a “single source of truth”. How does this architecture address those problems ?
@AnkitAnchan666
@AnkitAnchan666 3 жыл бұрын
If you look into data lake architectures, there is no importance given to domain of the input sources or consumers that they're serving. This is why data lakes are basically "single source of truth" for all of the data in the organization irrespective of the domain that they belong to. Now if you logically break down these into data products to their respective domain, then each of these data products are infact "single source of truth" for that particular domain. So instead of having "single source of truth" for all domains, we have "single source of truth" for each of the domains in the form of data products.
@anuthayaparan5211
@anuthayaparan5211 3 жыл бұрын
@@AnkitAnchan666 Thanks. That makes sense.
@alauddinm
@alauddinm 3 жыл бұрын
Good presentation! How we will manage cross domain data in this data mesh approach?
@vchandm23
@vchandm23 2 жыл бұрын
@@AnkitAnchan666 - this we are already doing it in refined layer or Gold Layer. I see the perspective now. Just organizing team to use the lake and start building information wearing a data product mindset (Ownership and Accountable is solved). Again perspective and from where you are looking at things matters. Situation, scneario and people matters when making this choise. With my Data Enterprise architect hat. @Anuruthan Thayaparan - This is not an architecture change it is just a mindset and management of people carving them to a particular scope. It is like creating TRUSTED DATA PRODUCT TEAMS in GOLD/ REFINED/ TRUSTED LAYER in the data lake world.
@henniedenooijer4348
@henniedenooijer4348 Жыл бұрын
Star models are built for technologies? No, these are optimized around business processes and are more end user understandable. So what about the integration of business keys if you move the DWH to the source systems?
@sweeper240
@sweeper240 2 жыл бұрын
Complimentary to any warehouse lake, lake house, or mesh... or whatever new concept of data storage governance and security model is thought up in the future Our AI/ML effortlessly harmonizes and contextualizes all data across all silos even third-party or unstructured content like handwriting, images, etc If you want to achieve mesh at any scale, you'll want to talk to me.
@slipperyeel9206
@slipperyeel9206 Жыл бұрын
I don’t get it. Data mesh seems stupid to me
@eversruud
@eversruud Жыл бұрын
Data mesh? Another hype? We have gone from centralized to decentralized data. From centralized analytics to self service analytics? There is no silver bullet solution. Data mesh is mostly a cultural change and less technology. And no matter what solution. The weakest link in the multiple data product teams determines the success. Data Governance is a challenge and with cross domain products teams are dependable on each other. So what problem are we really trying to solve we are already not able to solve in monolytical data silos? It doesn’t matter if you ingest data, cleanse and create data product decentralized or centralized. We are still talking about creating data processes that need to be developed, released and maintained. Will it go faster? Will it add more quality? Will we be more flexible? Or are we creating chaos because we are not able to manage the multiple data product teams that will do what ever they want eventually because they don’t want to be dependable on other teams? So centralize where you have to and decentralize where not. Within 5 years from now we are talking about the Data Labyrinth because we are all lost in translation.
Data Mesh and Governance
1:29:08
Thoughtworks
Рет қаралды 9 М.
Introduction to Data Mesh - Zhamak Dehghani
34:52
Thoughtworks
Рет қаралды 86 М.
Survival skills: A great idea with duct tape #survival #lifehacks #camping
00:27
🤔Какой Орган самый длинный ? #shorts
00:42
DEFINITELY NOT HAPPENING ON MY WATCH! 😒
00:12
Laro Benz
Рет қаралды 39 МЛН
The HR would like to have a word with you.
5:17
AiyyoShraddha
Рет қаралды 1,3 МЛН
What Is Data Mesh, and How Does it Work? ft. Zhamak Dehghani
34:57
Introduction to Data Mesh with Zhamak Dehghani
1:05:31
Stanford Deep Data Research Center
Рет қаралды 30 М.
22 Inventions That Are Saving The Earth
7:02
Insider Tech
Рет қаралды 9 МЛН
Data Mesh and Domain Ownership
1:24:18
Thoughtworks
Рет қаралды 14 М.
I Spent 24 Hours Trapped in the Metaverse | WSJ
7:34
The Wall Street Journal
Рет қаралды 5 МЛН
Data Mesh Paradigm Shift in Data Platform Architecture - Arif Wider - DDD Europe 2020
47:53
Data Lakehouse, Data Mesh, and Data Fabric |  James Serra
1:04:04
Iteration Insights
Рет қаралды 18 М.
Easy Art with AR Drawing App - Step by step for Beginners
0:27
Melli Art School
Рет қаралды 11 МЛН
ОБСЛУЖИЛИ САМЫЙ ГРЯЗНЫЙ ПК
1:00
VA-PC
Рет қаралды 1,7 МЛН
Зачем ЭТО электрику? #секрет #прибор #энерголикбез
0:56
Александр Мальков
Рет қаралды 152 М.
НЕ ПОКУПАЙ СМАРТФОН, ПОКА НЕ УЗНАЕШЬ ЭТО! Не ошибись с выбором…
15:23
Samsung Galaxy Unpacked July 2024: Official Replay
1:8:53
Samsung
Рет қаралды 23 МЛН