Why You Shouldn’t Care About Iceberg | Tabular

  Рет қаралды 12,485

Data Council

2 жыл бұрын

Slides: www.datacouncil.ai/talks/why-you-shouldnt-care-about-iceberg
ABOUT THE TALK:
Ryan Blue, co-creator of the Apache Iceberg project will try to convince you not to care about Iceberg: if you’re thinking about your table format, then it isn’t doing a good enough job.
This session will show how Iceberg solves real-world problems that used to take hours or days of time from data engineers and analysts:
Safe schema changes - no more zombie data columns
Layout evolution - update table partitioning without rewriting any queries
Hidden partitioning - safe and fast queries without being a DBA
Future work - current frustrations and how we’re making them disappear
ABOUT THE SPEAKER:
Ryan is the co-creator of Apache Iceberg and spent the last decade working on big data formats and infrastructure at Netflix, Cloudera, and now Tabular. He is an ASF member and a committer in the Apache Parquet, Avro, and Spark communities.
ABOUT DATA COUNCIL:
Data Council (www.datacouncil.ai/) is a community and conference series that provides data professionals with the learning and networking opportunities they need to grow their careers. Make sure to subscribe to our channel for more videos, including DC_THURS, our series of live online interviews with leading data professionals from top open source projects and startups.
FOLLOW DATA COUNCIL:
Twitter: DataCouncilAI
LinkedIn: www.linkedin.com/company/datacouncil-ai/

Пікірлер: 4
@npestrov
@npestrov Жыл бұрын
A really great talk! Looking forward to filling the box between storage and compute layers over the next few years.
@FlavioPompermaier
@FlavioPompermaier 10 ай бұрын
I went through all the problems you mention.. When I first started using Hadoop / Spark / Flink I was very frustrating about all the very low level aspects you were required to master before being able to read or write any single piece of data I had the feeling of being the only one asking for data portabiliy and security. Having a common format for describing input/otput data (and metadata as well) is the fundamental point of any big or small data solution
@joshreji7510
@joshreji7510 2 жыл бұрын
Great talk sir!!
@nosh3019
@nosh3019 Жыл бұрын
Nice talk, thanks.
Универ. 13 лет спустя - ВСЕ СЕРИИ ПОДРЯД
9:07:11
Комедии 2023
Рет қаралды 6 МЛН
Cadiz smart lock official account unlocks the aesthetics of returning home
0:30