Integrating Iceberg REST Catalog Specification with Spark and Trino

564 views

Upsolver

5 months ago

Watch additional Chill Data Summit 2024 recordings at www.upsolver.com/resources/ev...
Description
Jack from Amazon EMR and Athena shares his expertise on integrating the Iceberg REST Catalog Specification with Spark and Trino. An Iceberg PMC member and member of the Iceberg community, Jack delves into the details of how his team manages table formats and storage services.
Key Topics Covered:
- Introduction to REST Catalog Specification: Jack explains the differences between Glue Data Catalog and REST Catalog, highlighting the unique needs of different customer types.
- Customer Use Cases: Learn about various customer scenarios where REST Catalog Specification is preferred, including third-party vendors and in-house data catalog solutions.
- Internal Experimentation at Amazon: Discover how Amazon experimented with building an internal data catalog service, leading to performance improvements and optimization research.
- Performance Improvements: Jack showcases intelligent scan planning techniques, reducing scan times from minutes to seconds, and the implementation of a scan API to enhance performance.
- API Enhancements: Explore the introduction of new APIs for better scan and commit operations, promoting faster and more efficient data handling.
- Future Prospects: Jack discusses potential enhancements and optimization directions for Iceberg, driven by practical customer feedback and production use cases.
Tags: #AmazonEMR #Athena #Spark #Trino #Iceberg #DataIntegration #RESTCatalog #PerformanceImprovements #APIs #BigData
