What's the difference of Iceberg/Hudi/Delta?
I just list some comparison articles and talks in this page for now 🧐
Feature comparisons
- A Thorough Comparison of Delta Lake, Iceberg and Hudi by Junjie Chen, 2020
- Delta lake has best integration with Spark ecosystem and could be used out of box
- Apache Iceberg has great design and abstraction that enable more potentials
- Apache Hudi provide most conveniences for streaming process
- Comparison of Data Lake Table Formats (Apache Iceberg, Apache Hudi and Delta Lake) by Alex Merced, 2022
- Table Format Partitioning Comparison: Apache Iceberg, Apache Hudi, and Delta Lake by Alex Merced, 2022
- Data Lakehouse Table Format Comparison (Iceberg, Hudi and Delta Lake) - Subsurface Meetup by Alex Merced, 2022
- Open Table Formats — Delta, Iceberg & Hudi by Amit Singh Rathore, Jun 2022
- Apache Hudi vs Delta Lake vs Apache Iceberg - Lakehouse Feature Comparison by Kyle Weller, August 2022
- Hudi, Iceberg and Delta Lake: Data Lake Table Formats Compared from LakeFS, 2021
Performance comparisons
- Delta vs Iceberg vs hudi : Reassessing Performance from DataBeans, Jun 2022
Some articles written in chinese
- 深度对比delta、iceberg和hudi三大开源数据湖方案 from Apache Iceberg 技术社区, 2020