How Apache Iceberg Won The Open Table Wars
BigDATAwire, Tuesday, December 3rd, 2024
Apache Iceberg has recently emerged as the de facto open table format for large-scale datasets, with a thriving community and support from many of the leading data infrastructure vendors.
But why did Iceberg emerge as the preferred format? And what should you know before you wade in?
Iceberg is a high-performance table format that brings the reliability and simplicity of SQL tables to large-scale data analytics. Its ecosystem has grown rapidly, with robust tooling and support from engines like Apache Spark, Trino, and Apache Flink, as well as from vendors including Snowflake, Amazon, Dremio, and Confluent. Even Databricks is betting on Iceberg, having spent more than $1 billion to acquire Tabular, a startup co-founded by some of Iceberg's co-creators.