Short Updates

Thoughts on DuckDB's DuckLake so far:

  1. As someone who is familiar with the backend implementations of Delta Lake and Iceberg, I find the simplicity of DuckLake admirable.
  2. Iceberg table import/export is coming. Compaction support seems to be in the extension.
  3. I would love to see integrations for DuckLake with AWS Athena (Trino/Presto) and Spark. Implementors, just vendor the DuckDB implementation and call into it via JNI.
  4. The use of table-level statistics is fantastic; join planning should perform well with them.
  5. Adding support for data sketches (such as Bloom filters) on columns seems possible too!
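
To make point 5 concrete, here is a rough sketch of how a per-file Bloom filter on a column could let a reader skip files during a point lookup. This is not DuckLake's API or catalog layout: `BloomFilter`, `files_to_scan`, and the file names are all hypothetical, and the filter sizes are arbitrary.

```python
import hashlib


class BloomFilter:
    """Tiny Bloom filter: no false negatives, some false positives."""

    def __init__(self, num_bits=1024, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, value):
        # Derive num_hashes bit positions by salting one hash function.
        data = repr(value).encode("utf-8")
        for seed in range(self.num_hashes):
            digest = hashlib.blake2b(
                data, digest_size=8, salt=seed.to_bytes(8, "little")
            ).digest()
            yield int.from_bytes(digest, "little") % self.num_bits

    def add(self, value):
        for pos in self._positions(value):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def might_contain(self, value):
        # False means "definitely absent"; True means "possibly present".
        return all(
            self.bits[pos // 8] & (1 << (pos % 8))
            for pos in self._positions(value)
        )


def files_to_scan(filters_by_file, value):
    """Return the files whose column filter cannot rule out `value`."""
    return [
        name for name, bloom in filters_by_file.items()
        if bloom.might_contain(value)
    ]


# Hypothetical data files and the values of one column stored in each.
column_values = {
    "part-0.parquet": ["alice", "bob"],
    "part-1.parquet": ["carol", "dave"],
}
filters_by_file = {}
for name, values in column_values.items():
    bloom = BloomFilter()
    for v in values:
        bloom.add(v)
    filters_by_file[name] = bloom

candidates = files_to_scan(filters_by_file, "carol")
```

Here `candidates` always includes `part-1.parquet`. It will usually exclude `part-0.parquet`, although a false positive can occasionally keep a file in the scan list. In a real catalog the filter bits would have to be stored alongside the existing column statistics, which is the part that is only assumed to be feasible here.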

Originally posted on LinkedIn.

#DuckDB #Delta Lake #Iceberg #Apache Iceberg #DuckLake #Lakehouse #Format Wars #Lakehouse Format Wars #Hudi #Parquet