Posts
All the articles I've posted.
- 4 MIN READ•Aug 12, 2025
Avoiding Metadata Bloat with Snapshot Expiration and Rewriting Manifests
Learn how to prevent and clean up metadata bloat in Apache Iceberg by expiring snapshots and rewriting manifests for better performance and manageability.
Apache Iceberg, Metadata Optimization, Snapshot Expiration
- 4 MIN READ•Aug 5, 2025
Smarter Data Layout – Sorting and Clustering Iceberg Tables
Improve query performance in Apache Iceberg by organizing your data layout with sorting and Z-order clustering. Learn how to reduce scan cost and improve filter effectiveness.
Apache Iceberg, Clustering, Sorting
- 3 MIN READ•Jul 29, 2025
Optimizing Compaction for Streaming Workloads in Apache Iceberg
Learn how to design fast, incremental compaction strategies in Apache Iceberg to support high-throughput streaming pipelines without disrupting freshness or performance.
Apache Iceberg, Data Optimization, Streaming