Checkpoint Chronicle - March 2024

Published by in Checkpoint Chronicle at https://rmoff.net/2024/03/22/checkpoint-chronicle-march-2024/

Note
This post originally appeared on the Decodable blog.

Welcome to the Checkpoint Chronicle, a monthly roundup of interesting stuff in the data and streaming space. Your hosts and esteemed curators of said content are Gunnar Morling and Robin Moffatt (your editor-in-chief for this edition). Feel free to send our way any choice nuggets that you think we should feature in future editions.

Stream Processing, Streaming SQL, and Streaming Databases đź”—

Event Streaming đź”—

Change Data Capture đź”—

Data Platforms and Architecture đź”—

Data Ecosystem đź”—

  • Two good articles about real-world database administration, with GitHub.com’s migration from MySQL 5.7 to 8.0 , and the UK Government’s Digital Service’s migration of their PostgreSQL database within AWS.

  • The ecosystem around open-table formats (OTF) continues to be an interesting one, with the project formerly known as OneTable now becoming Apache XTable (incubating). This move to an ASF project I think is really important as it defines a well-known and respected governance model. It will still be interesting to see how OTFs play out in the coming months and years—will we end up with one format simply winning outright, or will the cross-format support that XTable (and to a slight extent Databricks’ Uniform ) provide give the best of all worlds? Personally, I’m sceptical; I think the formats will slug it out and in the end, one or two will survive—perhaps for different use cases—and the others will either pivot or die.

  • LinkedIn has quite the pedigree for open-source projects (Apache Kafka, anyone?) and this newly open-sourced one looks interesting. OpenHouse is described as a control plane for managed tables, and includes a catalog and services for managing Iceberg tables .

  • I always enjoy reading my former colleague Jack Vanlightly’s writing, and this recent article about Confluent’s new capability called Tableflow—which will support writing from Apache Kafka into Apache Iceberg —is no exception.

Events & Call for Papers (CfP) đź”—

New Releases đź”—

A bumper batch of new releases this month:

Plus a couple of useful-looking projects that I noticed:

That’s all for this month! We hope you’ve enjoyed the newsletter and would love to hear about any feedback or suggestions you’ve got.

Gunnar ( LinkedIn / X / Mastodon / Email )
Robin ( LinkedIn / X / Mastodon / Email )


TABLE OF CONTENTS