Interesting links - December 2025

Published by in Interesting Links at https://rmoff.net/2025/12/16/interesting-links-december-2025/

Well it’s that time of year already! Whilst munching on a mince pie, enjoy the final Interesting Links for 2025.

It’s been a busy twelve months for me; this time last year I was signing off from my last company, which went on to be acquired—and last week I found out that my current company (Confluent) is to be acquired by IBM. Despite my reaction against any kind of cheese moving, I figure this is going to be an interesting development and a whole new experience for me :)

Just one blog post of my own to share from this month—a write-up of some investigation that I did using Neo4j and graph analysis to identify astroturfing on Reddit. It turns out that there are marketing agencies out there who think it’s a good idea to spoil things for everyone else by offering astroturfing-as-a-service to at least two vendors in this space who paid them for it 🙄.

My previous employer has kindly allowed me to host my previous blog posts here on rmoff.net, which I’m delighted about. If you’ve not seen them already, here are some of the highlights:


And so…on with the interesting links!

Not got time for all this? I’ve marked 🔥 for my top reads of the month :)

Tip
Medium posts often skulk behind a gate, so I’ve hyperlinked to the Freedium version and included a link to the original using a ⓜ️ icon should you prefer to visit that (or if freedium goes offline).

Kafka and Event Streaming 🔗

  • Aviv Dozorets published a new tool, klag, billed as a replacement for the deprecated Kafka Lag Exporter.

  • 🔥 Sandon Jacobs has an excellent lightboard explainer of the new Queues for Kafka introduced with KIP-932.

  • Jepsen’s test reports are always interesting to read, including this recent one in which he uncovers issues in NATS JetStream.

  • I mean, if a Kafka alternative isn’t written in Rust these days, is it even worth writing? Snark aside, walrus claims higher performance than Kafka, although it isn’t API compatible.

  • WarpStream have always published good blog posts, and this one from Maud Gautier continues the trend, with technical details of how they added support for Protobuf with Schema Registry.

  • This post from Yifeng Liu gives some practical advice on how to architect Kafka topics, specifically with regards to duplication (which is sometimes totally OK, the author argues).

  • Platformatic’s Node.js client has had a 223% speed boost—Paolo Insogna describes how.

Stream Processing 🔗

Analytics 🔗

Data Platforms, Architectures, and Modelling 🔗

Data Engineering, Pipelines, and CDC 🔗

Open Table Formats (OTF), Catalogs, Lakehouses etc. 🔗

RDBMS 🔗

General Data Stuff 🔗

AI 🔗

I warned you previously…this AI stuff is here to stay, and it’d be short-sighted to think otherwise. As I read and learn more about it, I’m going to share interesting links (the clue is in the blog post title) that I find—whilst trying to avoid the breathless hype and slop.

Note

A request to you: Are there any good blog posts out there documenting how companies are actually implementing their user-facing AI features?

For example, Strava has the awfully-named "Athlete Intelligence" (AI - geddit?!) - but I would love to see how it’s built.

It feels like there’s a chasm between "ooooh you can build this" from the vendors (hi!) and the reality of actually building with it. Perhaps that’s always the case, but for hype stuff it’s even more valuable to hear [unfiltered] stories of how people really build with it.

And finally… 🔗

Nothing to do with data, but stuff that I’ve found interesting or has made me smile.

Fun 🔗

Write 🔗

Listen 🔗


Note

TABLE OF CONTENTS