Adventures with Apache Flink and Delta Lake

This post originally appeared on the Decodable blog. Delta Lake (or Delta, as it’s often shortened to) is an open-source project from the Linux …

Using Delta from pySpark - java.lang.ClassNotFoundException: delta.DefaultSource

No great insights in this post, just something for folk who Google this error after me and don’t want to waste three hours chasing their tails… …

Data Engineering in 2022: Storage and Access

In this article I look at where we store our analytical data, how we organise it, and how we enable access to it. I’m considering here potentially …