rmoff's random ramblings
about talks

Delta Lake

Apr 5, 2023
Apr 5, 2023

Using Delta from pySpark - java.lang.ClassNotFoundException: delta.DefaultSource

No great insights in this post, just something for folk who Google this error after me and don’t want to waste three hours chasing their tails… 😄

Sep 14, 2022
Sep 14, 2022

Data Engineering in 2022: Storage and Access

In this article I look at where we store our analytical data, how we organise it, and how we enable access to it. I’m considering here potentially large volumes of data for access throughout an organisation. I’m not looking at data stores that are used for specific purposes (caches, low-latency analytics, graph etc).

The article is part of a series in which I explore the world of data engineering in 2022 and how it has changed from when I started my career in data warehousing 20+ years ago. Read the introduction for more context and background.


Robin Moffatt

Robin Moffatt works on the DevRel team at Confluent. He likes writing about himself in the third person, eating good breakfasts, and drinking good beer.

Story logo

© 2025