Loading delimited data into Kafka - quick & dirty (but effective)

Whilst Apache Kafka is an event streaming platform designed for, well, streams of events, it’s perfectly valid to use it as a store of data which …

Performing a GROUP BY on data in bash

One of the fun things about working with data over the years is learning how to use the tools of the day—but also learning to fall back on the tools …

Replacing UTF8 non-breaking-space with bash/sed on the Mac

A script I’d batch-run on my Markdown files had inserted a UTF-8 non-breaking-space between Markdown heading indicator and the text, which meant that …

Kafka Connect CLI tricks

I do lots of work with Kafka Connect, almost entirely in Distributed mode—even just with 1 node -> makes scaling out much easier when/if needed. …

Simple export/import of Data Sources in Grafana

Grafana API Reference Export all Grafana data sources to data_sources folder mkdir -p data_sources && curl -s …

Streaming data to InfluxDB from any bash command

InfluxDB is a great time series database, that’s recently been rebranded as part of the “TICK” stack, including data collectors, …