Example code on running Confluent Platform
Mirror of Apache Kudu
Hive UDF's for the data warehouse
Kafka Consumer Lag Checking
Dashboard UI for Kafka Cluster monitoring topics and offsets with Burrow 1.0 ( /v3/kafka API )
Clemens Valiente's blog
Dataset for OggDude's character generator
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.