We regularly publish articles about all things data. Keep up to date with what we're working on!
Using PostgreSQL RLS with Hibernate Reactive
This guide explains how PostgreSQL Row-Level Security can be used to securely implement multitenant applications with Hibernate Reactive and the Quarkus framework.
DataCater 2023.2 is here
We are happy to announce the newest open-core release of DataCater, 2023.2, which introduces the Config resource and implements a lot of user feedback.
Change Data Capture 101: The complete guide
Everything you need to know to get started with applying change data capture to database systems and APIs.
Optimizing Apache Kafka® for High Throughput
This guide explores the most important consumer and producer properties of Apache Kafka for achieving high throughput.
How to mask data in Redpanda with Python and DataCater
This tutorial walks you through developing custom data masking functions in Python and applying them to Redpanda topics.
Three Use Cases for Getting Started With Apache Kafka
This article introduces three use cases for getting started with Apache Kafka: log analytics, change data capture, and data validation.
Using Python vs. SQL for Data Pipelines
Learn how to decide between Python and SQL for building data pipelines.
Introducing DataCater 2023.1: Your fast lane to Streaming ETL
Build better streaming data pipelines faster with our newest open-core release, 2023.1.
A Beginner’s Guide to User-Defined Functions in ksqlDB
Learn everything you need to know to get started with developing and using UDFs in ksqlDB.
Using Apache Kafka in development and test environments
Learn how to set up Apache Kafka for your development and test environments.
Using Change Data Capture with Google Cloud SQL for MySQL
Use DataCater for streaming change events from Google Cloud SQL for MySQL to your data sinks.
Data Streaming with Python
We compare three different tools for streaming data with Apache Kafka and Python: kafka-python, Faust, and DataCater.
Querying change data capture events with cloud data warehouses
Learn how to build consistent snapshots of CDC events that were captured from transactional database systems.
The core of DataCater is now source-available and free
We are announcing that the core of DataCater, the real-time ETL platform, becomes free and source-available.
How DataCater helps AX Semantics’ Clients with Data Enrichment
Learn how the streaming data platform DataCater enables the clients of AX Semantics to enrich their data in real time.
Capturing Changes in Real-Time from Google Cloud SQL PostgreSQL
Use DataCater for streaming change events from Google Cloud SQL for PostgreSQL to your data sinks.
5-Minute Introduction To Streaming Data Pipelines
Everything you need to know about the concepts of streaming data pipelines.
DataCater Partner Hour Recap
On May 17, Robert Bråkenhielm from Resultify showed us how they use DataCater in their projects.
Connecting Applications to Userlist with DataCater
Use DataCater's plug & play CDC connectors to connect your app with Userlist, in real time and without writing code.
How to extract data change events from HubSpot's CRM API
Learn how to apply change data capture for extracting data changes from HubSpot.
10 Useful Python Transforms for your Streaming Data Pipeline
Learn how to transform your data with a few lines of Python code.
DataCater introduces support for declarative data pipelines
Learn how you can declare streaming data pipelines in YAML.
Community Meetup #3: Streaming and Batching with DataCater and dbt
Check this recap for a short summary of the third DataCater community meetup.
Building Real-Time ETL Pipelines with Apache Kafka
Learn how to use Apache Kafka to implement streaming ETL.
Why Digital Agencies should think Data First
Digital agencies adapt to their customers' needs; that's in their DNA. But will following be enough, or do they need to anticipate?
Unlocking Streaming Data Pipelines on Google Cloud Platform
Learn how DataCater runs streaming data pipelines on GCP.
DevOps and DataOps
The ultimate goal of DataOps is to reduce the time needed for developing and deploying data pipelines.
Declarative Data Pipelines
Declarative data pipelines allow for more reliable, resilient, and reproducible deployments and faster iterations in development.
Sidecars: Observability for Cloud-Native Data Pipelines
Learn how to unlock non-intrusive observability for cloud-native data pipelines.
The Data Literacy Guide
Get a complete introduction to data literacy.
Data Pipeline Runtime Consistency with Containers
This article applies the principles of container-based application design to building and deploying data pipelines in the cloud era.
Cloud-Native Data Pipelines
Accelerate your data development by adopting cloud-native principles.
Recap of our Community Meetup #2: Streaming Spatial Data
On November 30, we ran our second community meetup with a guest talk on streaming spatial data.
Unlocking Data Silos of Legacy Applications
This article shows how to apply change data capture to unlock data silos of legacy applications, without changing their code.
Recap of our first Community Meetup
On September 28, we ran our first community meetup with more than 25 participants. Here is our recap.
How to use Change Data Capture (CDC) with Elasticsearch
Learn how to extract changes from Elasticsearch in real time.
Data Pipelines in Content Automation
What is content automation and how does it benefit from data pipelines?
PostgreSQL Change Data Capture (CDC): The Complete Guide
This guide helps you get started using CDC with the PostgreSQL database system.
MySQL Change Data Capture (CDC): The Complete Guide
Everything you need to know to get started using change data capture with MySQL.
Why we run Data Pipelines as Containers
Five reasons why we deploy data pipelines as containers: ease of integration, security, scalability, immutability, and robustness.
Overcoming the Hurdles in Data Democratization
Data expert Wouter Neef from Data Booster describes five common challenges in data democratization and how to overcome them.
How to use Change Data Capture with Web APIs
Improve the efficiency and freshness of your data processing by extracting change events from Web APIs instead of performing bulk loads.
Let's make event streaming a commodity
Build the rock-solid foundation for your next-generation real-time business intelligence.
Troubleshooting The Performance of Streaming Data Pipelines
Get to know two essential performance indicators: pipeline lag and sink connector lag.
Introducing Projects to DataCater
DataCater introduces projects as a collaborative means for data teams to prepare and integrate data collectively.
Meetup: Say Goodbye To Serving Outdated Content
Learn how to keep data and content up to date without manual effort and completely automate your content production.
Batch vs. Streaming Data Pipelines
A comparison between event-based streaming data pipelines and their batch-based counterparts.
Under the Hood of DataCater
An introduction to the building blocks that make up DataCater, the platform for continuous data preparation.
Everything you need to know about Change Data Capture
Learn how to turn data stores into streams of change events.