Joining tables in the Persistent Staging Area
Joining tables in the Persistent Staging Area (PSA) can be a practical solution that avoids downstream complexity. This post explains the pattern for doing so.
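A minimal sketch of what such a join can look like, assuming a simple PSA layout where each table persists its full change history per business key with a load timestamp (the table names, keys, and data here are illustrative, not from the post itself):

```python
from datetime import datetime

# Hypothetical PSA contents: the full change history per source table,
# keyed by business key with a load timestamp per recorded change.
psa_customer = [
    {"customer_id": 1, "load_ts": datetime(2024, 1, 1), "name": "Ann"},
    {"customer_id": 1, "load_ts": datetime(2024, 3, 1), "name": "Anne"},
]
psa_address = [
    {"customer_id": 1, "load_ts": datetime(2024, 2, 1), "city": "Berlin"},
]

def as_of(history, key_field, key, moment):
    """Return the most recent record for a key as of a point in time."""
    rows = [r for r in history
            if r[key_field] == key and r["load_ts"] <= moment]
    return max(rows, key=lambda r: r["load_ts"], default=None)

def join_as_of(key, moment):
    """Join the two PSA histories for one key at one point in time."""
    cust = as_of(psa_customer, "customer_id", key, moment)
    addr = as_of(psa_address, "customer_id", key, moment)
    return {**(cust or {}), **(addr or {})}

# As of mid-February, the March name change is not yet visible.
print(join_as_of(1, datetime(2024, 2, 15)))
```

Resolving each table to a point in time before joining is what keeps the complexity out of the downstream layers: the consumer sees one consistent row instead of having to reconcile two independent timelines.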
Recording a deleted flag is essential to delivering the correct data, and this post explains why.
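A minimal sketch of one common way to derive such a flag, assuming full snapshots are received and compared against the keys the Persistent Staging Area already knows (the key values and field names are illustrative):

```python
# Keys the PSA has seen so far, and keys in today's full source snapshot.
psa_keys = {"A", "B", "C"}
snapshot_keys = {"A", "C", "D"}

deleted = psa_keys - snapshot_keys  # in the PSA, gone from the source
new = snapshot_keys - psa_keys      # never seen before

# Record a logical delete instead of physically removing history, so
# downstream delivery can still show that the record once existed.
changes = [{"key": k, "change_type": "delete", "deleted_flag": True}
           for k in sorted(deleted)]
changes += [{"key": k, "change_type": "insert", "deleted_flag": False}
            for k in sorted(new)]
print(changes)
```

Without the flag, a key that disappears from the source would either linger forever in the delivery or vanish along with its history; the logical delete keeps both the history and the current truth intact.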
An overview of the end-to-end process video demonstrating Confluent Kafka pub/sub, as available on YouTube.
Example code for creating a Confluent Kafka consumer using the .NET libraries.
A new set of improvements has been committed to the data logistics control (‘ETL process control’) framework on GitHub. This framework, referred to as DIRECT (thanks to acronimify.com), takes the spot of the Data Logistics Process Control in the engine metaphor for flexible data platform management. In case you were wondering, it’s the Data Integration Run-time Execution Control Tool! This is one of my favourite components, as it’s so simple yet so hard to truly get...
The engine represents an ecosystem of data warehouse automation tooling and ideas, making it easier to shape data into the desired delivery formats.
The repository that contains work on the generic Data Warehouse Automation interface has been rebuilt.
A Persistent Staging Area is often associated with a database, but this is not the way it should be thought of. This post covers alternative ways of thinking about what a PSA can be.
How to capture date/time information received from different international locations, across different time zones, is a challenge that comes up from time to time. Recently I was involved in some conversations about this again, which prompted me to capture this once and for all and share it here. As outlined in the pattern for Data Mart delivery, you should be able to deliver information according to the timeline that the...
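One way this challenge is commonly handled, sketched here under assumed naming (the record fields and the Brisbane example are illustrative, not from the post): store the event time in UTC alongside the original local time, its offset, and the source time zone, so any delivery timeline can be reconstructed later.

```python
from datetime import datetime, timezone
from zoneinfo import ZoneInfo  # standard library from Python 3.9 onward

# An event arriving from a source system in Brisbane (UTC+10, no DST).
local_event = datetime(2024, 6, 10, 9, 30,
                       tzinfo=ZoneInfo("Australia/Brisbane"))

# Persist the unambiguous UTC instant plus everything needed to
# rebuild the original local view of that instant.
record = {
    "event_time_utc": local_event.astimezone(timezone.utc),
    "event_time_local": local_event.replace(tzinfo=None),
    "utc_offset": local_event.utcoffset(),
    "source_time_zone": "Australia/Brisbane",
}
print(record["event_time_utc"].isoformat())
```

Keeping the zone name (not just the offset) matters for zones with daylight saving: the offset alone cannot tell you which local timeline the event belongs to.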
by Ravos · Published January 4, 2018 · Last modified May 13, 2019
What value do we get from having an intermediate hyper-normalised layer? Let me start by stating that a Data Warehouse is a necessary evil at the best of times. In the ideal world, there would be no need for it, as optimal governance and near real-time multidirectional data harmonisation would have created an environment where it is easy to retrieve information without any ambiguity across systems (including its history of changes). Ideally, we would not...
Data Vault Meetup - Germany (June 10, 2024)