Areas of collaboration

The following diagram provides an overview of the areas that are being addressed on an ongoing basis. If you are interested, feel free to reach out. For now, most collaborations are limited open source until we’re certain all IP has been sorted out properly.

The intent of sharing this code is to foster increased meritocracy in the BI/DWH community and, more generally, to work together on components that can be combined using agreed APIs. The idea is that various people and teams can pursue their passion, knowing that their work will fit somewhere in the overall scope. This overview may change drastically over time, as will the composition and scope of the projects, but that’s the nature of the work.

ETL Control Framework (DIRECT)

The Data Integration Run-time Execution Control Tool (DIRECT) is a generic execution and control framework that orchestrates the execution of ETL processes. It provides various hooks into an ETL process to manage topics such as restartability, recovery from failure, logging, ETL classification and event handling.
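
To make the idea of hooks a little more concrete, here is a minimal sketch of a control wrapper around an ETL process. It is not DIRECT's actual API or data model: `RunLog`, `run_with_control` and the status values are hypothetical names made up for this illustration, and an in-memory object stands in for whatever logging tables a real framework would use.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class RunLog:
    """In-memory stand-in for a control framework's logging tables (hypothetical)."""
    status: Dict[str, str] = field(default_factory=dict)  # process name -> last status
    events: List[str] = field(default_factory=list)       # ordered event log


def run_with_control(name: str, process: Callable[[], None], log: RunLog) -> str:
    """Wrap an ETL process with start/end logging and a simple restart rule."""
    # Restartability: a process that already succeeded is skipped on a rerun.
    if log.status.get(name) == "Succeeded":
        log.events.append(f"{name}: skipped")
        return "Skipped"

    log.events.append(f"{name}: started")
    try:
        process()
    except Exception as exc:
        # Record the failure so that a rerun picks this process up again.
        log.status[name] = "Failed"
        log.events.append(f"{name}: failed ({exc})")
        return "Failed"

    log.status[name] = "Succeeded"
    log.events.append(f"{name}: succeeded")
    return "Succeeded"


# Usage: a process that fails on its first attempt and succeeds on the rerun.
log = RunLog()
attempts = {"count": 0}


def load_customer() -> None:
    attempts["count"] += 1
    if attempts["count"] == 1:
        raise RuntimeError("source unavailable")


first = run_with_control("load_customer", load_customer, log)   # fails
second = run_with_control("load_customer", load_customer, log)  # rerun succeeds
third = run_with_control("load_customer", load_customer, log)   # already done, skipped
```

The point of the sketch is that the ETL process itself stays unaware of logging and restart rules; the wrapper owns them, which is what allows one control framework to be reused across projects.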

There are many ETL control frameworks, as they are needed in every project. Let’s make this the best one! Ideally this becomes a commodity.

  • The datamodel and sample code can be found here:
  • Documentation for the ETL Control Framework can be found here; this is a generic process control framework (happy reading!)
  • The DIRECT code and content are managed via GitHub here

Metadata Management (TEAM)

The Taxonomy for ETL Automation Metadata (TEAM) is a management tool for Data Vault metadata, and is also integrated as a component in the VEDW software. It offers metadata mapping validation, data entry and visualisation. The metadata within TEAM is used to generate ETL (e.g. as Biml or SQL) through the interface / APIs.
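
As a rough illustration of metadata-driven generation, the sketch below renders a SQL statement from a single source-to-target mapping. The `HubMapping` shape, the `generate_hub_insert` function and the table and column names are assumptions invented for this example; they do not reflect TEAM's actual metadata model or interface, and a real Data Vault hub load would also handle hash keys, load dates and record sources.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class HubMapping:
    """One source-to-target mapping row (hypothetical shape, not TEAM's schema)."""
    source_table: str
    target_hub: str
    business_key: str


def generate_hub_insert(m: HubMapping) -> str:
    """Render an INSERT that loads business keys not yet present in the hub."""
    return (
        f"INSERT INTO {m.target_hub} ({m.business_key})\n"
        f"SELECT DISTINCT src.{m.business_key}\n"
        f"FROM {m.source_table} src\n"
        f"LEFT JOIN {m.target_hub} hub ON hub.{m.business_key} = src.{m.business_key}\n"
        f"WHERE hub.{m.business_key} IS NULL;"
    )


# Usage: the same template serves every mapping row in the metadata.
mapping = HubMapping("STG_CUSTOMER", "HUB_CUSTOMER", "CUSTOMER_ID")
sql = generate_hub_insert(mapping)
print(sql)
```

Because the statement is derived entirely from the mapping, correcting the metadata and regenerating replaces hand-editing each ETL process.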

Virtual Enterprise Data Warehouse (VEDW)

VEDW is the virtualisation and rapid prototyping software for Data Vault that can be downloaded from this site. More information is available here.