Test smarter not harder: add the right tests to your dbt project
The Analytics Development Lifecycle (ADLC) is a workflow for improving data maturity and velocity. Testing is a key phase here. Many dbt developers tend to focus on primary keys and source freshness. We think there is a more holistic and in-depth path to tread. Testing is a key piece of the ADLC, and it should drive data quality.
In this blog, we’ll walk through a plan to define data quality. This will look like:
- identifying data hygiene issues
- identifying business-focused anomaly issues
- identifying stats-focused anomaly issues
Once we have defined data quality, we’ll move on to prioritize those concerns. We will:
- think through each concern in terms of the breadth of impact
- decide if each concern should be at error or warning severity