Skip to main content

Modeling your data with dbt

dbt enables analytics engineers to transform data in their warehouses by simply writing select statements. Snowplow has written and maintain a number of dbt packages to model your snowplow data for various purposes and produce derived tables for use in analytics, AI, ML, BI, or reverse ETL tools.

Using Snowplow's dbt packages means you can draw insight and value from your data quicker, easier, and cheaper than building your own modeling from scratch.

To setup dbt, Snowplow Community Edition users can start with the dbt User Guide and then we have prepared some introduction videos for working with the Snowplow dbt packages.

For Snowplow BDP customers, dbt projects can be configured and scheduled in the console meaning you can get started running dbt models alongside your Snowplow pipelines.

Snowplow dbt Packages

info

Our dbt packages are available under a mix of licenses. For more information about how to get access to these packages, please contact us by requesting a demo if you are new to Snowplow or by reaching out to our support team.

Our dbt packages come with powerful built-in features such as an optimization to the incremental materialization to save you cost on warehouse compute resources compared to the standard method, a custom incremental logic to ensure we process just the required data for each run and keep your models in sync, plus the ability to build your own custom models using both of these!

There are 4 core snowplow dbt packages:

We also have 2 utility packages:

  • Snowplow Normalize package that makes it easy for you to build models that transform your events data into a different structure that may be better suited for downstream consumers
  • Snowplow Utils contains all our shared macros and features used across our packages

There are also 3 legacy dbt packages for web, mobile (superseded by unified) and fractribution (superseeded by attribution):

Each package comes with a set of standard models to take your Snowplow tracker data and produce tables aggregated to different levels, or to perform analysis for you. You can also add your own models on top, see the page on custom modules for more information on how to do this.

The supported data warehouses per version can be seen below:

snowplow-unified versiondbt versionsBigQueryDatabricksRedshiftSnowflakePostgresSpark
0.5.0>=1.6.0 to <2.0.0
0.4.5>=1.6.0 to <2.0.0

dbt Version Compatibility Checker

You may be using other dbt packages that require you to use a specific version of dbt, or a specific version of the dbt_utils package. Or you simply may not wish to upgrade your existing setup.

Here’s a way to check the latest version of our packages you can install for a given setup. Simply enter your dbt version and (optionally) dbt_utils version below.




Please enter a valid dbt version.