Skip to main content

Configure the Identities data model

This page helps you configure the Snowplow Identities dbt package. You can customize variables, generate configuration code, and set output schemas.

Package configuration variables

This package sets variables to recommended defaults for optimal model performance. Depending on your use case, you might want to override these values by adding to your dbt_project.yml file.

:::note Variable name prefix All variables in Snowplow packages start with snowplow__ but we have removed these in the below tables for brevity. :::

Warehouse and tracker


Operation and logic


Contexts, filters, and logs


Config generator

Use the inputs below to generate configuration code for your dbt_project.yml. Any values not specified will use their package defaults.

Warehouse and tracker
Schema (dataset) that contains your atomic events
Database that contains your atomic events
Target name of your development environment as defined in your `profiles.yml` file
Reference to your events table
Operation and logic
The date to start processing events from on first run or a full refresh
The maximum number of days of new data to be processed since the latest event processed
Hash all identifier values with SHA-256 before storing them (recommended for PII compliance)
Identifiers

> Click the plus sign to add a new entry
Use this field to accept the Snowplow user license.
Entities (contexts), filters, and logs
App IDs

> Click the plus sign to add a new entry

Project Variables:

yaml
vars:
snowplow_identities: null

Output schemas

By default all scratch/staging tables will be created in the <target.schema>_scratch schema, the derived tables, will be created in <target.schema>_derived and all manifest tables in <target.schema>_snowplow_manifest. Some of these schemas are only used by specific packages, ensure you add the correct configurations for each packages you are using. To change, please add the following to your dbt_project.yml file:

tip

If you want to use just your connection schema with no suffixes, set the +schema: values to null

yaml
models:
snowplow_identities:
+schema: my_derived_schema
base:
manifest:
+schema: my_manifest_schema
scratch:
+schema: my_scratch_schema