Skip to main content

Organize Data Sources with Source Applications

For data collection, you will often have different sources of information that correspond to applications designed for a particular purpose. These are what we will refer to as Source Applications.

To illustrate, let's consider Snowplow. We can identify several applications designed for distinct purposes, each serving as a separate data source for behavioral data, or in other words, a Source Application:

  • The Snowplow website that corresponds to the application served under www.snowplow.io
  • The BDP Console application that is served under console.snowplowanalytics.com.
  • The documentation website serving as our information hub, for all things related to our product, served under docs.snowplow.io.

Source Applications are a foundational component that enables you to establish the overarching relationships that connect application IDs and Application Entites and Data Products.

Application IDsโ€‹

For each of these applications you would set up a unique application ID using the app_id field to distinguish them later on in analysis.

tip

We often see, and recommend as a best practice, setting up a unique application ID for each deployment environment you are using. For example ${appId}-qa for staging, ${appId}-dev for development environments.

Application Contextโ€‹

Application Context, also referred to as Global Context, is a set of entities that can be sent with every event recorded in the application. Using Source Applications you can document which Application Contexts are expected. This is really useful for tracking implementation, data discovery and preventing information duplication in Data Products.

info

Since Application Entities can also be set conditionally, you can mark any of them as optional with a note to better understand the condition or any extra information required. The method for conditionally adding an Application Context is through rulesets, filter functions and context generators.

Was this page helpful?