Skip to main content

YAUAA enrichment

YAUAA (Yet Another User Agent Analyzer) enrichment is a powerful user agent parser and analyzer.

It uses YAUAA API to parse and analyze all user agent information of an HTTP request and extract as many relevant information as possible about the user's device and browser, like for instance the device class (Phone, Tablet, etc.).

caution

YAUAA parsing relies entirely on in-memory HashMaps and requires roughly 400 MB of RAM (see here). Additional memory is also needed if caching is enabled (by default).

There is no interaction with an external system.

Configure your website to send client hints

The current trend in web browsers is to reduce the amount of information sent to servers in the User-Agent HTTP header, to help with user privacy. Client Hints are a way for sites to opt in to sending the extra information which previously would have been part of the user agent header.

The YAUAA enrichment does not require you to make any changes to your website, if you are happy with the level of detail you get from the reduced user agent. However, it is possible to improve the accuracy of the user agent data produced by YAUAA, if you are able to configure your website to send high entropy client hints to the collector by following these instructions.

There are two ways for your website to opt-in to sending client hints to the collector. In the first method, you must configure your webserver to set both a Accept-CH HTTP header and a Permissions-Policy HTTP header when serving the your site's main HTML pages:

Accept-CH: sec-ch-ua, sec-ch-ua-full-version-list, sec-ch-ua-full-version, sec-ch-ua-mobile, sec-ch-ua-platform, sec-ch-ua-platform-version, sec-ch-ua-arch, sec-ch-ua-bitness, sec-ch-ua-model, sec-ch-ua-wow64
Permissions-Policy: ch-ua=("https://<YOUR COLLECTOR>"), ch-ua-full-version-list=("https://<YOUR COLLECTOR>"), ch-ua-full-version=("https://<YOUR COLLECTOR>"), ch-ua-mobile=("https://<YOUR COLLECTOR>"), ch-ua-platform=("https://<YOUR COLLECTOR>"), ch-ua-platform-version=("https://<YOUR COLLECTOR>"), ch-ua-arch=("https://<YOUR COLLECTOR>"), ch-ua-bitness=("https://<YOUR COLLECTOR>"), ch-ua-model=("https://<YOUR COLLECTOR>"), ch-ua-wow64=("https://<YOUR COLLECTOR>"),

Alternatively, in the second method, you can put a meta tag in the header secion of your site's HTML:

<meta http-equiv="delegate-ch" content="sec-ch-ua https://<YOUR COLLECTOR>; sec-ch-ua-full-version-list https://<YOUR COLLECTOR>; sec-ch-ua-full-version https://<YOUR COLLECTOR>; sec-ch-ua-mobile https://<YOUR COLLECTOR>; sec-ch-ua-platform https://<YOUR COLLECTOR>; sec-ch-ua-platform-version https://<YOUR COLLECTOR>; sec-ch-ua-arch https://<YOUR COLLECTOR>; sec-ch-ua-bitness https://<YOUR COLLECTOR>; sec-ch-ua-model https://<YOUR COLLECTOR>; sec-ch-ua-wow64 https://<YOUR COLLECTOR>;">

Configuration

Testing with Micro

Unsure if your enrichment configuration is correct or works as expected? You can easily test it using Snowplow Micro on your machine. Follow the Micro usage guide to set up Micro and configure it to use your enrichment.

Only one parameter can be set in the configuration : cacheSize. This field determines the number of already parsed user agents that are kept in memory for faster processing. If set to 0, caching is disabled. If not set, a default size is used for the cache (10000).

Input

This enrichment uses the following inputs:

  • The useragent field from the Snowplow event. Typically this field is taken from the User-Agent HTTP header. But Snowplow trackers can also override the user agent by setting the ua field in the tracker payload.
  • Client hint HTTP headers. These are a set of standard headers such as Sec-CH-UA which provide extra detail about the user agent.

Output

This enrichment adds a new derived context to the enriched event with this schema (since enrich 1.4.0).

If a field can't be figured out by the algorithm, it won't be in the output. But some fields can have value UNKNOWN.

The only field that will always be present is deviceClass.

Here is an example of a derived context attached by this enrichment for a page visited with a Samsung Galaxy S9:

{
"schema":"iglu:nl.basjes/yauaa_context/jsonschema/1-0-1",
"data": {
"deviceClass":"Phone",
"deviceName":"Samsung SM-G960F",
"deviceBrand":"Samsung",
"operatingSystemClass":"Mobile",
"operatingSystemName":"Android",
"operatingSystemVersion":"8.0.0",
"operatingSystemNameVersion":"Android 8.0.0",
"operatingSystemVersionBuild":"R16NW",
"layoutEngineClass":"Browser",
"layoutEngineName":"Blink",
"layoutEngineVersion":"62.0",
"layoutEngineVersionMajor":"62",
"layoutEngineNameVersion":"Blink 62.0",
"layoutEngineNameVersionMajor":"Blink 62",
"agentClass":"Browser",
"agentName":"Chrome",
"agentVersion":"62.0.3202.84",
"agentVersionMajor":"62",
"agentNameVersion":"Chrome 62.0.3202.84",
"agentNameVersionMajor":"Chrome 62"
}
}

The full output possiblities generated by the YAUAA algorithm can be found here.