Setting up Snowplow Self-Hosted
This page is an introduction for the Self-Hosted Quick Start guide. Follow the Quick Start to set up a self-hosted deployment.
We have built a set of terraform modules, which automates the setup and deployment of the required infrastructure and applications for an operational Snowplow Community Edition pipeline, with just a handful of input variables required on your side.
After following this guide, you will be able to:
- Collect granular, well-structured data with our suite of web, mobile and server side SDKs
- Create your own custom events and entities
- Easily enable and disable our suite of out-of-the-box enrichments
- Consume your rich data from the data warehouse, database, lake and/or real-time stream
Please read the terms of the Snowplow Limited Use License Agreement (FAQ) which apply to Community Edition.
In short, Community Edition is meant for testing and evaluating Snowplow and must not be deployed in production.
Required time
If you are proficient with Terraform and cloud tooling, 1 hour should be sufficient. Otherwise, expect to spend a few hours.
Cost
Assuming around 100 events per second, the pipeline will cost around $200 per month on AWS and $240 per month on GCP.
To reduce the costs, you can tweak the configuration (e.g. use smaller instances), or shut down the pipeline when not in use.
Scale
Out of the box, the deployed pipeline will handle up to ~100 events per second (~9 million events per day).
Getting help
Check out our Community. If you run into any problems or have any questions, we are here to help.