---
title: Connect your Data Warehouse for Warehouse-Native Experiment Analysis
description: >-
  Connect a data warehouse to Datadog to enable warehouse-native experiment
  analysis.
breadcrumbs: >-
  Docs > Experiments > Experiments Guides > Connect your Data Warehouse for
  Warehouse-Native Experiment Analysis
---

> For the complete documentation index, see [llms.txt](https://docs.datadoghq.com/llms.txt).

# Connect your Data Warehouse for Warehouse-Native Experiment Analysis

{% callout %}
# Important note for users on the following Datadog sites: app.ddog-gov.com, us2.ddog-gov.com

{% alert level="danger" %}
This product is not supported for your selected [Datadog site](https://docs.datadoghq.com/getting_started/site.md). ({% placeholder "user-datadog-site-name" /%}).
{% /alert %}

{% /callout %}

## Overview{% #overview %}

Warehouse-native experiment analysis lets you run statistical computations directly in your data warehouse.

{% section displayed-if="Data warehouse is BigQuery" %}
This section only applies to users who meet the following criteria: Data warehouse is BigQuery

To set this up for BigQuery, connect a BigQuery service account to Datadog and configure your experiment settings. This guide covers:

- Preparing Google Cloud resources
- Granting permissions to the Datadog service account
- Configuring experiment settings in Datadog

## Prerequisites{% #prerequisites %}

Datadog connects to BigQuery through a Google Cloud service account. If you already have a service account connected to Datadog, skip to Step 1. Otherwise, expand the section below to create one.

{% collapsible-section %}
#### Create a Google Cloud service account

1. Open your [Google Cloud console](https://console.cloud.google.com/).
1. Navigate to IAM & Admin > Service Accounts.
1. Click Create service account.
1. Enter the following:
   1. Service account name.
   1. Service account ID.
   1. Service account description.
1. Click Create and continue.
   1. **Note**: The Permissions and Principals with access settings are optional here. These are configured in Step 2.
1. Click Done.

After you create the service account, continue to Step 1 to set up the Google Cloud resources.
{% /collapsible-section %}

{% alert level="info" %}
If you plan to use other Google Cloud observability functionality in Datadog, see [Datadog's Google Cloud Platform integration documentation](https://docs.datadoghq.com/integrations/google-cloud-platform.md#metric-collection) to determine which resources to enable.
{% /alert %}

## Allow Datadog IP addresses{% #allow-datadog-ip-addresses %}

If your Google Cloud project uses [VPC Service Controls](https://cloud.google.com/vpc-service-controls/docs/overview) to restrict access to BigQuery, add Datadog's outbound IP addresses to the access policy for your service perimeter. For standard BigQuery connections through a service account, this step is not required.

### Find Datadog's outbound IP addresses{% #find-datadog-s-outbound-ip-addresses %}

Datadog's outbound IP addresses vary by Datadog site. To get the current list:

1. Open the [IP Ranges page](https://docs.datadoghq.com/api/latest/ip-ranges.md) in the Datadog API documentation.
1. Select your Datadog site from the site selector in the top right corner. The page displays the API endpoint URL for your site, shown next to `GET` (for example, `https://ip-ranges.us5.datadoghq.com/`).
1. Open that URL in a browser or HTTP client.
1. In the JSON response, find the `webhooks.prefixes_ipv4` property. These IPv4 addresses are what Datadog uses to connect to your data warehouse.

## Step 1: Prepare the Google Cloud resources{% #step-1-prepare-the-google-cloud-resources %}

Datadog Experiments uses a BigQuery dataset for caching experiment results and a Cloud Storage bucket for staging experiment records.

### Create a BigQuery dataset{% #create-a-bigquery-dataset %}

1. Open your [Google Cloud console](https://console.cloud.google.com/).
1. In the Search bar, search for **BigQuery**.
1. In the Explorer panel, expand your project (for example, `datadog-sandbox`).
1. Select Datasets, then click Create dataset.
   {% image
      source="https://docs.dd-static.net/images//product_analytics/experiment/exp_bq_gc_create_dataset.a8e3bfc798ef8d9dbf75ca5db16c7739.png?auto=format"
      alt="The BigQuery Datasets page in the Google Cloud console showing the datadog-sandbox project expanded in the left Explorer menu with Datasets selected, a list of datasets with columns for Dataset ID, Type, Location, Create time, and Label, and the Create dataset button highlighted in the top right." /%}
1. Enter a Dataset ID (for example, `datadog_experiments_output`).
1. (Optional) Select a Data location from the dropdown, add Tags, and set Advanced options.
1. Click Create dataset.

### Create a Cloud Storage bucket{% #create-a-cloud-storage-bucket %}

Create a Cloud Storage bucket that Datadog Experiments can use to stage experiment exposure records. See Google's [Create a bucket](https://docs.cloud.google.com/storage/docs/creating-buckets#console) documentation.

## Step 2: Grant permissions to the Datadog service account{% #step-2-grant-permissions-to-the-datadog-service-account %}

The Datadog Experiments service account requires specific permissions to run warehouse-native experiment analysis.

### Assign IAM roles at the project level{% #assign-iam-roles-at-the-project-level %}

To assign IAM roles so Datadog Experiments can read and write data, and run jobs in your data warehouse:

1. Open your [Google Cloud console](https://console.cloud.google.com/) and navigate to IAM & Admin > IAM.
1. Select the Allow tab and click Grant access.
1. In the New principals field, enter the service account email.
1. Using the Select a role dropdown, add the following roles:
   1. [BigQuery Job User](https://docs.cloud.google.com/iam/docs/roles-permissions/bigquery#bigquery.jobUser): Allows the service account to run BigQuery jobs.
   1. [BigQuery Data Owner](https://docs.cloud.google.com/iam/docs/roles-permissions/bigquery#bigquery.dataOwner): Grants the service account full access to the Datadog Experiments output dataset.
   1. [Storage Object User](https://docs.cloud.google.com/iam/docs/roles-permissions/storage#storage.objectUser): Allows the service account to read and write objects in the storage bucket that Datadog Experiments uses.
   1. [BigQuery Data Viewer](https://docs.cloud.google.com/iam/docs/roles-permissions/bigquery#bigquery.dataViewer): Allows the service account to read tables used in warehouse-native metrics.
1. Click Save.

{% image
   source="https://docs.dd-static.net/images//product_analytics/experiment/exp_bq_gc_iam_role.fd4b6118392be7ee8ec919cd289f4a06.png?auto=format"
   alt="The Google Cloud IAM page showing the Grant access panel for a project, with the Grant access button highlighted on the left, a New principals field highlighted in the Add principals section, and a Select a role dropdown highlighted in the Assign roles section." /%}

### Grant read access to specific source tables{% #grant-read-access-to-specific-source-tables %}

Repeat the following steps for each dataset you plan to use for experiment metrics:

1. In the [Google Cloud console](https://console.cloud.google.com/) Search bar, search for **BigQuery**.
1. In the Explorer panel, expand your project (for example, `datadog-sandbox`).
1. Click Datasets, then select the dataset containing your source tables.
1. Click the Share dropdown and select Manage permissions.
   {% image
      source="https://docs.dd-static.net/images//product_analytics/experiment/exp_bq_gc_permissions.df815d0c5c9d3e75f5a762e95a47f089.png?auto=format"
      alt="The BigQuery dataset page with the Share dropdown expanded and Manage permissions highlighted, showing additional options including Copy link, Authorize Views, Authorize Routines, Authorize Datasets, Manage Subscriptions, and Publish as Listing." /%}
1. Click Add principal.
1. In the New principals field, enter the service account email.
1. Using the Select a role dropdown, select the BigQuery Data Viewer role.
1. Click Save.
{% /section %}

{% section displayed-if="Data warehouse is Databricks" %}
This section only applies to users who meet the following criteria: Data warehouse is Databricks

To set this up for Databricks, connect a Databricks service account to Datadog and configure your experiment settings. This guide covers:

- Granting permissions to the service principal
- Connecting Databricks to Datadog
- Configuring experiment settings in Datadog

## Prerequisites{% #prerequisites-2 %}

Datadog Experiments connects to Databricks through the [Datadog Databricks integration](https://docs.datadoghq.com/integrations/databricks.md?tab=useaserviceprincipalforoauth). If you already have a Databricks integration configured for the workspace you plan to use, skip to Step 1. Otherwise, expand the section below to create a service principal.

{% collapsible-section %}
#### Create a Databricks service principal

**In your Databricks Workspace**:

1. Click your profile in the top right corner and select Settings.
1. In the Settings menu, click Identity and access.
1. On the Service principals row, click Manage, then:
   1. Click Add service principal, then Add new.
   1. Enter a service principal name and click Add.
1. Click the name of the new service principal to open its details page.
1. Select the Permissions tab, then:
   1. Click Grant access.
   1. Under User, Group or Service Principal, enter the service principal name.
   1. Using the Permission dropdown, select Manage.
   1. Click Save.
1. Select the Secrets tab, then:
   1. Click Generate secret.
   1. Set the Lifetime (days) value to the maximum allowed (for example, 730).
   1. Click Generate.
   1. Note your Secret and Client ID.
   1. Click Done.

After you create the service principal, continue to Step 1 to grant the required permissions.
{% /collapsible-section %}

{% alert level="info" %}
If you plan to use other warehouse observability functionality in Datadog, see [Datadog's Databricks integration documentation](https://docs.datadoghq.com/integrations/databricks.md) to determine which resources to enable.
{% /alert %}

## Allow Datadog IP addresses{% #allow-datadog-ip-addresses-2 %}

If your Databricks workspace has [IP access lists](https://docs.databricks.com/en/security/network/front-end/ip-access-list-workspace.html) enabled, add Datadog's outbound IP addresses to the workspace allowlist so Datadog can connect.

### Find Datadog's outbound IP addresses{% #find-datadog-s-outbound-ip-addresses-2 %}

Datadog's outbound IP addresses vary by Datadog site. To get the current list:

1. Open the [IP Ranges page](https://docs.datadoghq.com/api/latest/ip-ranges.md) in the Datadog API documentation.
1. Select your Datadog site from the site selector in the top right corner. The page displays the API endpoint URL for your site, shown next to `GET` (for example, `https://ip-ranges.us5.datadoghq.com/`).
1. Open that URL in a browser or HTTP client.
1. In the JSON response, find the `webhooks.prefixes_ipv4` property. These IPv4 addresses are what Datadog uses to connect to your data warehouse.

After retrieving the IPs, add them to your workspace's IP access list. See [Databricks documentation on IP access lists](https://docs.databricks.com/en/security/network/front-end/ip-access-list-workspace.html) for instructions.

## Step 1: Grant permissions to the service principal{% #step-1-grant-permissions-to-the-service-principal %}

{% alert level="info" %}
You must be an account admin to grant these permissions.
{% /alert %}

In your Databricks Workspace, open the SQL Editor to run the following commands and grant the service principal permissions for warehouse-native experiment analysis.

{% image
   source="https://docs.dd-static.net/images//product_analytics/experiment/guide/databricks_experiments_sql_editor.bc7d2a668f0d5e2c9e13007b345508df.png?auto=format"
   alt="The Databricks Workspace with SQL Editor highlighted in the left navigation under the SQL section, Queries listed below it, a New Query tab open with the New SQL editor: ON toggle at the top, an empty query editor, and a Run all (1000) button with a dropdown arrow." /%}

### Grant read access to source tables{% #grant-read-access-to-source-tables %}

Grant the service principal read access to the tables containing your experiment metrics. Run both `GRANT USE` commands, then run the `GRANT SELECT` option that matches your access needs. Replace `<catalog>`, `<schema>`, `<table>`, and `<principal>` with the appropriate values.

```
GRANT USE CATALOG ON CATALOG <catalog> TO `<principal>`;
GRANT USE SCHEMA ON SCHEMA <catalog>.<schema> TO `<principal>`;

-- Option 1: Give read access to a single table
GRANT SELECT ON TABLE <catalog>.<schema>.<table> TO `<principal>`;

-- Option 2: Give read access to all tables in the schema
GRANT SELECT ON ALL TABLES IN SCHEMA <catalog>.<schema> TO `<principal>`;
```

### Create an output schema{% #create-an-output-schema %}

Run the following commands to create a schema where Datadog Experiments can write intermediate results and temporary tables. Replace `datadog_experiments_output` with your output schema name, and `<catalog>` and `<principal>` with the appropriate values.

```
CREATE SCHEMA IF NOT EXISTS <catalog>.datadog_experiments_output;
GRANT USE SCHEMA ON SCHEMA <catalog>.datadog_experiments_output TO `<principal>`;
GRANT CREATE TABLE ON SCHEMA <catalog>.datadog_experiments_output TO `<principal>`;
```

### Configure a volume for temporary data staging{% #configure-a-volume-for-temporary-data-staging %}

Datadog Experiments uses a [volume](https://docs.databricks.com/aws/en/sql/language-manual/sql-ref-volumes) to temporarily save exposure data before copying it into a Databricks table. Run the following commands to create and grant access to this volume. Replace `datadog_experiments_output` with your output schema name, and `<catalog>` and `<principal>` with the appropriate values.

```
CREATE VOLUME IF NOT EXISTS <catalog>.datadog_experiments_output.datadog_experiments_volume;
GRANT READ VOLUME ON VOLUME <catalog>.datadog_experiments_output.datadog_experiments_volume TO `<principal>`;
GRANT WRITE VOLUME ON VOLUME <catalog>.datadog_experiments_output.datadog_experiments_volume TO `<principal>`;
```

### Grant SQL warehouse access{% #grant-sql-warehouse-access %}

Grant the service principal access to the SQL warehouse that Datadog Experiments uses to run queries.

1. Navigate to SQL Warehouses in your Databricks Workspace.
1. Select the warehouse for Datadog Experiments.
1. At the top right corner, click Permissions.
1. Grant the service principal the Can use permission.
1. Close the Manage permissions modal.

## Step 2: Connect Databricks to Datadog{% #step-2-connect-databricks-to-datadog %}

To connect your Databricks Workspace to Datadog for warehouse-native experiment analysis:

1. Navigate to [Datadog's integrations page](https://app.datadoghq.com/integrations/) and search for **Databricks**.
1. Click the Databricks tile to open its modal.
1. Select the Configure tab and click Add Databricks Workspace. If this is your first Databricks account, the setup form appears automatically.
1. Under the Connect a new Databricks Workspace section, enter:
   - Workspace Name.
   - Workspace URL.
   - Client ID.
   - Client Secret.
   - System Tables SQL Warehouse ID.
1. Toggle off Jobs Monitoring and all other products.
1. Toggle off the Metrics - Model Serving resource.
1. Click Save Databricks Workspace.

{% alert level="info" %}
If you turn on other features in the Configure tab, additional configuration steps may be required. See the documentation for those features to complete their setup.
{% /alert %}
{% /section %}

{% section displayed-if="Data warehouse is Redshift" %}
This section only applies to users who meet the following criteria: Data warehouse is Redshift

To set this up for Amazon Redshift, connect a Redshift cluster to Datadog using the AWS integration and configure your experiment settings. This guide covers:

- Preparing the Redshift cluster
- Creating AWS resources and granting IAM permissions
- Configuring experiment settings in Datadog

## Prerequisites{% #prerequisites-3 %}

Datadog Experiments connects to Redshift through [Datadog's Amazon Web Services (AWS) integration](https://docs.datadoghq.com/integrations/amazon-web-services.md). If you already have the AWS integration configured for the account containing your Redshift cluster, skip to Step 1.

{% collapsible-section %}
#### Set up the AWS integration

{% alert level="info" %}
Adding an AWS account requires the **AWS Configurations Manage** permission. If your organization uses custom roles, verify that your role includes this permission.
{% /alert %}

1. Navigate to [Datadog's integrations page](https://app.datadoghq.com/integrations/) and search for **Amazon Web Services**.
1. Click the Amazon Web Services tile to open its modal.
1. Click Add AWS Account(s) under the Configuration tab.
   1. If you do not yet have the AWS integration installed, Add AWS Account(s) appears on the AWS landing page after you open the integration tile.
1. Follow the CloudFormation setup flow to create an IAM role that allows Datadog to make API calls to your AWS account:
   1. Select your AWS Region.
   1. Choose your Datadog API Key.
   1. Create a Datadog Application Key.
   1. Toggle off Deploy log forwarder and Disable All Log Resources (these are not needed for experiment analysis).
   1. Select No for Detect security issues.
   1. Click Open in AWS Console to launch your CloudFormation template. See the [Getting Started with AWS documentation](https://docs.datadoghq.com/getting_started/integrations/aws.md) for instructions on navigating the AWS console.

You can follow your configuration's completion steps under Deployment Status on the integration setup page in Datadog.
{% /collapsible-section %}

{% alert level="info" %}
If you plan to use other warehouse observability functionality in Datadog, see [Datadog's Amazon Web Services integration documentation](https://docs.datadoghq.com/integrations/amazon-web-services.md#resource-collection) to determine which resources to enable.
{% /alert %}

## Allow Datadog IP addresses{% #allow-datadog-ip-addresses-3 %}

Add Datadog's outbound IP addresses as inbound rules to the VPC security group associated with your Redshift cluster. This allows Datadog to connect to your cluster and run experiment queries.

### Find Datadog's outbound IP addresses{% #find-datadog-s-outbound-ip-addresses-3 %}

Datadog's outbound IP addresses vary by Datadog site. To get the current list:

1. Open the [IP Ranges page](https://docs.datadoghq.com/api/latest/ip-ranges.md) in the Datadog API documentation.
1. Select your Datadog site from the site selector in the top right corner. The page displays the API endpoint URL for your site, shown next to `GET` (for example, `https://ip-ranges.us5.datadoghq.com/`).
1. Open that URL in a browser or HTTP client.
1. In the JSON response, find the `webhooks.prefixes_ipv4` property. These IPv4 addresses are what Datadog uses to connect to your data warehouse.

### Update the Redshift security group{% #update-the-redshift-security-group %}

Add each IP address as an inbound rule in the [VPC security group](https://docs.aws.amazon.com/redshift/latest/mgmt/working-with-security-groups.html) associated with your Redshift cluster, allowing TCP traffic on port `5439` (or your cluster's configured port). See [Amazon's documentation on VPC security groups](https://docs.aws.amazon.com/redshift/latest/mgmt/working-with-security-groups.html) for instructions.

## Step 1: Prepare the Redshift cluster{% #step-1-prepare-the-redshift-cluster %}

Create a Datadog service user and a dedicated schema for Datadog to store experiment results and intermediate tables.

{% alert level="info" %}
You must have `superuser` or `admin` privileges in the Redshift database to create the Datadog service user.
{% /alert %}

### Create a Datadog service user in your Redshift database{% #create-a-datadog-service-user-in-your-redshift-database %}

Run the following command to create a service user with a strong password that Datadog can use to execute queries. Replace `datadog_experiments_user` with your user value and `Your_Strong_Password` with your password.

```
CREATE USER datadog_experiments_user PASSWORD 'Your_Strong_Password';
```

### Create a Redshift output schema{% #create-a-redshift-output-schema %}

Run the following commands to create a schema where Datadog can store experiment results and intermediate tables. Replace `datadog_experiments_output` with your schema name and `datadog_experiments_user` with your service user value.

```
CREATE SCHEMA IF NOT EXISTS datadog_experiments_output;
GRANT ALL ON SCHEMA datadog_experiments_output TO datadog_experiments_user;
```

### Grant the service user read access to your metric data{% #grant-the-service-user-read-access-to-your-metric-data %}

Grant the service user read access to the tables or schemas that contain your source data. These are the tables you plan to use for experiment metrics, and are typically in a different schema than the output schema created above. Run the `GRANT USAGE` command, then run the `GRANT SELECT` option that matches your access needs. Replace `datadog_experiments_user`, `<schema>`, and `<table>` with the appropriate values.

```
GRANT USAGE ON SCHEMA <schema> TO datadog_experiments_user;

-- Option 1: Give read access to a single table
GRANT SELECT ON TABLE <schema>.<table> TO datadog_experiments_user;

-- Option 2: Give read access to all tables in the schema
GRANT SELECT ON ALL TABLES IN SCHEMA <schema> TO datadog_experiments_user;
```

## Step 2: Create AWS resources and grant IAM permissions{% #step-2-create-aws-resources-and-grant-iam-permissions %}

### Create an S3 bucket{% #create-an-s3-bucket %}

Create an S3 bucket for importing exposure events into your warehouse. The bucket name must start with `datadog-experimentation-` (for example, `datadog-experimentation-[aws_account_id]`). You can use the bucket's default settings.

### Grant additional IAM permissions{% #grant-additional-iam-permissions %}

In addition to the permissions listed in the [AWS integration documentation](https://docs.datadoghq.com/getting_started/integrations/aws.md#prerequisites), Datadog Experiments requires additional IAM permissions to run warehouse-native experiment analysis.

Use the following table to gather the values for your environment, then add the policy statement below to the IAM role that your Datadog AWS integration uses.

| Field                     | Example                                                                |
| ------------------------- | ---------------------------------------------------------------------- |
| `[Redshift cluster ARN]`  | `arn:aws:redshift:us-east-1:[account-id]:namespace:[namespace-id]`     |
| `[Redshift user ARN]`     | `arn:aws:redshift:us-east-1:[account-id]:dbuser:[cluster-name]/[user]` |
| `[Redshift database ARN]` | `arn:aws:redshift:us-east-1:[account-id]:dbname:[cluster-name]`        |
| `[S3 bucket ARN]`         | `arn:aws:s3:::[bucket-name]`                                           |

```
{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Sid": "RedshiftGetClusterCredentials",
      "Effect": "Allow",
      "Action": [
        "redshift:GetClusterCredentials"
      ],
      "Resource": [
        "[Redshift cluster ARN]",
        "[Redshift user ARN]",
        "[Redshift database ARN]"
      ]
    },
    {
      "Sid": "QueryRedshift",
      "Effect": "Allow",
      "Action": [
        "redshift-data:ExecuteStatement",
        "redshift-data:GetStatementResult",
        "redshift-data:DescribeStatement",
        "redshift-data:ListStatements",
        "redshift-data:CancelStatement"
      ],
      "Resource": "*"
    },
    {
      "Sid": "ListExperimentationBucket",
      "Effect": "Allow",
      "Action": [
        "s3:ListBucket"
      ],
      "Resource": "[S3 bucket ARN]"
    },
    {
      "Sid": "ReadWriteExperimentationBucket",
      "Effect": "Allow",
      "Action": [
        "s3:GetObject",
        "s3:PutObject",
        "s3:DeleteObject"
      ],
      "Resource": "[S3 bucket ARN]/*"
    }
  ]
}
```

### (Optional) Create an IAM role for Redshift to read exposure data{% #optional-create-an-iam-role-for-redshift-to-read-exposure-data %}

This step is required only if you use Datadog Feature Flagging. It enables Datadog to synchronize your feature flag exposures into Redshift so that metrics in your warehouse can be used in experiments. Datadog Experiments stages exposure data in the S3 bucket you created, then runs a Redshift `COPY` command to load that data into your warehouse. The `COPY` command uses a dedicated IAM role that your Redshift cluster assumes to read from the bucket. This role is separate from the role your Datadog AWS integration uses.

If you use your own feature flagging solution, exposure data already lives in your systems and Datadog does not synchronize exposures into Redshift. In that case, skip this step and leave the **Copy IAM role ARN** field blank in Step 3.

If you do need to synchronize exposures, create this role and associate it with your cluster.

#### Create an IAM policy{% #create-an-iam-policy %}

Create an IAM policy that grants read access to the S3 bucket you created. Replace `[bucket-name]` with your bucket name (for example, `datadog-experimentation-[aws_account_id]`).

```
{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Sid": "ListStagingBucket",
      "Effect": "Allow",
      "Action": [
        "s3:ListBucket",
        "s3:GetBucketLocation"
      ],
      "Resource": "arn:aws:s3:::[bucket-name]"
    },
    {
      "Sid": "ReadStagedExposures",
      "Effect": "Allow",
      "Action": "s3:GetObject",
      "Resource": "arn:aws:s3:::[bucket-name]/*"
    }
  ]
}
```

#### Create a role that Redshift can assume{% #create-a-role-that-redshift-can-assume %}

Create an IAM role and use the following trust policy so the Redshift service can assume the role:

```
{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Effect": "Allow",
      "Principal": {
        "Service": "redshift.amazonaws.com"
      },
      "Action": "sts:AssumeRole"
    }
  ]
}
```

#### Attach the policy to the role{% #attach-the-policy-to-the-role %}

Attach the policy you created to the role. For instructions, see [Adding and removing IAM identity permissions](https://docs.aws.amazon.com/IAM/latest/UserGuide/access_policies_manage-attach-detach.html) in the AWS documentation.

#### Associate the role with your Redshift cluster{% #associate-the-role-with-your-redshift-cluster %}

Associate the role with your cluster so the `COPY` command can assume it:

- For a provisioned cluster, open your cluster in the [Amazon Redshift console](https://console.aws.amazon.com/redshiftv2/), select Actions > Manage IAM roles, add the role, and save.
- For Redshift Serverless, open your namespace, go to Security and encryption > Manage IAM roles, add the role, and save.

For more details, see [Associating IAM roles with clusters](https://docs.aws.amazon.com/redshift/latest/mgmt/copy-unload-iam-role.html) in the AWS documentation.

After the role is associated, note its ARN (for example, `arn:aws:iam::[aws_account_id]:role/[role-name]`). You enter this ARN when configuring experiment settings in Step 3.
{% /section %}

{% section displayed-if="Data warehouse is Snowflake" %}
This section only applies to users who meet the following criteria: Data warehouse is Snowflake

To set this up for Snowflake, connect a Snowflake service account to Datadog and configure your experiment settings. This guide covers:

- Preparing a Snowflake service account
- Connecting it to Datadog
- Configuring experiment settings

## Allow Datadog IP addresses{% #allow-datadog-ip-addresses-4 %}

If your Snowflake account uses [network policies](https://docs.snowflake.com/en/user-guide/network-policies) to restrict connections by IP address, add Datadog's outbound IP addresses to a network policy and apply it to the Datadog Experiments service user.

### Find Datadog's outbound IP addresses{% #find-datadog-s-outbound-ip-addresses-4 %}

Datadog's outbound IP addresses vary by Datadog site. To get the current list:

1. Open the [IP Ranges page](https://docs.datadoghq.com/api/latest/ip-ranges.md) in the Datadog API documentation.
1. Select your Datadog site from the site selector in the top right corner. The page displays the API endpoint URL for your site, shown next to `GET` (for example, `https://ip-ranges.us5.datadoghq.com/`).
1. Open that URL in a browser or HTTP client.
1. In the JSON response, find the `webhooks.prefixes_ipv4` property. These IPv4 addresses are what Datadog uses to connect to your data warehouse.

After retrieving the IPs, see [Snowflake documentation on network policies](https://docs.snowflake.com/en/user-guide/network-policies) to create a policy that allows those addresses and apply it to the service user you create in Step 1.

## Step 1: Prepare the Snowflake service account{% #step-1-prepare-the-snowflake-service-account %}

The examples in this guide use `datadog_experiments_user` and `datadog_experiments_role` as the service account's user and role. Replace these with your own values.

### Create a dedicated service user and role in Snowflake{% #create-a-dedicated-service-user-and-role-in-snowflake %}

1. Use the [Snowflake documentation](https://docs.snowflake.com/en/user-guide/key-pair-auth) to create a public-private key pair for enhanced authentication. Datadog only supports unencrypted private keys.
1. Run the following commands in Snowflake to create the user and role in the service account. Replace `<public_key>` with the public key you generated in the previous step.

```
USE ROLE ACCOUNTADMIN;
CREATE ROLE IF NOT EXISTS datadog_experiments_role;
CREATE USER IF NOT EXISTS datadog_experiments_user
    RSA_PUBLIC_KEY = '<public_key>';
GRANT ROLE datadog_experiments_role TO USER datadog_experiments_user;
ALTER USER datadog_experiments_user SET DEFAULT_ROLE = datadog_experiments_role;
```

### Grant privileges to the role{% #grant-privileges-to-the-role %}

1. Identify the tables in Snowflake from which you intend to create metrics.
1. Run the following commands to grant read privileges to the new role, replacing `<database>`, `<schema>`, and `<table>` with their appropriate values. Run both `GRANT USAGE` commands, then run the `GRANT SELECT` option or options that match your access needs.

```
GRANT USAGE ON DATABASE <database> TO ROLE datadog_experiments_role;
GRANT USAGE ON SCHEMA <database>.<schema> TO ROLE datadog_experiments_role;

-- Option 1: Give read access to a single table
GRANT SELECT ON TABLE <database>.<schema>.<table> TO ROLE datadog_experiments_role;

-- Option 2: Give read access to all existing tables in the schema
GRANT SELECT ON ALL TABLES IN SCHEMA <database>.<schema> TO ROLE datadog_experiments_role;

-- Option 3: Give read access to all future tables in the schema
GRANT SELECT ON FUTURE TABLES IN SCHEMA <database>.<schema> TO ROLE datadog_experiments_role;
```

### Grant the role access to the output schema{% #grant-the-role-access-to-the-output-schema %}

Datadog writes experiment exposure logs and intermediate metric results to tables in a dedicated output schema. Run the following commands to create the schema and grant the role full access. Replace `<database>` with the appropriate value.

```
CREATE SCHEMA IF NOT EXISTS <database>.datadog_experiments_output;
GRANT ALL ON SCHEMA <database>.datadog_experiments_output TO ROLE datadog_experiments_role;
GRANT ALL PRIVILEGES ON FUTURE TABLES IN SCHEMA <database>.datadog_experiments_output TO ROLE datadog_experiments_role;
```

### Create a dedicated warehouse for Datadog Experiments (optional){% #create-a-dedicated-warehouse-for-datadog-experiments--optional %}

{% alert level="info" %}
The role you created must have access to at least one warehouse to compute results. You must enter the warehouse name when configuring experiment settings in Step 3.
{% /alert %}

Creating a dedicated warehouse for Datadog Experiments is optional. Run the following commands to create one. Replace `<wh_size>` with the appropriate value.

```
CREATE WAREHOUSE IF NOT EXISTS datadog_experiments_wh
    WAREHOUSE_SIZE = <wh_size>
    AUTO_SUSPEND = 300
    INITIALLY_SUSPENDED = true;
GRANT ALL PRIVILEGES ON WAREHOUSE datadog_experiments_wh TO ROLE datadog_experiments_role;
```

## Step 2: Connect Snowflake to Datadog{% #step-2-connect-snowflake-to-datadog %}

To connect your Snowflake account to Datadog for warehouse-native experiment analysis:

1. Navigate to [Datadog's integrations page](https://app.datadoghq.com/integrations/) and search for **Snowflake**.
1. Click the Snowflake tile to open its modal.
1. Select the Configure tab and click Add Snowflake Account.
1. Add your Account URL. To find your account URL, see the [Snowflake guide](https://docs.snowflake.com/en/user-guide/organizations-connect).
1. Toggle off all resources (these are not needed for experiment analysis).
1. Enter the Snowflake User Name you created in Step 1 (for example, `datadog_experiments_user`).
1. Scroll to the Configure a key pair authentication section and upload your unencrypted private key.
1. Click Save.

{% alert level="info" %}
The grants in the Recommended Warehouse Settings section of the Snowflake integration tile are not needed for warehouse-native experiment analysis. The privileges granted in Step 1 are sufficient.

If you plan to use other warehouse observability functionality in Datadog, see [Datadog's Snowflake integration documentation](https://docs.datadoghq.com/integrations/snowflake-web.md) to determine which resources to enable.
{% /alert %}

{% image
   source="https://docs.dd-static.net/images//product_analytics/experiment/guide/snowflake_main_integration.6732b76797eee7fffd863012a603bffd.png?auto=format"
   alt="The Snowflake integration tile in Datadog showing the Configure tab with the Add a new Snowflake account form, including an Account URL field and resource toggles for Metrics and Logs." /%}
{% /section %}

## Step 3: Configure experiment settings{% #step-3-configure-experiment-settings %}

{% section displayed-if="Data warehouse is BigQuery" %}
This section only applies to users who meet the following criteria: Data warehouse is BigQuery

{% alert level="info" %}
Datadog supports one warehouse connection per organization. Connecting BigQuery replaces any existing warehouse connection (for example, Snowflake).
{% /alert %}

After you set up your Google Cloud resources and IAM roles, configure the experiment settings in Datadog:

1. Open [Datadog Product Analytics](https://app.datadoghq.com/product-analytics).
1. In the left navigation, hover over Settings and click Experiments.
1. Select the Warehouse Connections tab.
1. Click Connect a data warehouse. If you already have a warehouse connected, click Edit instead.
1. Select the BigQuery tile.
1. Under Select BigQuery Account, enter:
   - GCP service account: The service account you are using for Datadog Experiments.
   - Project: Your Google Cloud project.
1. Under Dataset and GCS Bucket, enter:
   - Dataset: The dataset you created in Step 1 (for example, `datadog_experiments_output`).
   - GCS Bucket: The Cloud Storage bucket you created in Step 1.
1. Click Save.

{% image
   source="https://docs.dd-static.net/images//product_analytics/experiment/guide/bigquery_experiment_setup_dd.a2f5683641daec29ffe3bf8dbd6fe2e3.png?auto=format"
   alt="The Edit Data Warehouse modal with BigQuery selected, showing two sections: Select BigQuery Account with fields for GCP service account and Project, and Dataset and GCS Bucket with fields for Dataset and GCS Bucket." /%}

After you save your warehouse connection, [create experiment metrics](https://docs.datadoghq.com/experiments/defining_metrics.md) using your BigQuery data.
{% /section %}

{% section displayed-if="Data warehouse is Databricks" %}
This section only applies to users who meet the following criteria: Data warehouse is Databricks

{% alert level="info" %}
Datadog supports one warehouse connection per organization. Connecting Databricks replaces any existing warehouse connection (for example, Snowflake).
{% /alert %}

After you set up your Databricks integration and workspace, configure the experiment settings in Datadog:

1. Open [Datadog Product Analytics](https://app.datadoghq.com/product-analytics).
1. In the left navigation, hover over Settings and click Experiments.
1. Select the Warehouse Connections tab.
1. Click Connect a data warehouse. If you already have a warehouse connected, click Edit instead.
1. Select the Databricks tile.
1. Using the Account dropdown, select the Databricks Workspace you configured in Step 2.
1. Enter the Catalog, Schema, and Volume name you configured in Step 1. If your catalog and schema do not appear in the dropdown, enter them manually to add them to the list.
1. Click Save.

{% image
   source="https://docs.dd-static.net/images//product_analytics/experiment/guide/databricks_experiment_setup_1.e1533cca70cab80e83987df6faaab3e3.png?auto=format"
   alt="The Edit Data Warehouse modal with Databricks selected, showing input fields for Account, Catalog, Schema, and Volume Name." /%}

After you save your warehouse connection, [create experiment metrics](https://docs.datadoghq.com/experiments/defining_metrics.md) using your Databricks data.
{% /section %}

{% section displayed-if="Data warehouse is Redshift" %}
This section only applies to users who meet the following criteria: Data warehouse is Redshift

{% alert level="info" %}
Datadog supports one warehouse connection per organization. Connecting Redshift replaces any existing warehouse connection (for example, Snowflake).

Configuring experiment settings requires the **Product Analytics Settings Write** permission. If your organization uses custom roles, verify that your role includes this permission.
{% /alert %}

After you set up your AWS integration and Redshift cluster, configure the experiment settings in Datadog:

1. Open [Datadog Product Analytics](https://app.datadoghq.com/product-analytics).
1. In the left navigation, hover over Settings and click Experiments.
1. Select the Warehouse Connections tab.
1. Click Connect a data warehouse. If you already have a warehouse connected, click Edit instead.
1. Select the Redshift tile.
1. Select your AWS account from the dropdown.
1. Under Cluster Connection, enter:
   - AWS region: The region your Redshift cluster is in (for example, `us-east-1`).
   - Cluster identifier: The name of your Redshift cluster.
   - Cluster endpoint: The full endpoint URL for your cluster.
   - Port: The port your cluster is listening on (default: `5439`).
1. Under Database and Storage, enter:
   - Database: The name of the database containing your source tables.
   - Database user: The service user you created in Step 1 (for example, `datadog_experiments_user`).
   - Schema: The schema you created in Step 1 for Datadog Experiments to write to (for example, `datadog_experiments_output`).
   - Temp S3 bucket: The S3 bucket you created in Step 2 (for example, `datadog-experimentation-[aws_account_id]`).
   - Copy IAM role ARN (optional): The ARN of the IAM role you created in Step 2 for Redshift to read exposure data from S3 (for example, `arn:aws:iam::[aws_account_id]:role/[role-name]`). Provide this only if you use Datadog Feature Flagging and want Datadog to synchronize exposures into Redshift. If you use your own feature flagging solution, leave this blank.
1. Click Save.

{% image
   source="https://docs.dd-static.net/images//product_analytics/experiment/guide/redshift_pa_setup.318283f38a3b54aeb25a69c362a83bb1.png?auto=format"
   alt="The Redshift connection setup page in Datadog showing warehouse type tiles for Snowflake, BigQuery, Redshift (selected), and Databricks, with three sections: Select AWS Account with an AWS account dropdown, Cluster Connection with fields for AWS region, Cluster identifier, Cluster endpoint, and Port, and Database and Storage with fields for Database, Database user, Schema, and Temp S3 bucket." /%}

After you save your warehouse connection, [create experiment metrics](https://docs.datadoghq.com/experiments/defining_metrics.md) using your Redshift data.
{% /section %}

{% section displayed-if="Data warehouse is Snowflake" %}
This section only applies to users who meet the following criteria: Data warehouse is Snowflake

{% alert level="info" %}
Datadog supports one warehouse connection per organization. Connecting Snowflake replaces any existing warehouse connection (for example, Redshift).
{% /alert %}

After you set up your Snowflake integration, configure the experiment settings in [Datadog Product Analytics](https://app.datadoghq.com/product-analytics):

1. In the left navigation, hover over Settings, then click Experiments.
1. Select the Warehouse Connections tab.
1. Click Connect a data warehouse. If you already have a warehouse connected, click Edit instead.
1. Select the Snowflake tile.
1. Enter the Account, Role, Warehouse, Database, and Schema you configured in Step 1. If your database and schema do not appear in the dropdown, enter them manually to add them to the list.
1. Click Save.

{% image
   source="https://docs.dd-static.net/images//product_analytics/experiment/guide/snowflake_experiment_setup.513728767a9be9ef74574c5af29a07e7.png?auto=format"
   alt="The Edit Data Warehouse modal with Snowflake selected, showing two sections: Select Snowflake Account with fields for Account, Role, and Warehouse, and Select Database and Schema with fields for Database and Schema." /%}

After you save your warehouse connection, [create experiment metrics](https://docs.datadoghq.com/experiments/defining_metrics.md) using your Snowflake data.
{% /section %}

## Further reading{% #further-reading %}

- [Defining metrics in Datadog Experiments](https://docs.datadoghq.com/experiments/defining_metrics.md)
- [How to bridge speed and quality in experiments through unified data](https://www.datadoghq.com/blog/experimental-data-datadog/)


{% image
   source="https://docs.dd-static.net/images/product_analytics/experiment/exp_bq_gc_create_dataset.a8e3bfc798ef8d9dbf75ca5db16c7739.png?auto=format&fit=max&w=850 1x, https://docs.dd-static.net/images/product_analytics/experiment/exp_bq_gc_create_dataset.a8e3bfc798ef8d9dbf75ca5db16c7739.png?auto=format&fit=max&w=850&dpr=2 2x"
   alt="" /%}

{% image
   source="https://docs.dd-static.net/images/product_analytics/experiment/exp_bq_gc_iam_role.fd4b6118392be7ee8ec919cd289f4a06.png?auto=format&fit=max&w=850 1x, https://docs.dd-static.net/images/product_analytics/experiment/exp_bq_gc_iam_role.fd4b6118392be7ee8ec919cd289f4a06.png?auto=format&fit=max&w=850&dpr=2 2x"
   alt="" /%}

{% image
   source="https://docs.dd-static.net/images/product_analytics/experiment/exp_bq_gc_permissions.df815d0c5c9d3e75f5a762e95a47f089.png?auto=format&fit=max&w=850 1x, https://docs.dd-static.net/images/product_analytics/experiment/exp_bq_gc_permissions.df815d0c5c9d3e75f5a762e95a47f089.png?auto=format&fit=max&w=850&dpr=2 2x"
   alt="" /%}

{% image
   source="https://docs.dd-static.net/images/product_analytics/experiment/guide/databricks_experiments_sql_editor.bc7d2a668f0d5e2c9e13007b345508df.png?auto=format&fit=max&w=850 1x, https://docs.dd-static.net/images/product_analytics/experiment/guide/databricks_experiments_sql_editor.bc7d2a668f0d5e2c9e13007b345508df.png?auto=format&fit=max&w=850&dpr=2 2x"
   alt="" /%}

{% image
   source="https://docs.dd-static.net/images/product_analytics/experiment/guide/snowflake_main_integration.6732b76797eee7fffd863012a603bffd.png?auto=format&fit=max&w=850 1x, https://docs.dd-static.net/images/product_analytics/experiment/guide/snowflake_main_integration.6732b76797eee7fffd863012a603bffd.png?auto=format&fit=max&w=850&dpr=2 2x"
   alt="" /%}

{% image
   source="https://docs.dd-static.net/images/product_analytics/experiment/guide/bigquery_experiment_setup_dd.a2f5683641daec29ffe3bf8dbd6fe2e3.png?auto=format&fit=max&w=850 1x, https://docs.dd-static.net/images/product_analytics/experiment/guide/bigquery_experiment_setup_dd.a2f5683641daec29ffe3bf8dbd6fe2e3.png?auto=format&fit=max&w=850&dpr=2 2x"
   alt="" /%}

{% image
   source="https://docs.dd-static.net/images/product_analytics/experiment/guide/databricks_experiment_setup_1.e1533cca70cab80e83987df6faaab3e3.png?auto=format&fit=max&w=850 1x, https://docs.dd-static.net/images/product_analytics/experiment/guide/databricks_experiment_setup_1.e1533cca70cab80e83987df6faaab3e3.png?auto=format&fit=max&w=850&dpr=2 2x"
   alt="" /%}

{% image
   source="https://docs.dd-static.net/images/product_analytics/experiment/guide/redshift_pa_setup.318283f38a3b54aeb25a69c362a83bb1.png?auto=format&fit=max&w=850 1x, https://docs.dd-static.net/images/product_analytics/experiment/guide/redshift_pa_setup.318283f38a3b54aeb25a69c362a83bb1.png?auto=format&fit=max&w=850&dpr=2 2x"
   alt="" /%}

{% image
   source="https://docs.dd-static.net/images/product_analytics/experiment/guide/snowflake_experiment_setup.513728767a9be9ef74574c5af29a07e7.png?auto=format&fit=max&w=850 1x, https://docs.dd-static.net/images/product_analytics/experiment/guide/snowflake_experiment_setup.513728767a9be9ef74574c5af29a07e7.png?auto=format&fit=max&w=850&dpr=2 2x"
   alt="" /%}