Airbyte For Data Observability

Integration version1.0.0

Overview

Airbyte is a data integration platform that syncs data from sources to destinations. With Datadog’s Airbyte integration, data teams can understand how data flows through Airbyte connections and trace quality issues across their pipelines. The integration collects metadata and lineage for objects in your Airbyte account.

Data Catalog Assets

The integration builds a representation of the following Airbyte objects:

  • Workspaces
  • Connections
  • Streams

Lineage

The integration generates lineage between warehouse tables through Airbyte connections. When both the source and destination of a connection are supported data warehouses, Datadog automatically derives warehouse-to-warehouse lineage across all active streams.

Setup

Installation

No installation steps are required.

Configuration

Airbyte Cloud

Create an Airbyte application to generate OAuth credentials:

  1. Log in to your Airbyte Cloud account.
  2. Navigate to Settings > Applications.
  3. Create a new application and copy the Client ID and Client Secret.

Airbyte OSS (Self-Hosted)

For self-hosted Airbyte deployments, you need the base API URL of your Airbyte instance and, if authentication is enabled, the username and password.

Add the Airbyte integration

  1. Navigate to the Airbyte integration tile.

  2. Click Configure > + Add New Account.

  3. Fill out the form with the following:

    For Airbyte Cloud:

    • Client ID: The client ID from the previous step.
    • Client Secret: The client secret from the previous step.

    For Airbyte OSS:

    • Base API URL: The URL of your Airbyte instance API (for example, https://airbyte.company.com/api).
    • Username: Your Airbyte username (if authentication is enabled).
    • Password: Your Airbyte password (if authentication is enabled).
  4. Click Save.

Validation

It can take up to 60 minutes for data to appear after the initial setup.

Check the Data Catalog to verify that your Airbyte account appears and that the expected catalog assets are present.

What’s next

When your Airbyte account is successfully connected, Datadog syncs every 60 minutes and automatically derives lineage from source to destination warehouse tables across all active connections and streams.

After syncing, you can explore your Airbyte connections and their warehouse dependencies in the Data Observability Catalog.

Data Collected

Metrics

This integration does not include any metrics.

Events

This integration does not include any events.

Service Checks

This integration does not include any service checks.

Troubleshooting

Need help? Contact Datadog support.