Airbyte For Data Observability
Overview
Airbyte is a data integration platform that syncs data from sources to destinations. With Datadog’s Airbyte integration, data teams can understand how data flows through Airbyte connections and trace quality issues across their pipelines. The integration collects metadata and lineage for objects in your Airbyte account.
Data Catalog Assets
The integration builds a representation of the following Airbyte objects:
- Workspaces
- Connections
- Streams
Lineage
The integration generates lineage between warehouse tables through Airbyte connections. When both the source and destination of a connection are supported data warehouses, Datadog automatically derives warehouse-to-warehouse lineage across all active streams.
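The derivation rule above can be illustrated with a minimal sketch. It is not Datadog's implementation: the `Connection` and `Stream` types, the `SUPPORTED_WAREHOUSES` set, and the table naming scheme are all hypothetical and exist only to show the idea that an edge is produced for each active stream when both ends of a connection are supported warehouses.

```python
from dataclasses import dataclass

# Hypothetical placeholder: the real list of supported warehouses is not stated in this page.
SUPPORTED_WAREHOUSES = {"snowflake", "bigquery"}


@dataclass(frozen=True)
class Stream:
    name: str
    active: bool


@dataclass(frozen=True)
class Connection:
    source_type: str
    source_namespace: str
    destination_type: str
    destination_namespace: str
    streams: tuple


def derive_lineage(connections):
    """Return (source_table, destination_table) edges for every active stream."""
    edges = []
    for conn in connections:
        # Skip connections where either end is not a supported warehouse.
        if conn.source_type not in SUPPORTED_WAREHOUSES:
            continue
        if conn.destination_type not in SUPPORTED_WAREHOUSES:
            continue
        for stream in conn.streams:
            if not stream.active:
                continue
            source = f"{conn.source_type}:{conn.source_namespace}.{stream.name}"
            destination = f"{conn.destination_type}:{conn.destination_namespace}.{stream.name}"
            edges.append((source, destination))
    return edges
```

In this sketch, a connection whose source is not in the supported set (for example, a SaaS API) contributes no edges, and inactive streams are skipped even on an otherwise eligible connection.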
Setup
Installation
No installation steps are required.
Configuration
Airbyte Cloud
Create an Airbyte application to generate OAuth credentials:
- Log in to your Airbyte Cloud account.
- Navigate to Settings > Applications.
- Create a new application and copy the Client ID and Client Secret.
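Before entering the credentials in Datadog, it can be useful to confirm that they work. The sketch below only builds the token request and does not send it. The endpoint URL and the request body fields are assumptions based on Airbyte's public API and should be checked against the Airbyte API reference; `build_token_request` is a hypothetical helper name.

```python
import json
import urllib.request

# Assumed Airbyte Cloud token endpoint; verify against the Airbyte API reference.
TOKEN_URL = "https://api.airbyte.com/v1/applications/token"


def build_token_request(client_id, client_secret):
    """Build (but do not send) a POST request exchanging application credentials for a token."""
    body = json.dumps(
        {
            "client_id": client_id,
            "client_secret": client_secret,
            # Assumed field name and value for the client-credentials grant.
            "grant-type": "client_credentials",
        }
    ).encode("utf-8")
    return urllib.request.Request(
        TOKEN_URL,
        data=body,
        headers={"Content-Type": "application/json", "Accept": "application/json"},
        method="POST",
    )
```

Passing the returned request to `urllib.request.urlopen` would perform the exchange. A successful response suggests that the Client ID and Client Secret were copied correctly.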
Airbyte OSS (Self-Hosted)
For self-hosted Airbyte deployments, you need the base API URL of your Airbyte instance and, if authentication is enabled, the username and password.
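As a quick pre-check of these values, the following sketch builds a request against the instance using the base API URL and optional basic-auth credentials. It assumes that the instance exposes a health endpoint at `/v1/health` under the base API URL and that authentication uses HTTP Basic; both are assumptions to confirm for your Airbyte version. `build_health_request` is a hypothetical helper name, and the request is built but not sent.

```python
import base64
import urllib.request
from urllib.parse import urlparse


def build_health_request(base_api_url, username=None, password=None):
    """Build (but do not send) a GET request to the assumed health endpoint."""
    parsed = urlparse(base_api_url)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"Not a valid HTTP(S) URL: {base_api_url!r}")

    # Assumed path: <base API URL>/v1/health
    url = base_api_url.rstrip("/") + "/v1/health"
    request = urllib.request.Request(url, method="GET")

    # Only attach credentials when authentication is enabled on the instance.
    if username is not None and password is not None:
        raw = f"{username}:{password}".encode("utf-8")
        token = base64.b64encode(raw).decode("ascii")
        request.add_header("Authorization", f"Basic {token}")
    return request
```

For example, `build_health_request("https://airbyte.company.com/api")` targets `https://airbyte.company.com/api/v1/health`. Sending it with `urllib.request.urlopen` from a host that can reach the instance helps confirm the base API URL before it is entered in the form.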
Add the Airbyte integration
Navigate to the Airbyte integration tile.
Click Configure > + Add New Account.
Fill out the form with the following:
For Airbyte Cloud:
- Client ID: The client ID from the previous step.
- Client Secret: The client secret from the previous step.
For Airbyte OSS:
- Base API URL: The URL of your Airbyte instance API (for example, https://airbyte.company.com/api).
- Username: Your Airbyte username (if authentication is enabled).
- Password: Your Airbyte password (if authentication is enabled).
Click Save.
Validation
It can take up to 60 minutes for data to appear after the initial setup.
Check the Data Catalog to verify that your Airbyte account appears and that the expected catalog assets are present.
What’s next
After your Airbyte account is connected, Datadog syncs every 60 minutes and automatically derives lineage from source to destination warehouse tables across all active connections and streams.
After syncing, you can explore your Airbyte connections and their warehouse dependencies in the Data Observability Catalog.
Data Collected
Metrics
This integration does not include any metrics.
Events
This integration does not include any events.
Service Checks
This integration does not include any service checks.
Troubleshooting
Need help? Contact Datadog support.