Overview
This page walks Technology Partners through creating a log pipeline. A log pipeline is required if your integration sends logs to Datadog.
When developing your integration to send logs to Datadog, follow these guidelines to ensure the best experience for your users.
Best practices
Before creating a log pipeline, consider the following guidelines and best practices:
- The integration must use supported Datadog logs endpoints
- Your integration must use one of the supported endpoints exposed by Datadog for log ingestion. Alternatively, you can use the Logs Ingestion HTTP endpoint to send logs to Datadog.
- The integration must support all Datadog sites
- Users must be able to choose between the different Datadog sites whenever applicable. See Getting Started with Datadog Sites for more information about site differences. Your Datadog site endpoint is http-intake.logs.<DATADOG_SITE>, where <DATADOG_SITE> depends on the site the user selects.
- Allow users to attach custom tags while setting up your integration
- Tags can be set as key-value attributes in the JSON body of your integration's log payload. Datadog recommends allowing users to set custom tags for an integration. If the integration sends logs through the API, tags can optionally be set using the ddtags=<TAGS> query parameter. A sketch of such a request appears after this list.
- Set the integration's logs source tag to the integration name
- Datadog recommends setting the source tag to the integration name (for example, source:okta). source must be set before sending logs to Datadog's endpoints, as it cannot be remapped in the Datadog UI. The source tag must be lowercase and must not be editable by users, because it is used to enable integration pipelines and dashboards.
- Avoid sending logs that contain arrays in the JSON body whenever possible
- While it's possible to send array data in your logs, Datadog recommends avoiding arrays because they cannot be faceted.
- Do not log Datadog API keys
- Datadog API keys can be passed either in the header or as part of the HTTP path of your API requests. For examples, see the Send Logs API documentation. Avoid logging the API key in your setup.
- Do not use Datadog Application Keys
- Datadog Application Keys are not required to send logs using the HTTP endpoint.
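To make these practices concrete, here is a minimal sketch of a log submission to the Logs Ingestion HTTP endpoint. It is an illustration rather than a reference implementation: the integration name (okta), the tags, and the attribute values are placeholders, and the endpoint host must match the Datadog site the user selects.

```python
# Minimal sketch: send one log to the Datadog Logs Ingestion HTTP endpoint.
# Placeholder values: the integration name (okta), tags, and attributes are examples only.
import os
import requests

DD_SITE = os.environ.get("DD_SITE", "datadoghq.com")  # let users choose their Datadog site
DD_API_KEY = os.environ["DD_API_KEY"]                  # never log this value

url = f"https://http-intake.logs.{DD_SITE}/api/v2/logs"
headers = {
    "Content-Type": "application/json",
    "DD-API-KEY": DD_API_KEY,  # pass the key in a header, not in the payload or your logs
}
payload = [
    {
        "ddsource": "okta",                  # lowercase source set to the integration name
        "ddtags": "env:prod,team:identity",  # custom tags chosen by the user
        "service": "okta",
        "message": "user login succeeded",
        "usr": {"name": "jane.doe"},         # attributes as key-value pairs, no arrays
    }
]

response = requests.post(url, headers=headers, json=payload)
response.raise_for_status()
```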
Create log integration assets
You can create and design your log integration assets directly within your Datadog partner account.
Log integrations consist of two sets of assets: pipelines and associated facets. Centralizing logs from various technologies and applications can produce many unique attributes. To use out-of-the-box dashboards, Technology Partner Integrations should rely on Datadog’s standard naming convention when creating integrations.
After finalizing your Datadog Integration design and successfully sending logs to Datadog’s log endpoint(s), define your log pipelines and facets to enrich and structure your integration’s Logs.
For information about becoming a Datadog Technology Partner, and gaining access to an integration development sandbox, read Build an Integration.
To be reviewed by Datadog's integration team, log integrations must include assets and have pipeline processors or facets.
Pipelines overview
Logs sent to Datadog are processed in log pipelines using pipeline processors. These processors allow users to parse, remap, and extract attribute information, enriching and standardizing logs for use across the platform.
Create a pipeline
Create a log pipeline to process specified logs with pipeline processors.
- From the Pipelines page, click + New Pipeline.
- In the Filter field, enter the unique source tag that defines the log source for the Technology Partner's logs. For example, source:okta for the Okta integration. Note: Make sure that logs sent through the integration are tagged with the correct source tags before they are sent to Datadog.
- Optionally, add tags and a description.
- Click Create.
After you set up a pipeline, add processors to enrich and structure the logs further.
Add pipeline processors
Before defining your pipeline processors, review Datadog’s Standard Attributes.
Use processors within your pipelines to enrich and restructure your data, and generate log attributes. For a list of all log processors, see the Processors documentation.
Requirements
- Map the application's log attributes to Datadog's Standard Attributes
- Use the Attribute Remapper to map attribute keys to Datadog Standard Attributes where possible. For example, an attribute for a network service client IP value should be remapped to network.client.ip.
- Map the log service tag to the name of the service producing telemetry
- Use the Service Remapper to remap the service attribute. When source and service share the same value, remap the service tag to the source tag. service tags must be lowercase.
- Map the log's internal timestamp to its official Datadog timestamp
- Use the Date Remapper to define the official timestamp for logs. If a log's timestamp does not map to a standard date attribute, Datadog sets its timestamp to the time of ingestion.
- Map the custom status attributes of the logs to the official Datadog status attribute
- Use a Status Remapper to remap the status of a log, or a Category Processor for statuses mapped to a range (as with HTTP status codes).
- Map the custom message attribute of the logs to the official Datadog message attribute
- Use the Message Remapper to define the official message of the log if application logs do not map to the standard message attribute. This allows users to search for logs using free text.
- Set a namespace for custom attributes within your logs
- Generic log attributes that do not map to a Datadog Standard Attribute must be namespaced if they are mapped to facets. For example, file would be remapped to integration_name.file. Use the Attribute Remapper to set attribute keys to a new namespaced attribute. A sketch of this remapping appears after this list.
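To illustrate the remapping and namespacing requirements above, the sketch below shows a rough before/after shape of a log's attributes. The attribute names and the integration_name prefix are placeholders, and the layout is a simplification rather than Datadog's exact internal representation.

```python
# Hypothetical before/after shape of a log's attributes; names like client_ip, file,
# and integration_name are placeholders, and the nested layout is a simplification.
raw_attributes = {
    "client_ip": "192.0.2.10",      # custom attribute emitted by the application
    "file": "/var/tmp/report.csv",  # generic attribute with no standard equivalent
    "level": "warning",             # application-specific status value
}

# After the Attribute Remapper, Status Remapper, and namespacing described above run:
remapped_attributes = {
    "network": {"client": {"ip": "192.0.2.10"}},          # standard attribute network.client.ip
    "integration_name": {"file": "/var/tmp/report.csv"},  # namespaced custom attribute
    "status": "warning",                                   # official Datadog status
}
```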
- Expand the newly created pipeline and click Add Processor to begin building your pipeline using processors.
- If the integration’s logs aren’t in JSON format, add the Grok Processor to extract attribute information. Grok processors parse out attributes and enrich logs prior to remapping or further processing.
- After extracting log attributes, remap them to Datadog’s Standard Attributes where possible using Attribute Remappers.
- Set the timestamp of an integration’s logs to be its official Datadog timestamp using the Date Remapper.
- For more advanced processing and data transformations, use additional processors. For example, the Arithmetic Processor can calculate information based on attributes, and the String Builder Processor can concatenate multiple string attributes. A sketch of both appears after this list.
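As an illustration of those two processors, the hypothetical configurations below show the kind of formula and template each one accepts. The attribute names (time_end, time_start, duration_ms, http.method, http.url_details.path, request_summary) are placeholders, not part of any specific integration.

```
Arithmetic Processor (computes a new duration_ms attribute):
    formula: (time_end - time_start) * 1000

String Builder Processor (builds a request_summary attribute):
    template: %{http.method} %{http.url_details.path}
```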
Tips
- Remove original attributes when remapping log attributes by setting preserveSource:false. This helps avoid confusion and removes duplicates.
- To maintain optimal grok parsing performance, avoid wildcard matchers such as %{data:} and %{regex(".*"):}. Make your parsing statements as specific as possible; an example rule follows this list.
- Take the free course Going Deeper with Logs Processing for an overview of writing processors and leveraging standard attributes.
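For instance, a rule along the following lines (the rule name, sample log format, and target attributes are illustrative, not taken from a specific integration) relies on specific matchers rather than wildcards:

```
access_rule %{ip:network.client.ip} %{date("dd/MMM/yyyy:HH:mm:ss Z"):timestamp} %{word:http.method} %{notSpace:http.url} %{integer:http.status_code}
```

Each matcher targets one field of the log line, so a malformed line fails quickly instead of being absorbed by a data matcher.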
Facets overview
Facets are specific qualitative or quantitative attributes that can be used to filter and narrow down search results. While facets are not strictly necessary for filtering search results, they play a crucial role in helping users understand the available dimensions for refining their search.
Datadog automatically adds facets for standard attributes when a pipeline is published. Before creating a custom facet for an attribute, review whether the attribute should instead be remapped to a Datadog Standard Attribute.
Not all attributes are meant to be used as facets. Facets in integrations serve two main purposes:
- Facets provide a straightforward interface for filtering logs. They are leveraged in Log Management autocomplete features, allowing users to find and aggregate key information found in their logs.
- Facets allow for attributes with low readability to be renamed with a label that is easier to understand. For example: @deviceCPUper → Device CPU Utilization Percentage.
You can create facets in the Log Explorer.
Create facets
Correctly defining facets is important as they improve the usability of indexed logs in analytics, monitors, and aggregation features across Datadog’s Log Management product.
They make application logs easier to find by populating autocomplete features across Log Management.
Quantitative facets, called "Measures", allow users to filter logs over a range of numeric values using relational operators.
For example, a measure for a latency attribute allows users to search for all logs with a latency greater than a certain duration.
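For instance, assuming a measure mapped to a namespaced latency attribute named okta.response_time_ms (a hypothetical name), users could filter with a relational query such as:

```
@okta.response_time_ms:>500
```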
Requirements
- Attributes mapped to custom facets must be namespaced first
- Generic custom attributes that do not map to a Datadog Standard Attribute must be namespaced when used with custom facets. An Attribute Remapper can be used to namespace an attribute with the integration's name. For example, remap attribute_name to integration_name.attribute_name.
- Custom facets must not duplicate an existing Datadog facet
- To avoid confusion with existing out-of-the-box Datadog facets, do not create custom facets that duplicate any existing facets already mapped to Datadog Standard Attributes.
- Custom facets must be grouped under the source name
- When creating a custom facet, assign a group. Set the Group value to the source, which is the same as the integration's name.
- Custom facets must have the same data type as the mapped attribute
- Set the facet data type (String, Boolean, Double, or Integer) to the same type as the attribute mapped to it. Mismatched types prevent the facet from being used as intended and can cause it to populate incorrectly.
Add a facet or measure
- Click on a log that contains the attribute you want to add a facet or measure for.
- In the log panel, click the Cog icon next to the attribute.
- Select Create facet/measure for @attribute.
- For a measure, click Advanced options to define its unit. Select the unit based on what the attribute represents.
- Specify a facet Group to help navigate the Facet List. If the facet group does not exist, select New group, enter the name of the group matching the source tag, and add a description for the new group.
- To create the facet, click Add.
To edit an existing facet or measure:
- In the log panel, click the Cog icon next to the attribute that you want to configure or group.
- Select Edit facet/measure for @attribute. If there isn't a facet for the attribute yet, select Create facet/measure for @attribute.
- Click Add or Update when done.
Tips
- Assign measures a unit where possible. Two families of units are available, TIME and BYTES, with units such as millisecond or gibibyte.
- Give facets a description. A clear description helps users understand how best to use the facet.
- If you remap an attribute and keep the original using the preserveSource:true option, define a facet on only one of the two attributes.
- When manually configuring facets in a pipeline's .yaml configuration files, note that each facet is assigned a source. This refers to where the attribute is captured from and can be log for attributes or tag for tags.
Review and deploy the integration
Datadog reviews the log integration based on the guidelines and requirements documented on this page and provides feedback to the Technology Partner through GitHub. In turn, the Technology Partner reviews and makes changes accordingly.
To start a review process, export your log pipeline and relevant custom facets using the Export icon on the Logs Configuration page.
Include sample raw logs with all the attributes you expect your integration to send to Datadog. Raw logs are the raw messages generated directly by the source application before they are sent to Datadog.
The log pipeline export includes two YAML files:
- One with the log pipeline, which includes custom facets, attribute remappers, and grok parsers. The exported file is named pipeline-name.yaml.
- One with the raw sample logs provided and an empty result section. The exported file is named pipeline-name_test.yaml.
Note: Depending on your browser, you may need to adjust your settings to allow file downloads.
After you’ve downloaded these files, navigate to your integration’s pull request on GitHub and add them in the Assets > Logs directory. If a Logs folder does not exist yet, you can create one.
Validations run automatically in your pull request against the raw samples provided. They produce a result that you can set as the result section of your pipeline-name_test.yaml file.
Once the validations run again and no issues are detected, the logs validation succeeds.
Three common validation errors are:
- The id field in both YAML files: Ensure that the id field matches the app_id field in your integration's manifest.json file to connect your pipeline to your integration.
- Not providing the result of running the raw logs you provided against your pipeline. If the resulting output from the validation is accurate, add that output to the result field in the YAML file containing the raw example logs.
- If you send service as a parameter instead of sending it in the log payload, you must include the service field below your log samples within the YAML file.
After validations pass, Datadog creates and deploys the new log integration assets. If you have any questions, add them as comments in your pull request. Datadog team members respond within 2-3 business days.
Further reading
Additional helpful documentation, links, and articles: