OpenAI

Documentos > Integraciones > OpenAI

Supported OS Linux Windows Mac OS

Versión de la integración3.0.0

OpenAI Dashboard Usage Trends

OpenAI Dashboard Samples

OpenAI Dashboard Tokens and Cost

OpenAI Dashboard Cost Usage

Esta página aún no está disponible en español. Estamos trabajando en su traducción.
Si tienes alguna pregunta o comentario sobre nuestro actual proyecto de traducción, no dudes en ponerte en contacto con nosotros.

Overview

Monitor, troubleshoot, and evaluate your LLM-powered applications, such as chatbots or data extraction tools, using OpenAI. With LLM Observability, you can investigate the root cause of issues, monitor operational performance, and evaluate the quality, privacy, and safety of your LLM applications.

LLM Obs tracing view video

Get cost estimation, prompt and completion sampling, error tracking, performance metrics, and more out of OpenAI account-level, Python, Node.js, and PHP library requests using Datadog metrics and APM.

Setup

Note: The Supported OS requirements for this integration apply to APM library installations (Python, Node.js, PHP) only. The API key-based setup has no OS requirements.

IMPORTANT: An admin-scoped API key is required to collect usage metrics and Cloud Cost Management (CCM) data. Without an admin-scoped API key, this integration cannot ingest audio_speeches, audio_transcriptions, code_interpreter_sessions, completions, embeddings, images, moderations, and vector_stores metrics, and CCM cost data are not available.

Note: To collect all metrics provided by this integration, also follow the APM setup instructions in addition to the API key setup.

Installation

Configuring OpenAI Integration for Datadog

Overview

Datadog’s OpenAI integration allows you to collect usage metrics, cost data, and enables LLM Observability to monitor your OpenAI models. Follow the steps below to generate an OpenAI API key and configure the integration.

Prerequisites

An OpenAI account with admin write permissions
A valid OpenAI API key:
- For Cloud Cost Management (CCM) and usage metrics: An admin-scoped API key is mandatory. Project-scoped keys cannot collect this data.
- For LLM Observability only: A standard API key with write permissions for model capabilities is sufficient.

Setup

1. Generate an OpenAI API key

For Cloud Cost Management and usage metrics, you must create an admin-scoped API key:

Log in to your OpenAI Account.
Navigate to the Admin Keys page or go to API keys under Organization settings and select the Admin keys tab.
Click Create a new secret key.
Copy the created admin API Key to your clipboard.

For LLM Observability only (without CCM and usage metrics), you can use a standard API key:

Log in to your OpenAI Account.
Navigate to API keys under Organization settings.
Click Create a new secret key.
- Ensure that the API key has write permission for model capabilities to invoke models in your LLM account.
Copy the created API Key to your clipboard.

2. Configure Datadog’s OpenAI integration

Navigate to Datadog’s OpenAI integration tile and open the Configuration tab.
Click Add Account.
Under Account Name, enter a name for your account. Under API Key, enter your OpenAI API key (must be admin-scoped for CCM and usage metrics). Optionally, add a comma-separated list of tags for metrics associated with this account.
Under Resources, enable toggles depending on your use case:
- Collect Cost Data: If enabled, cost data is visible in Cloud Cost Management within 24 hours. Requires an admin-scoped API key. See collected data.
- Use this API key to evaluate your LLM applications: If enabled, evaluations are run through this API key in LLM Observability.

Additional Notes

Admin-scoped API key requirement: An admin-scoped API key is mandatory for collecting usage metrics and Cloud Cost Management data. This integration only collects audio_speeches, audio_transcriptions, code_interpreter_sessions, completions, embeddings, images, moderations, and vector_stores metrics when an admin-scoped API key is provided.
If you enable Cloud Cost Management for OpenAI without an admin-scoped API key, cost metrics are not available.

Additional Resources

Note: This setup method does not collect audio_speeches, audio_transcriptions, code_interpreter_sessions, completions, embeddings, images, moderations, and vector_stores metrics. To collect these metrics, also follow the API key setup instructions.

Installation

LLM Observability: Get end-to-end visibility into your LLM application’s calls to OpenAI

You can enable LLM Observability in different environments. Follow the appropriate setup based on your scenario:

If you do not have the Datadog Agent:

Install the ddtrace package:
```
  pip install ddtrace
```

Start your application with the following command, enabling agentless mode:

  DD_SITE=<YOUR_DATADOG_SITE> DD_API_KEY=<YOUR_API_KEY> DD_LLMOBS_ENABLED=1 DD_LLMOBS_AGENTLESS_ENABLED=1 DD_LLMOBS_ML_APP=<YOUR_ML_APP_NAME> ddtrace-run python <YOUR_APP>.py

If you already have the Datadog Agent installed:

Make sure the Agent is running and that APM and StatsD are enabled. For example, use the following command with Docker:

docker run -d \
  --cgroupns host \
  --pid host \
  -v /var/run/docker.sock:/var/run/docker.sock:ro \
  -v /proc/:/host/proc/:ro \
  -v /sys/fs/cgroup/:/host/sys/fs/cgroup:ro \
  -e DD_API_KEY=<DATADOG_API_KEY> \
  -p 127.0.0.1:8126:8126/tcp \
  -p 127.0.0.1:8125:8125/udp \
  -e DD_DOGSTATSD_NON_LOCAL_TRAFFIC=true \
  -e DD_APM_ENABLED=true \
  gcr.io/datadoghq/agent:latest

Install the ddtrace package if it isn’t installed yet:
```
  pip install ddtrace
```

Start your application using the ddtrace-run command to automatically enable tracing:

   DD_SITE=<YOUR_DATADOG_SITE> DD_API_KEY=<YOUR_API_KEY> DD_LLMOBS_ENABLED=1 DD_LLMOBS_ML_APP=<YOUR_ML_APP_NAME> ddtrace-run python <YOUR_APP>.py

Note: If the Agent is running on a custom host or port, set DD_AGENT_HOST and DD_TRACE_AGENT_PORT accordingly.

If you are running LLM Observability in a serverless environment (AWS Lambda):

Install the Datadog-Python and Datadog-Extension Lambda layers as part of your AWS Lambda setup.

Enable LLM Observability by setting the following environment variables:

   DD_SITE=<YOUR_DATADOG_SITE> DD_API_KEY=<YOUR_API_KEY> DD_LLMOBS_ENABLED=1 DD_LLMOBS_ML_APP=<YOUR_ML_APP_NAME>

Note: In serverless environments, Datadog automatically flushes spans when the Lambda function finishes running.

Automatic OpenAI tracing

LLM Observability provides automatic tracing for OpenAI’s completion and chat completion methods without requiring manual instrumentation.

The SDK will automatically trace the following OpenAI methods:

OpenAI().completions.create(), OpenAI().chat.completions.create()
For async calls: AsyncOpenAI().completions.create(), AsyncOpenAI().chat.completions.create()

No additional setup is required to capture latency, input/output messages, and token usage for these traced calls.

Validation

Validate that LLM Observability is properly capturing spans by checking your application logs for successful span creation. You can also run the following command to check the status of the ddtrace integration:

ddtrace-run --info

Look for the following message to confirm the setup:

Agent error: None

Debugging

If you encounter issues during setup, enable debug logging by passing the --debug flag:

ddtrace-run --debug

This will display detailed information about any errors or issues with tracing.

APM: Get Usage Metrics for Python Applications

Enable APM and StatsD in your Datadog Agent. For example, in Docker:

docker run -d
  --cgroupns host \
  --pid host \
  -v /var/run/docker.sock:/var/run/docker.sock:ro \
  -v /proc/:/host/proc/:ro \
  -v /sys/fs/cgroup/:/host/sys/fs/cgroup:ro \
  -e DD_API_KEY=<DATADOG_API_KEY> \
  -p 127.0.0.1:8126:8126/tcp \
  -p 127.0.0.1:8125:8125/udp \
  -e DD_DOGSTATSD_NON_LOCAL_TRAFFIC=true \
  -e DD_APM_ENABLED=true \
  gcr.io/datadoghq/agent:latest

Install the Datadog APM Python library.
```
pip install ddtrace
```
Prefix your OpenAI Python application command with ddtrace-run and the following environment variables as shown below:
```
DD_SERVICE="my-service" DD_ENV="staging" ddtrace-run python <your-app>.py
```

Notes:

If the Agent is using a non-default hostname or port, be sure to also set DD_AGENT_HOST, DD_TRACE_AGENT_PORT, or DD_DOGSTATSD_PORT.

See the APM Python library documentation for more advanced usage.

Configuration

See the APM Python library documentation for all the available configuration options.

Validation

Validate that the APM Python library can communicate with your Agent using:

ddtrace-run --info

You should see the following output:

    Agent error: None

Debug Logging

Pass the --debug flag to ddtrace-run to enable debug logging.

ddtrace-run --debug

This displays any errors sending data:

ERROR:ddtrace.internal.writer.writer:failed to send, dropping 1 traces to intake at http://localhost:8126/v0.5/traces after 3 retries ([Errno 61] Connection refused)

Note: This setup method does not collect openai.api.usage.* metrics. To collect these metrics, also follow the API key setup instructions.

Installation

LLM Observability: Get end-to-end visibility into your LLM application’s calls to OpenAI

You can enable LLM Observability in different environments. Follow the appropriate setup based on your scenario:

If you do not have the Datadog Agent:

Install the dd-trace package:
```
  npm install dd-trace
```

Start your application with the following command, enabling agentless mode:

  DD_SITE=<YOUR_DATADOG_SITE> DD_API_KEY=<YOUR_API_KEY> DD_LLMOBS_ENABLED=1 DD_LLMOBS_AGENTLESS_ENABLED=1 DD_LLMOBS_ML_APP=<YOUR_ML_APP_NAME> node -r 'dd-trace/init' <your_app>.js

If you already have the Datadog Agent installed:

Make sure the Agent is running and that APM and StatsD are enabled. For example, use the following command with Docker:

docker run -d \
  --cgroupns host \
  --pid host \
  -v /var/run/docker.sock:/var/run/docker.sock:ro \
  -v /proc/:/host/proc/:ro \
  -v /sys/fs/cgroup/:/host/sys/fs/cgroup:ro \
  -e DD_API_KEY=<DATADOG_API_KEY> \
  -p 127.0.0.1:8126:8126/tcp \
  -p 127.0.0.1:8125:8125/udp \
  -e DD_DOGSTATSD_NON_LOCAL_TRAFFIC=true \
  -e DD_APM_ENABLED=true \
  gcr.io/datadoghq/agent:latest

Install the Datadog APM Node.js library.
```
npm install dd-trace
```

Start your application using the -r dd-trace/init or NODE_OPTIONS='--require dd-trace/init' command to automatically enable tracing:

DD_SITE=<YOUR_DATADOG_SITE> DD_API_KEY=<YOUR_API_KEY> DD_LLMOBS_ENABLED=1 DD_LLMOBS_ML_APP=<YOUR_ML_APP_NAME> node -r 'dd-trace/init' <your_app>.js

Note: If the Agent is running on a custom host or port, set DD_AGENT_HOST and DD_TRACE_AGENT_PORT accordingly.

If you are running LLM Observability in a serverless environment (AWS Lambda):

Enable LLM Observability by setting the following environment variables:

DD_SITE=<YOUR_DATADOG_SITE> DD_API_KEY=<YOUR_API_KEY> DD_LLMOBS_ENABLED=1 DD_LLMOBS_ML_APP=<YOUR_ML_APP_NAME>

Before the lambda finishes, call llmobs.flush():

const llmobs = require('dd-trace').llmobs;
// or, if dd-trace was not initialized via NODE_OPTIONS
const llmobs = require('dd-trace').init({
  llmobs: {
    mlApp: <YOUR_ML_APP>,
  }
}).llmobs; // with DD_API_KEY and DD_SITE being set at the environment level

async function handler (event, context) {
  ...
  llmobs.flush()
  return ...
}

Automatic OpenAI tracing

LLM Observability provides automatic tracing for OpenAI’s completion, chat completion, and embedding methods without requiring manual instrumentation.

The SDK will automatically trace the following OpenAI methods:

client.completions.create(), client.chat.completions.create(), client.embeddings.create() (where client is an instance of OpenAI)

No additional setup is required to capture latency, input/output messages, and token usage for these traced calls.

Debugging

If you encounter issues during setup, enable debug logging by setting DD_TRACE_DEBUG=1.

This will display detailed information about any errors or issues with tracing.

APM: Get Usage Metrics for Node.js Applications

Enable APM and StatsD in your Datadog Agent. For example, in Docker:

docker run -d
  --cgroupns host \
  --pid host \
  -v /var/run/docker.sock:/var/run/docker.sock:ro \
  -v /proc/:/host/proc/:ro \
  -v /sys/fs/cgroup/:/host/sys/fs/cgroup:ro \
  -e DD_API_KEY=<DATADOG_API_KEY> \
  -p 127.0.0.1:8126:8126/tcp \
  -p 127.0.0.1:8125:8125/udp \
  -e DD_DOGSTATSD_NON_LOCAL_TRAFFIC=true \
  -e DD_APM_ENABLED=true \
  gcr.io/datadoghq/agent:latest

Install the Datadog APM Node.js library.
```
npm install dd-trace
```

Inject the library into your OpenAI Node.js application.

DD_TRACE_DEBUG=1 DD_TRACE_BEAUTIFUL_LOGS=1 DD_SERVICE="my-service" \
  DD_ENV="staging" DD_API_KEY=<DATADOG_API_KEY> \
  NODE_OPTIONS='-r dd-trace/init' node app.js

Note: If the Agent is using a non-default hostname or port, you must also set DD_AGENT_HOST, DD_TRACE_AGENT_PORT, or DD_DOGSTATSD_PORT.

See the APM Node.js OpenAI documentation for more advanced usage.

Configuration

See the APM Node.js library documentation for all the available configuration options.

Validation

Validate that the APM Node.js library can communicate with your Agent by examining the debugging output from the application process. Within the section titled “Encoding payload,” you should see an entry with a name field and a correlating value of openai.request. See below for a truncated example of this output:

{
  "name": "openai.request",
  "resource": "listModels",
  "meta": {
    "component": "openai",
    "span.kind": "client",
    "openai.api_base": "https://api.openai.com/v1",
    "openai.request.endpoint": "/v1/models",
    "openai.request.method": "GET",
    "language": "javascript"
  },
  "metrics": {
    "openai.response.count": 106
  },
  "service": "my-service",
  "type": "openai"
}

Note: To collect OpenAI audio_speeches, audio_transcriptions, code_interpreter_sessions, completions, embeddings, images, moderations, and vector_stores metrics, follow the API key setup instructions.

Installation

APM: Get Usage Metrics for PHP Applications

Enable APM and StatsD in your Datadog Agent. For example, in Docker:

docker run -d
  --cgroupns host \
  --pid host \
  -v /var/run/docker.sock:/var/run/docker.sock:ro \
  -v /proc/:/host/proc/:ro \
  -v /sys/fs/cgroup/:/host/sys/fs/cgroup:ro \
  -e DD_API_KEY=<DATADOG_API_KEY> \
  -p 127.0.0.1:8126:8126/tcp \
  -p 127.0.0.1:8125:8125/udp \
  -e DD_DOGSTATSD_NON_LOCAL_TRAFFIC=true \
  -e DD_APM_ENABLED=true \
  gcr.io/datadoghq/agent:latest

Install the Datadog APM PHP library.
The library is automatically injected into your OpenAI PHP application.

Notes:

If the Agent is using a non-default hostname or port, set DD_AGENT_HOST, DD_TRACE_AGENT_PORT, or DD_DOGSTATSD_PORT.

See the APM PHP library documentation for more advanced usage.

Configuration

See the APM PHP library documentation for all the available configuration options.

Validation

To validate that the APM PHP library can communicate with your Agent, examine the phpinfo output of your service. Under the ddtrace section, Diagnostic checks should be passed.

Data Collected

Metrics

IMPORTANT: An admin-scoped API key is required to collect the following metrics and Cloud Cost Management data:

audio_speeches
audio_transcriptions
code_interpreter_sessions
completions
embeddings
images
moderations
vector_stores

Without an admin-scoped API key, these metrics and CCM cost data are not ingested.

All remaining metrics below are collected with the APM setup methods.


openai.audio_speeches.characters (count)	Number of characters generated for text-to-speech
openai.audio_speeches.num_model_requests (count)	Number of text-to-speech model requests Shown as request
openai.audio_transcriptions.num_model_requests (count)	Number of audio transcription model requests Shown as request
openai.audio_transcriptions.seconds (count)	Number of seconds of audio transcribed Shown as second
openai.code_interpreter_sessions.num_sessions (count)	Number of code interpreter sessions Shown as session
openai.completions.input_audio_tokens (count)	Number of audio input tokens for completions Shown as token
openai.completions.input_cached_tokens (count)	Number of cached input tokens for completions Shown as token
openai.completions.input_tokens (count)	Number of input tokens for completions Shown as token
openai.completions.num_model_requests (count)	Number of completion model requests Shown as request
openai.completions.output_audio_tokens (count)	Number of audio output tokens for completions Shown as token
openai.completions.output_tokens (count)	Number of output tokens for completions Shown as token
openai.embeddings.input_tokens (count)	Number of input tokens for embeddings Shown as token
openai.embeddings.num_model_requests (count)	Number of embedding model requests Shown as request
openai.images.images (count)	Number of images generated
openai.images.num_model_requests (count)	Number of image generation model requests Shown as request
openai.moderations.input_tokens (count)	Number of input tokens for moderations Shown as token
openai.moderations.num_model_requests (count)	Number of moderation model requests Shown as request
openai.organization.ratelimit.requests.remaining (gauge)	Number of requests remaining in the rate limit. Shown as request
openai.organization.ratelimit.tokens.remaining (gauge)	Number of tokens remaining in the rate limit. Shown as token
openai.ratelimit.requests (gauge)	Number of requests in the rate limit. Shown as request
openai.ratelimit.tokens (gauge)	Number of tokens in the rate limit. Shown as token
openai.request.duration (gauge)	Request duration distribution. Shown as nanosecond
openai.request.error (count)	Number of errors. Shown as error
openai.tokens.completion (gauge)	Number of tokens used in the completion of a response from OpenAI. Shown as token
openai.tokens.prompt (gauge)	Number of tokens used in the prompt of a request to OpenAI. Shown as token
openai.tokens.total (gauge)	Total number of tokens used in a request to OpenAI. Shown as token
openai.vector_stores.usage_bytes (gauge)	Number of bytes used in vector stores Shown as byte