---
title: Run an LLM inference
description: Datadog, the leading service for cloud-scale monitoring.
breadcrumbs: Docs > API Reference > LLM Observability
---

> For the complete documentation index, see [llms.txt](https://docs.datadoghq.com/llms.txt).

# Run an LLM inference{% #run-an-llm-inference %}
Copy pageCopied
{% tab title="v2" %}
**Note**: This endpoint is in Preview and is subject to change. If you have any feedback, contact [Datadog support](https://docs.datadoghq.com/help/).
| Datadog site      | API endpoint                                                                                           |
| ----------------- | ------------------------------------------------------------------------------------------------------ |
| ap1.datadoghq.com | POST https://api.ap1.datadoghq.com/api/v2/llm-obs/v1/integrations/{integration}/{account_id}/inference |
| ap2.datadoghq.com | POST https://api.ap2.datadoghq.com/api/v2/llm-obs/v1/integrations/{integration}/{account_id}/inference |
| app.datadoghq.eu  | POST https://api.datadoghq.eu/api/v2/llm-obs/v1/integrations/{integration}/{account_id}/inference      |
| app.ddog-gov.com  | POST https://api.ddog-gov.com/api/v2/llm-obs/v1/integrations/{integration}/{account_id}/inference      |
| us2.ddog-gov.com  | POST https://api.us2.ddog-gov.com/api/v2/llm-obs/v1/integrations/{integration}/{account_id}/inference  |
| uk1.datadoghq.com | POST https://api.uk1.datadoghq.com/api/v2/llm-obs/v1/integrations/{integration}/{account_id}/inference |
| app.datadoghq.com | POST https://api.datadoghq.com/api/v2/llm-obs/v1/integrations/{integration}/{account_id}/inference     |
| us3.datadoghq.com | POST https://api.us3.datadoghq.com/api/v2/llm-obs/v1/integrations/{integration}/{account_id}/inference |
| us5.datadoghq.com | POST https://api.us5.datadoghq.com/api/v2/llm-obs/v1/integrations/{integration}/{account_id}/inference |

### Overview

Run an LLM inference request through the specified integration and account, returning the model response and token usage.

### Arguments

#### Path Parameters

| Name                          | Type   | Description                        |
| ----------------------------- | ------ | ---------------------------------- |
| integration [*required*] | string | The name of the LLM integration.   |
| account_id [*required*]  | string | The ID of the integration account. |

### Request

#### Body Data (required)

Inference request parameters.

{% tab title="Model" %}

| Parent field          | Field                        | Type     | Description                                                                                                     |
| --------------------- | ---------------------------- | -------- | --------------------------------------------------------------------------------------------------------------- |
|                       | anthropic_metadata           | object   | Anthropic-specific metadata for an inference request.                                                           |
| anthropic_metadata    | effort                       | enum     | The effort level for Anthropic inference. Allowed enum values: `low,medium,high,max`                            |
| anthropic_metadata    | thinking                     | object   | Configuration for Anthropic extended thinking feature.                                                          |
| thinking              | budget_tokens                | int64    | Maximum token budget for extended thinking. Required when type is `enabled`.                                    |
| thinking              | type [*required*]       | enum     | The thinking mode for Anthropic extended thinking. Allowed enum values: `enabled,disabled,adaptive`             |
|                       | azure_openai_metadata        | object   | Azure OpenAI-specific metadata for an integration account or inference request.                                 |
| azure_openai_metadata | deployment_id                | string   | The Azure OpenAI deployment ID.                                                                                 |
| azure_openai_metadata | model_version                | string   | The model version deployed in Azure.                                                                            |
| azure_openai_metadata | resource_name                | string   | The Azure OpenAI resource name.                                                                                 |
|                       | bedrock_metadata             | object   | Amazon Bedrock-specific metadata for an inference request.                                                      |
| bedrock_metadata      | region                       | string   | The AWS region for the Bedrock request.                                                                         |
|                       | frequency_penalty            | double   | Penalty for token frequency to reduce repetition.                                                               |
|                       | json_schema                  | string   | JSON schema for structured output, if supported by the model.                                                   |
|                       | max_completion_tokens        | int64    | Maximum number of completion tokens to generate (alternative to max_tokens for some providers).                 |
|                       | max_tokens                   | int64    | Maximum number of tokens to generate.                                                                           |
|                       | messages [*required*]   | [object] | List of messages in an inference conversation.                                                                  |
| messages              | content                      | string   | Plain text content of the message.                                                                              |
| messages              | contents                     | [object] | List of structured content blocks in a message.                                                                 |
| contents              | type [*required*]       | string   | The content block type.                                                                                         |
| contents              | value [*required*]      | object   | The typed value of a message content block.                                                                     |
| value                 | text                         | string   | Plain text content.                                                                                             |
| value                 | tool_call                    | object   | A tool call made during LLM inference.                                                                          |
| tool_call             | arguments                    | object   | The arguments passed to the tool.                                                                               |
| tool_call             | name                         | string   | The name of the tool being called.                                                                              |
| tool_call             | tool_id                      | string   | Unique identifier for the tool call.                                                                            |
| tool_call             | type                         | string   | The type of tool call.                                                                                          |
| value                 | tool_call_result             | object   | The result returned by a tool call during LLM inference.                                                        |
| tool_call_result      | name                         | string   | The name of the tool that produced this result.                                                                 |
| tool_call_result      | result                       | string   | The result content returned by the tool.                                                                        |
| tool_call_result      | tool_id                      | string   | Identifier matching the corresponding tool call.                                                                |
| tool_call_result      | type                         | string   | The type of tool result.                                                                                        |
| messages              | id                           | string   | Unique identifier for the message.                                                                              |
| messages              | role                         | string   | The role of the message author.                                                                                 |
| messages              | tool_calls                   | [object] | List of tool calls in a message.                                                                                |
| tool_calls            | arguments                    | object   | The arguments passed to the tool.                                                                               |
| tool_calls            | name                         | string   | The name of the tool being called.                                                                              |
| tool_calls            | tool_id                      | string   | Unique identifier for the tool call.                                                                            |
| tool_calls            | type                         | string   | The type of tool call.                                                                                          |
| messages              | tool_results                 | [object] | List of tool results in a message.                                                                              |
| tool_results          | name                         | string   | The name of the tool that produced this result.                                                                 |
| tool_results          | result                       | string   | The result content returned by the tool.                                                                        |
| tool_results          | tool_id                      | string   | Identifier matching the corresponding tool call.                                                                |
| tool_results          | type                         | string   | The type of tool result.                                                                                        |
|                       | model_id [*required*]   | string   | The model identifier to use for inference.                                                                      |
|                       | openai_metadata              | object   | OpenAI-specific metadata for an inference request.                                                              |
| openai_metadata       | reasoning_effort             | enum     | The reasoning effort level for OpenAI models that support it. Allowed enum values: `none,low,medium,high,xhigh` |
| openai_metadata       | reasoning_summary            | enum     | The verbosity of the reasoning summary. Allowed enum values: `auto,concise,detailed`                            |
|                       | presence_penalty             | double   | Penalty for token presence to encourage topic diversity.                                                        |
|                       | temperature                  | double   | Sampling temperature between 0 and 2. Higher values produce more random output.                                 |
|                       | tools                        | [object] | List of tools available to the model.                                                                           |
| tools                 | function [*required*]   | object   | A function definition for a tool available to the model.                                                        |
| function              | description                  | string   | A description of what the function does.                                                                        |
| function              | name [*required*]       | string   | The name of the function.                                                                                       |
| function              | parameters [*required*] | object   | JSON schema describing the function parameters.                                                                 |
| tools                 | type [*required*]       | string   | The type of tool.                                                                                               |
|                       | top_k                        | int64    | Top-K sampling parameter.                                                                                       |
|                       | top_p                        | double   | Nucleus sampling probability mass.                                                                              |
|                       | vertex_ai_metadata           | object   | Vertex AI-specific metadata for an integration account or inference request.                                    |
| vertex_ai_metadata    | location                     | string   | The Vertex AI region.                                                                                           |
| vertex_ai_metadata    | project                      | string   | The Google Cloud project ID.                                                                                    |
| vertex_ai_metadata    | project_ids                  | [string] | List of Google Cloud project IDs available to the service account.                                              |

{% /tab %}

{% tab title="Example" %}

```json
{
  "anthropic_metadata": {
    "effort": "medium",
    "thinking": {
      "budget_tokens": 1024,
      "type": "enabled"
    }
  },
  "azure_openai_metadata": {
    "deployment_id": "my-gpt4-deployment",
    "model_version": "0613",
    "resource_name": "my-azure-resource"
  },
  "bedrock_metadata": {
    "region": "us-east-1"
  },
  "frequency_penalty": 0,
  "json_schema": "{\"type\":\"object\",\"properties\":{\"answer\":{\"type\":\"string\"}}}",
  "max_completion_tokens": 1024,
  "max_tokens": 1024,
  "messages": [
    {
      "content": "What is the capital of France?",
      "contents": [
        {
          "type": "text",
          "value": {
            "text": "Hello, how can I help you?",
            "tool_call": {
              "arguments": {
                "location": "San Francisco"
              },
              "name": "get_weather",
              "tool_id": "call_abc123",
              "type": "function"
            },
            "tool_call_result": {
              "name": "get_weather",
              "result": "The weather in San Francisco is 68°F and sunny.",
              "tool_id": "call_abc123",
              "type": "function"
            }
          }
        }
      ],
      "id": "msg_001",
      "role": "user",
      "tool_calls": [
        {
          "arguments": {
            "location": "San Francisco"
          },
          "name": "get_weather",
          "tool_id": "call_abc123",
          "type": "function"
        }
      ],
      "tool_results": [
        {
          "name": "get_weather",
          "result": "The weather in San Francisco is 68°F and sunny.",
          "tool_id": "call_abc123",
          "type": "function"
        }
      ]
    }
  ],
  "model_id": "gpt-4o",
  "openai_metadata": {
    "reasoning_effort": "medium",
    "reasoning_summary": "auto"
  },
  "presence_penalty": 0,
  "temperature": 0.7,
  "tools": [
    {
      "function": {
        "description": "Get the current weather for a location.",
        "name": "get_weather",
        "parameters": {
          "properties": {
            "location": {
              "type": "string"
            }
          },
          "type": "object"
        }
      },
      "type": "function"
    }
  ],
  "top_k": 50,
  "top_p": 1,
  "vertex_ai_metadata": {
    "location": "us-central1",
    "project": "my-gcp-project",
    "project_ids": [
      "my-gcp-project"
    ]
  }
}
```

{% /tab %}

### Response

{% tab title="200" %}
OK
{% tab title="Model" %}
The result of an LLM inference request, including input parameters and the model response.

| Parent field          | Field                             | Type     | Description                                                                                                     |
| --------------------- | --------------------------------- | -------- | --------------------------------------------------------------------------------------------------------------- |
|                       | anthropic_metadata                | object   | Anthropic-specific metadata for an inference request.                                                           |
| anthropic_metadata    | effort                            | enum     | The effort level for Anthropic inference. Allowed enum values: `low,medium,high,max`                            |
| anthropic_metadata    | thinking                          | object   | Configuration for Anthropic extended thinking feature.                                                          |
| thinking              | budget_tokens                     | int64    | Maximum token budget for extended thinking. Required when type is `enabled`.                                    |
| thinking              | type [*required*]            | enum     | The thinking mode for Anthropic extended thinking. Allowed enum values: `enabled,disabled,adaptive`             |
|                       | azure_openai_metadata             | object   | Azure OpenAI-specific metadata for an integration account or inference request.                                 |
| azure_openai_metadata | deployment_id                     | string   | The Azure OpenAI deployment ID.                                                                                 |
| azure_openai_metadata | model_version                     | string   | The model version deployed in Azure.                                                                            |
| azure_openai_metadata | resource_name                     | string   | The Azure OpenAI resource name.                                                                                 |
|                       | bedrock_metadata                  | object   | Amazon Bedrock-specific metadata for an inference request.                                                      |
| bedrock_metadata      | region                            | string   | The AWS region for the Bedrock request.                                                                         |
|                       | error_response                    | object   | Error details returned when an inference provider returns an error.                                             |
| error_response        | message [*required*]         | string   | A human-readable description of the error.                                                                      |
| error_response        | type [*required*]            | string   | The provider-specific error type.                                                                               |
|                       | frequency_penalty                 | double   | Frequency penalty that was applied.                                                                             |
|                       | json_schema                       | string   | JSON schema that was applied for structured output.                                                             |
|                       | max_completion_tokens             | int64    | Maximum number of completion tokens that were configured.                                                       |
|                       | max_tokens                        | int64    | Maximum number of tokens that were configured.                                                                  |
|                       | messages [*required*]        | [object] | List of messages in an inference conversation.                                                                  |
| messages              | content                           | string   | Plain text content of the message.                                                                              |
| messages              | contents                          | [object] | List of structured content blocks in a message.                                                                 |
| contents              | type [*required*]            | string   | The content block type.                                                                                         |
| contents              | value [*required*]           | object   | The typed value of a message content block.                                                                     |
| value                 | text                              | string   | Plain text content.                                                                                             |
| value                 | tool_call                         | object   | A tool call made during LLM inference.                                                                          |
| tool_call             | arguments                         | object   | The arguments passed to the tool.                                                                               |
| tool_call             | name                              | string   | The name of the tool being called.                                                                              |
| tool_call             | tool_id                           | string   | Unique identifier for the tool call.                                                                            |
| tool_call             | type                              | string   | The type of tool call.                                                                                          |
| value                 | tool_call_result                  | object   | The result returned by a tool call during LLM inference.                                                        |
| tool_call_result      | name                              | string   | The name of the tool that produced this result.                                                                 |
| tool_call_result      | result                            | string   | The result content returned by the tool.                                                                        |
| tool_call_result      | tool_id                           | string   | Identifier matching the corresponding tool call.                                                                |
| tool_call_result      | type                              | string   | The type of tool result.                                                                                        |
| messages              | id                                | string   | Unique identifier for the message.                                                                              |
| messages              | role                              | string   | The role of the message author.                                                                                 |
| messages              | tool_calls                        | [object] | List of tool calls in a message.                                                                                |
| tool_calls            | arguments                         | object   | The arguments passed to the tool.                                                                               |
| tool_calls            | name                              | string   | The name of the tool being called.                                                                              |
| tool_calls            | tool_id                           | string   | Unique identifier for the tool call.                                                                            |
| tool_calls            | type                              | string   | The type of tool call.                                                                                          |
| messages              | tool_results                      | [object] | List of tool results in a message.                                                                              |
| tool_results          | name                              | string   | The name of the tool that produced this result.                                                                 |
| tool_results          | result                            | string   | The result content returned by the tool.                                                                        |
| tool_results          | tool_id                           | string   | Identifier matching the corresponding tool call.                                                                |
| tool_results          | type                              | string   | The type of tool result.                                                                                        |
|                       | model_id [*required*]        | string   | The model identifier used for inference.                                                                        |
|                       | openai_metadata                   | object   | OpenAI-specific metadata for an inference request.                                                              |
| openai_metadata       | reasoning_effort                  | enum     | The reasoning effort level for OpenAI models that support it. Allowed enum values: `none,low,medium,high,xhigh` |
| openai_metadata       | reasoning_summary                 | enum     | The verbosity of the reasoning summary. Allowed enum values: `auto,concise,detailed`                            |
|                       | presence_penalty                  | double   | Presence penalty that was applied.                                                                              |
|                       | response [*required*]        | object   | The output of a completed LLM inference call.                                                                   |
| response              | assessment [*required*]      | string   | An optional assessment of the inference output quality.                                                         |
| response              | content [*required*]         | string   | The text content of the model response.                                                                         |
| response              | finish_reason [*required*]   | string   | The reason the model stopped generating tokens.                                                                 |
| response              | inference_codes [*required*] | [object] | List of generated code snippets for the inference configuration.                                                |
| inference_codes       | code [*required*]            | string   | The generated code content.                                                                                     |
| inference_codes       | id [*required*]              | string   | Unique identifier for the code snippet.                                                                         |
| inference_codes       | type [*required*]            | string   | The programming language or SDK type of the code snippet.                                                       |
| response              | input_tokens [*required*]    | int64    | Number of input tokens consumed.                                                                                |
| response              | internal_reasoning                | object   | The model's internal reasoning or thinking output, if available.                                                |
| internal_reasoning    | reasoning_tokens                  | int64    | Number of tokens used for internal reasoning.                                                                   |
| internal_reasoning    | text [*required*]            | string   | The reasoning text produced by the model.                                                                       |
| response              | latency [*required*]         | int64    | Request latency in milliseconds.                                                                                |
| response              | output_tokens [*required*]   | int64    | Number of output tokens generated.                                                                              |
| response              | tools [*required*]           | [object] | List of tools available to the model.                                                                           |
| tools                 | function [*required*]        | object   | A function definition for a tool available to the model.                                                        |
| function              | description                       | string   | A description of what the function does.                                                                        |
| function              | name [*required*]            | string   | The name of the function.                                                                                       |
| function              | parameters [*required*]      | object   | JSON schema describing the function parameters.                                                                 |
| tools                 | type [*required*]            | string   | The type of tool.                                                                                               |
| response              | total_tokens [*required*]    | int64    | Total tokens used (input plus output).                                                                          |
|                       | temperature                       | double   | Sampling temperature that was used.                                                                             |
|                       | tools                             | [object] | List of tools available to the model.                                                                           |
| tools                 | function [*required*]        | object   | A function definition for a tool available to the model.                                                        |
| function              | description                       | string   | A description of what the function does.                                                                        |
| function              | name [*required*]            | string   | The name of the function.                                                                                       |
| function              | parameters [*required*]      | object   | JSON schema describing the function parameters.                                                                 |
| tools                 | type [*required*]            | string   | The type of tool.                                                                                               |
|                       | top_k                             | int64    | Top-K sampling parameter that was used.                                                                         |
|                       | top_p                             | double   | Nucleus sampling parameter that was used.                                                                       |
|                       | vertex_ai_metadata                | object   | Vertex AI-specific metadata for an integration account or inference request.                                    |
| vertex_ai_metadata    | location                          | string   | The Vertex AI region.                                                                                           |
| vertex_ai_metadata    | project                           | string   | The Google Cloud project ID.                                                                                    |
| vertex_ai_metadata    | project_ids                       | [string] | List of Google Cloud project IDs available to the service account.                                              |

{% /tab %}

{% tab title="Example" %}

```json
{
  "anthropic_metadata": {
    "effort": "medium",
    "thinking": {
      "budget_tokens": 1024,
      "type": "enabled"
    }
  },
  "azure_openai_metadata": {
    "deployment_id": "my-gpt4-deployment",
    "model_version": "0613",
    "resource_name": "my-azure-resource"
  },
  "bedrock_metadata": {
    "region": "us-east-1"
  },
  "error_response": {
    "message": "The model does not exist.",
    "type": "invalid_request_error"
  },
  "frequency_penalty": 0,
  "json_schema": "{\"type\":\"object\",\"properties\":{\"answer\":{\"type\":\"string\"}}}",
  "max_completion_tokens": 1024,
  "max_tokens": 1024,
  "messages": [
    {
      "content": "What is the capital of France?",
      "contents": [
        {
          "type": "text",
          "value": {
            "text": "Hello, how can I help you?",
            "tool_call": {
              "arguments": {
                "location": "San Francisco"
              },
              "name": "get_weather",
              "tool_id": "call_abc123",
              "type": "function"
            },
            "tool_call_result": {
              "name": "get_weather",
              "result": "The weather in San Francisco is 68°F and sunny.",
              "tool_id": "call_abc123",
              "type": "function"
            }
          }
        }
      ],
      "id": "msg_001",
      "role": "user",
      "tool_calls": [
        {
          "arguments": {
            "location": "San Francisco"
          },
          "name": "get_weather",
          "tool_id": "call_abc123",
          "type": "function"
        }
      ],
      "tool_results": [
        {
          "name": "get_weather",
          "result": "The weather in San Francisco is 68°F and sunny.",
          "tool_id": "call_abc123",
          "type": "function"
        }
      ]
    }
  ],
  "model_id": "gpt-4o",
  "openai_metadata": {
    "reasoning_effort": "medium",
    "reasoning_summary": "auto"
  },
  "presence_penalty": 0,
  "response": {
    "assessment": "pass",
    "content": "The capital of France is Paris.",
    "finish_reason": "stop",
    "inference_codes": [
      {
        "code": "import openai\nclient = openai.OpenAI()\n...",
        "id": "code-python-001",
        "type": "python"
      }
    ],
    "input_tokens": 15,
    "internal_reasoning": {
      "reasoning_tokens": 256,
      "text": "Let me think about this step by step..."
    },
    "latency": 843,
    "output_tokens": 10,
    "tools": [
      {
        "function": {
          "description": "Get the current weather for a location.",
          "name": "get_weather",
          "parameters": {
            "properties": {
              "location": {
                "type": "string"
              }
            },
            "type": "object"
          }
        },
        "type": "function"
      }
    ],
    "total_tokens": 25
  },
  "temperature": 0.7,
  "tools": [
    {
      "function": {
        "description": "Get the current weather for a location.",
        "name": "get_weather",
        "parameters": {
          "properties": {
            "location": {
              "type": "string"
            }
          },
          "type": "object"
        }
      },
      "type": "function"
    }
  ],
  "top_k": 50,
  "top_p": 1,
  "vertex_ai_metadata": {
    "location": "us-central1",
    "project": "my-gcp-project",
    "project_ids": [
      "my-gcp-project"
    ]
  }
}
```

{% /tab %}

{% /tab %}

{% tab title="400" %}
Bad Request
{% tab title="Model" %}
API error response.

| Parent field | Field                    | Type     | Description                                                                     |
| ------------ | ------------------------ | -------- | ------------------------------------------------------------------------------- |
|              | errors [*required*] | [object] | A list of errors.                                                               |
| errors       | detail                   | string   | A human-readable explanation specific to this occurrence of the error.          |
| errors       | meta                     | object   | Non-standard meta-information about the error                                   |
| errors       | source                   | object   | References to the source of the error.                                          |
| source       | header                   | string   | A string indicating the name of a single request header which caused the error. |
| source       | parameter                | string   | A string indicating which URI query parameter caused the error.                 |
| source       | pointer                  | string   | A JSON pointer to the value in the request document that caused the error.      |
| errors       | status                   | string   | Status code of the response.                                                    |
| errors       | title                    | string   | Short human-readable summary of the error.                                      |

{% /tab %}

{% tab title="Example" %}

```json
{
  "errors": [
    {
      "detail": "Missing required attribute in body",
      "meta": {},
      "source": {
        "header": "Authorization",
        "parameter": "limit",
        "pointer": "/data/attributes/title"
      },
      "status": "400",
      "title": "Bad Request"
    }
  ]
}
```

{% /tab %}

{% /tab %}

{% tab title="401" %}
Unauthorized
{% tab title="Model" %}
API error response.

| Parent field | Field                    | Type     | Description                                                                     |
| ------------ | ------------------------ | -------- | ------------------------------------------------------------------------------- |
|              | errors [*required*] | [object] | A list of errors.                                                               |
| errors       | detail                   | string   | A human-readable explanation specific to this occurrence of the error.          |
| errors       | meta                     | object   | Non-standard meta-information about the error                                   |
| errors       | source                   | object   | References to the source of the error.                                          |
| source       | header                   | string   | A string indicating the name of a single request header which caused the error. |
| source       | parameter                | string   | A string indicating which URI query parameter caused the error.                 |
| source       | pointer                  | string   | A JSON pointer to the value in the request document that caused the error.      |
| errors       | status                   | string   | Status code of the response.                                                    |
| errors       | title                    | string   | Short human-readable summary of the error.                                      |

{% /tab %}

{% tab title="Example" %}

```json
{
  "errors": [
    {
      "detail": "Missing required attribute in body",
      "meta": {},
      "source": {
        "header": "Authorization",
        "parameter": "limit",
        "pointer": "/data/attributes/title"
      },
      "status": "400",
      "title": "Bad Request"
    }
  ]
}
```

{% /tab %}

{% /tab %}

{% tab title="403" %}
Forbidden
{% tab title="Model" %}
API error response.

| Parent field | Field                    | Type     | Description                                                                     |
| ------------ | ------------------------ | -------- | ------------------------------------------------------------------------------- |
|              | errors [*required*] | [object] | A list of errors.                                                               |
| errors       | detail                   | string   | A human-readable explanation specific to this occurrence of the error.          |
| errors       | meta                     | object   | Non-standard meta-information about the error                                   |
| errors       | source                   | object   | References to the source of the error.                                          |
| source       | header                   | string   | A string indicating the name of a single request header which caused the error. |
| source       | parameter                | string   | A string indicating which URI query parameter caused the error.                 |
| source       | pointer                  | string   | A JSON pointer to the value in the request document that caused the error.      |
| errors       | status                   | string   | Status code of the response.                                                    |
| errors       | title                    | string   | Short human-readable summary of the error.                                      |

{% /tab %}

{% tab title="Example" %}

```json
{
  "errors": [
    {
      "detail": "Missing required attribute in body",
      "meta": {},
      "source": {
        "header": "Authorization",
        "parameter": "limit",
        "pointer": "/data/attributes/title"
      },
      "status": "400",
      "title": "Bad Request"
    }
  ]
}
```

{% /tab %}

{% /tab %}

{% tab title="429" %}
Too many requests
{% tab title="Model" %}
API error response.

| Field                    | Type     | Description       |
| ------------------------ | -------- | ----------------- |
| errors [*required*] | [string] | A list of errors. |

{% /tab %}

{% tab title="Example" %}

```json
{
  "errors": [
    "Bad Request"
  ]
}
```

{% /tab %}

{% /tab %}

{% tab title="500" %}
Internal Server Error
{% tab title="Model" %}
API error response.

| Parent field | Field                    | Type     | Description                                                                     |
| ------------ | ------------------------ | -------- | ------------------------------------------------------------------------------- |
|              | errors [*required*] | [object] | A list of errors.                                                               |
| errors       | detail                   | string   | A human-readable explanation specific to this occurrence of the error.          |
| errors       | meta                     | object   | Non-standard meta-information about the error                                   |
| errors       | source                   | object   | References to the source of the error.                                          |
| source       | header                   | string   | A string indicating the name of a single request header which caused the error. |
| source       | parameter                | string   | A string indicating which URI query parameter caused the error.                 |
| source       | pointer                  | string   | A JSON pointer to the value in the request document that caused the error.      |
| errors       | status                   | string   | Status code of the response.                                                    |
| errors       | title                    | string   | Short human-readable summary of the error.                                      |

{% /tab %}

{% tab title="Example" %}

```json
{
  "errors": [
    {
      "detail": "Missing required attribute in body",
      "meta": {},
      "source": {
        "header": "Authorization",
        "parameter": "limit",
        "pointer": "/data/attributes/title"
      },
      "status": "400",
      "title": "Bad Request"
    }
  ]
}
```

{% /tab %}

{% /tab %}

### Code Example

##### 
                  \## default
# 
 \# Path parameters export integration="openai" export account_id="account-abc123" \# Curl command curl -X POST "https://api.datadoghq.com/api/v2/llm-obs/v1/integrations/${integration}/${account_id}/inference" \
-H "Accept: application/json" \
-H "Content-Type: application/json" \
-H "DD-API-KEY: ${DD_API_KEY}" \
-H "DD-APPLICATION-KEY: ${DD_APP_KEY}" \
-d @- << EOF
{
  "max_tokens": 256,
  "messages": [
    {
      "content": "What is the capital of France?",
      "role": "user"
    }
  ],
  "model_id": "gpt-4o",
  "temperature": 0.7
}
EOF 
                
##### 

```python
"""
Run an LLM inference returns "OK" response
"""

from datadog_api_client import ApiClient, Configuration
from datadog_api_client.v2.api.llm_observability_api import LLMObservabilityApi
from datadog_api_client.v2.model.llm_obs_anthropic_effort import LLMObsAnthropicEffort
from datadog_api_client.v2.model.llm_obs_anthropic_metadata import LLMObsAnthropicMetadata
from datadog_api_client.v2.model.llm_obs_anthropic_thinking_config import LLMObsAnthropicThinkingConfig
from datadog_api_client.v2.model.llm_obs_anthropic_thinking_type import LLMObsAnthropicThinkingType
from datadog_api_client.v2.model.llm_obs_azure_open_ai_metadata import LLMObsAzureOpenAIMetadata
from datadog_api_client.v2.model.llm_obs_bedrock_metadata import LLMObsBedrockMetadata
from datadog_api_client.v2.model.llm_obs_inference_content import LLMObsInferenceContent
from datadog_api_client.v2.model.llm_obs_inference_content_value import LLMObsInferenceContentValue
from datadog_api_client.v2.model.llm_obs_inference_function import LLMObsInferenceFunction
from datadog_api_client.v2.model.llm_obs_inference_message import LLMObsInferenceMessage
from datadog_api_client.v2.model.llm_obs_inference_tool import LLMObsInferenceTool
from datadog_api_client.v2.model.llm_obs_inference_tool_call import LLMObsInferenceToolCall
from datadog_api_client.v2.model.llm_obs_inference_tool_result import LLMObsInferenceToolResult
from datadog_api_client.v2.model.llm_obs_integration_inference_request import LLMObsIntegrationInferenceRequest
from datadog_api_client.v2.model.llm_obs_integration_name import LLMObsIntegrationName
from datadog_api_client.v2.model.llm_obs_open_ai_metadata import LLMObsOpenAIMetadata
from datadog_api_client.v2.model.llm_obs_open_ai_reasoning_effort import LLMObsOpenAIReasoningEffort
from datadog_api_client.v2.model.llm_obs_open_ai_reasoning_summary import LLMObsOpenAIReasoningSummary
from datadog_api_client.v2.model.llm_obs_vertex_ai_metadata import LLMObsVertexAIMetadata

body = LLMObsIntegrationInferenceRequest(
    anthropic_metadata=LLMObsAnthropicMetadata(
        effort=LLMObsAnthropicEffort.MEDIUM,
        thinking=LLMObsAnthropicThinkingConfig(
            budget_tokens=1024,
            type=LLMObsAnthropicThinkingType.ENABLED,
        ),
    ),
    azure_openai_metadata=LLMObsAzureOpenAIMetadata(
        deployment_id="my-gpt4-deployment",
        model_version="0613",
        resource_name="my-azure-resource",
    ),
    bedrock_metadata=LLMObsBedrockMetadata(
        region="us-east-1",
    ),
    frequency_penalty=0.0,
    json_schema='{"type":"object","properties":{"answer":{"type":"string"}}}',
    max_completion_tokens=1024,
    max_tokens=1024,
    messages=[
        LLMObsInferenceMessage(
            content="What is the capital of France?",
            contents=[
                LLMObsInferenceContent(
                    type="text",
                    value=LLMObsInferenceContentValue(
                        text="Hello, how can I help you?",
                        tool_call=LLMObsInferenceToolCall(
                            arguments=dict([("location", "San Francisco")]),
                            name="get_weather",
                            tool_id="call_abc123",
                            type="function",
                        ),
                        tool_call_result=LLMObsInferenceToolResult(
                            name="get_weather",
                            result="The weather in San Francisco is 68°F and sunny.",
                            tool_id="call_abc123",
                            type="function",
                        ),
                    ),
                ),
            ],
            id="msg_001",
            role="user",
            tool_calls=[
                LLMObsInferenceToolCall(
                    arguments=dict([("location", "San Francisco")]),
                    name="get_weather",
                    tool_id="call_abc123",
                    type="function",
                ),
            ],
            tool_results=[
                LLMObsInferenceToolResult(
                    name="get_weather",
                    result="The weather in San Francisco is 68°F and sunny.",
                    tool_id="call_abc123",
                    type="function",
                ),
            ],
        ),
    ],
    model_id="gpt-4o",
    openai_metadata=LLMObsOpenAIMetadata(
        reasoning_effort=LLMObsOpenAIReasoningEffort.MEDIUM,
        reasoning_summary=LLMObsOpenAIReasoningSummary.AUTO,
    ),
    presence_penalty=0.0,
    temperature=0.7,
    tools=[
        LLMObsInferenceTool(
            function=LLMObsInferenceFunction(
                description="Get the current weather for a location.",
                name="get_weather",
                parameters=dict([("properties", "{'location': {'type': 'string'}}"), ("type", "object")]),
            ),
            type="function",
        ),
    ],
    top_k=50,
    top_p=1.0,
    vertex_ai_metadata=LLMObsVertexAIMetadata(
        location="us-central1",
        project="my-gcp-project",
        project_ids=[
            "my-gcp-project",
        ],
    ),
)

configuration = Configuration()
configuration.unstable_operations["create_llm_obs_integration_inference"] = True
with ApiClient(configuration) as api_client:
    api_instance = LLMObservabilityApi(api_client)
    response = api_instance.create_llm_obs_integration_inference(
        integration=LLMObsIntegrationName.OPENAI, account_id="account_id", body=body
    )

    print(response)
```

#### Instructions

First [install the library and its dependencies](https://docs.datadoghq.com/api/latest.md?code-lang=python) and then save the example to `example.py` and run following commands:
    DD_SITE="datadoghq.com" DD_API_KEY="<DD_API_KEY>" DD_APP_KEY="<DD_APP_KEY>" python3 "example.py"
##### 

```ruby
# Run an LLM inference returns "OK" response

require "datadog_api_client"
DatadogAPIClient.configure do |config|
  config.unstable_operations["v2.create_llm_obs_integration_inference".to_sym] = true
end
api_instance = DatadogAPIClient::V2::LLMObservabilityAPI.new

body = DatadogAPIClient::V2::LLMObsIntegrationInferenceRequest.new({
  anthropic_metadata: DatadogAPIClient::V2::LLMObsAnthropicMetadata.new({
    effort: DatadogAPIClient::V2::LLMObsAnthropicEffort::MEDIUM,
    thinking: DatadogAPIClient::V2::LLMObsAnthropicThinkingConfig.new({
      budget_tokens: 1024,
      type: DatadogAPIClient::V2::LLMObsAnthropicThinkingType::ENABLED,
    }),
  }),
  azure_openai_metadata: DatadogAPIClient::V2::LLMObsAzureOpenAIMetadata.new({
    deployment_id: "my-gpt4-deployment",
    model_version: "0613",
    resource_name: "my-azure-resource",
  }),
  bedrock_metadata: DatadogAPIClient::V2::LLMObsBedrockMetadata.new({
    region: "us-east-1",
  }),
  frequency_penalty: 0.0,
  json_schema: '{"type":"object","properties":{"answer":{"type":"string"}}}',
  max_completion_tokens: 1024,
  max_tokens: 1024,
  messages: [
    DatadogAPIClient::V2::LLMObsInferenceMessage.new({
      content: "What is the capital of France?",
      contents: [
        DatadogAPIClient::V2::LLMObsInferenceContent.new({
          type: "text",
          value: DatadogAPIClient::V2::LLMObsInferenceContentValue.new({
            text: "Hello, how can I help you?",
            tool_call: DatadogAPIClient::V2::LLMObsInferenceToolCall.new({
              arguments: {
                "location": "San Francisco",
              },
              name: "get_weather",
              tool_id: "call_abc123",
              type: "function",
            }),
            tool_call_result: DatadogAPIClient::V2::LLMObsInferenceToolResult.new({
              name: "get_weather",
              result: "The weather in San Francisco is 68°F and sunny.",
              tool_id: "call_abc123",
              type: "function",
            }),
          }),
        }),
      ],
      id: "msg_001",
      role: "user",
      tool_calls: [
        DatadogAPIClient::V2::LLMObsInferenceToolCall.new({
          arguments: {
            "location": "San Francisco",
          },
          name: "get_weather",
          tool_id: "call_abc123",
          type: "function",
        }),
      ],
      tool_results: [
        DatadogAPIClient::V2::LLMObsInferenceToolResult.new({
          name: "get_weather",
          result: "The weather in San Francisco is 68°F and sunny.",
          tool_id: "call_abc123",
          type: "function",
        }),
      ],
    }),
  ],
  model_id: "gpt-4o",
  openai_metadata: DatadogAPIClient::V2::LLMObsOpenAIMetadata.new({
    reasoning_effort: DatadogAPIClient::V2::LLMObsOpenAIReasoningEffort::MEDIUM,
    reasoning_summary: DatadogAPIClient::V2::LLMObsOpenAIReasoningSummary::AUTO,
  }),
  presence_penalty: 0.0,
  temperature: 0.7,
  tools: [
    DatadogAPIClient::V2::LLMObsInferenceTool.new({
      function: DatadogAPIClient::V2::LLMObsInferenceFunction.new({
        description: "Get the current weather for a location.",
        name: "get_weather",
        parameters: {
          "properties": "{'location': {'type': 'string'}}", "type": "object",
        },
      }),
      type: "function",
    }),
  ],
  top_k: 50,
  top_p: 1.0,
  vertex_ai_metadata: DatadogAPIClient::V2::LLMObsVertexAIMetadata.new({
    location: "us-central1",
    project: "my-gcp-project",
    project_ids: [
      "my-gcp-project",
    ],
  }),
})
p api_instance.create_llm_obs_integration_inference(LLMObsIntegrationName::OPENAI, "account_id", body)
```

#### Instructions

First [install the library and its dependencies](https://docs.datadoghq.com/api/latest.md?code-lang=ruby) and then save the example to `example.rb` and run following commands:
    DD_SITE="datadoghq.com" DD_API_KEY="<DD_API_KEY>" DD_APP_KEY="<DD_APP_KEY>" rb "example.rb"
##### 

```go
// Run an LLM inference returns "OK" response

package main

import (
	"context"
	"encoding/json"
	"fmt"
	"os"

	"github.com/DataDog/datadog-api-client-go/v2/api/datadog"
	"github.com/DataDog/datadog-api-client-go/v2/api/datadogV2"
)

func main() {
	body := datadogV2.LLMObsIntegrationInferenceRequest{
		AnthropicMetadata: &datadogV2.LLMObsAnthropicMetadata{
			Effort: *datadogV2.NewNullableLLMObsAnthropicEffort(datadogV2.LLMOBSANTHROPICEFFORT_MEDIUM.Ptr()),
			Thinking: &datadogV2.LLMObsAnthropicThinkingConfig{
				BudgetTokens: *datadog.NewNullableInt64(datadog.PtrInt64(1024)),
				Type:         datadogV2.LLMOBSANTHROPICTHINKINGTYPE_ENABLED,
			},
		},
		AzureOpenaiMetadata: &datadogV2.LLMObsAzureOpenAIMetadata{
			DeploymentId: datadog.PtrString("my-gpt4-deployment"),
			ModelVersion: datadog.PtrString("0613"),
			ResourceName: datadog.PtrString("my-azure-resource"),
		},
		BedrockMetadata: &datadogV2.LLMObsBedrockMetadata{
			Region: datadog.PtrString("us-east-1"),
		},
		FrequencyPenalty:    *datadog.NewNullableFloat64(datadog.PtrFloat64(0.0)),
		JsonSchema:          *datadog.NewNullableString(datadog.PtrString(`{"type":"object","properties":{"answer":{"type":"string"}}}`)),
		MaxCompletionTokens: *datadog.NewNullableInt64(datadog.PtrInt64(1024)),
		MaxTokens:           *datadog.NewNullableInt64(datadog.PtrInt64(1024)),
		Messages: []datadogV2.LLMObsInferenceMessage{
			{
				Content: datadog.PtrString("What is the capital of France?"),
				Contents: []datadogV2.LLMObsInferenceContent{
					{
						Type: "text",
						Value: datadogV2.LLMObsInferenceContentValue{
							Text: datadog.PtrString("Hello, how can I help you?"),
							ToolCall: &datadogV2.LLMObsInferenceToolCall{
								Arguments: map[string]interface{}{
									"location": "San Francisco",
								},
								Name:   datadog.PtrString("get_weather"),
								ToolId: datadog.PtrString("call_abc123"),
								Type:   datadog.PtrString("function"),
							},
							ToolCallResult: &datadogV2.LLMObsInferenceToolResult{
								Name:   datadog.PtrString("get_weather"),
								Result: datadog.PtrString("The weather in San Francisco is 68°F and sunny."),
								ToolId: datadog.PtrString("call_abc123"),
								Type:   datadog.PtrString("function"),
							},
						},
					},
				},
				Id:   datadog.PtrString("msg_001"),
				Role: datadog.PtrString("user"),
				ToolCalls: []datadogV2.LLMObsInferenceToolCall{
					{
						Arguments: map[string]interface{}{
							"location": "San Francisco",
						},
						Name:   datadog.PtrString("get_weather"),
						ToolId: datadog.PtrString("call_abc123"),
						Type:   datadog.PtrString("function"),
					},
				},
				ToolResults: []datadogV2.LLMObsInferenceToolResult{
					{
						Name:   datadog.PtrString("get_weather"),
						Result: datadog.PtrString("The weather in San Francisco is 68°F and sunny."),
						ToolId: datadog.PtrString("call_abc123"),
						Type:   datadog.PtrString("function"),
					},
				},
			},
		},
		ModelId: "gpt-4o",
		OpenaiMetadata: &datadogV2.LLMObsOpenAIMetadata{
			ReasoningEffort:  *datadogV2.NewNullableLLMObsOpenAIReasoningEffort(datadogV2.LLMOBSOPENAIREASONINGEFFORT_MEDIUM.Ptr()),
			ReasoningSummary: *datadogV2.NewNullableLLMObsOpenAIReasoningSummary(datadogV2.LLMOBSOPENAIREASONINGSUMMARY_AUTO.Ptr()),
		},
		PresencePenalty: *datadog.NewNullableFloat64(datadog.PtrFloat64(0.0)),
		Temperature:     *datadog.NewNullableFloat64(datadog.PtrFloat64(0.7)),
		Tools: []datadogV2.LLMObsInferenceTool{
			{
				Function: datadogV2.LLMObsInferenceFunction{
					Description: datadog.PtrString("Get the current weather for a location."),
					Name:        "get_weather",
					Parameters: map[string]interface{}{
						"properties": "{'location': {'type': 'string'}}",
						"type":       "object",
					},
				},
				Type: "function",
			},
		},
		TopK: *datadog.NewNullableInt64(datadog.PtrInt64(50)),
		TopP: *datadog.NewNullableFloat64(datadog.PtrFloat64(1.0)),
		VertexAiMetadata: &datadogV2.LLMObsVertexAIMetadata{
			Location: datadog.PtrString("us-central1"),
			Project:  datadog.PtrString("my-gcp-project"),
			ProjectIds: []string{
				"my-gcp-project",
			},
		},
	}
	ctx := datadog.NewDefaultContext(context.Background())
	configuration := datadog.NewConfiguration()
	configuration.SetUnstableOperationEnabled("v2.CreateLLMObsIntegrationInference", true)
	apiClient := datadog.NewAPIClient(configuration)
	api := datadogV2.NewLLMObservabilityApi(apiClient)
	resp, r, err := api.CreateLLMObsIntegrationInference(ctx, datadogV2.LLMOBSINTEGRATIONNAME_OPENAI, "account_id", body)

	if err != nil {
		fmt.Fprintf(os.Stderr, "Error when calling `LLMObservabilityApi.CreateLLMObsIntegrationInference`: %v\n", err)
		fmt.Fprintf(os.Stderr, "Full HTTP response: %v\n", r)
	}

	responseContent, _ := json.MarshalIndent(resp, "", "  ")
	fmt.Fprintf(os.Stdout, "Response from `LLMObservabilityApi.CreateLLMObsIntegrationInference`:\n%s\n", responseContent)
}
```

#### Instructions

First [install the library and its dependencies](https://docs.datadoghq.com/api/latest.md?code-lang=go) and then save the example to `main.go` and run following commands:
    DD_SITE="datadoghq.com" DD_API_KEY="<DD_API_KEY>" DD_APP_KEY="<DD_APP_KEY>" go run "main.go"
##### 

```java
// Run an LLM inference returns "OK" response

import com.datadog.api.client.ApiClient;
import com.datadog.api.client.ApiException;
import com.datadog.api.client.v2.api.LlmObservabilityApi;
import com.datadog.api.client.v2.model.LLMObsAnthropicEffort;
import com.datadog.api.client.v2.model.LLMObsAnthropicMetadata;
import com.datadog.api.client.v2.model.LLMObsAnthropicThinkingConfig;
import com.datadog.api.client.v2.model.LLMObsAnthropicThinkingType;
import com.datadog.api.client.v2.model.LLMObsAzureOpenAIMetadata;
import com.datadog.api.client.v2.model.LLMObsBedrockMetadata;
import com.datadog.api.client.v2.model.LLMObsInferenceContent;
import com.datadog.api.client.v2.model.LLMObsInferenceContentValue;
import com.datadog.api.client.v2.model.LLMObsInferenceFunction;
import com.datadog.api.client.v2.model.LLMObsInferenceMessage;
import com.datadog.api.client.v2.model.LLMObsInferenceTool;
import com.datadog.api.client.v2.model.LLMObsInferenceToolCall;
import com.datadog.api.client.v2.model.LLMObsInferenceToolResult;
import com.datadog.api.client.v2.model.LLMObsIntegrationInferenceRequest;
import com.datadog.api.client.v2.model.LLMObsIntegrationInferenceResponse;
import com.datadog.api.client.v2.model.LLMObsIntegrationName;
import com.datadog.api.client.v2.model.LLMObsOpenAIMetadata;
import com.datadog.api.client.v2.model.LLMObsOpenAIReasoningEffort;
import com.datadog.api.client.v2.model.LLMObsOpenAIReasoningSummary;
import com.datadog.api.client.v2.model.LLMObsVertexAIMetadata;
import java.util.Collections;
import java.util.Map;

public class Example {
  public static void main(String[] args) {
    ApiClient defaultClient = ApiClient.getDefaultApiClient();
    defaultClient.setUnstableOperationEnabled("v2.createLLMObsIntegrationInference", true);
    LlmObservabilityApi apiInstance = new LlmObservabilityApi(defaultClient);

    LLMObsIntegrationInferenceRequest body =
        new LLMObsIntegrationInferenceRequest()
            .anthropicMetadata(
                new LLMObsAnthropicMetadata()
                    .effort(LLMObsAnthropicEffort.MEDIUM)
                    .thinking(
                        new LLMObsAnthropicThinkingConfig()
                            .budgetTokens(1024L)
                            .type(LLMObsAnthropicThinkingType.ENABLED)))
            .azureOpenaiMetadata(
                new LLMObsAzureOpenAIMetadata()
                    .deploymentId("my-gpt4-deployment")
                    .modelVersion("0613")
                    .resourceName("my-azure-resource"))
            .bedrockMetadata(new LLMObsBedrockMetadata().region("us-east-1"))
            .jsonSchema("""
{"type":"object","properties":{"answer":{"type":"string"}}}
""")
            .maxCompletionTokens(1024L)
            .maxTokens(1024L)
            .messages(
                Collections.singletonList(
                    new LLMObsInferenceMessage()
                        .content("What is the capital of France?")
                        .contents(
                            Collections.singletonList(
                                new LLMObsInferenceContent()
                                    .type("text")
                                    .value(
                                        new LLMObsInferenceContentValue()
                                            .text("Hello, how can I help you?")
                                            .toolCall(
                                                new LLMObsInferenceToolCall()
                                                    .arguments(
                                                        Map.ofEntries(
                                                            Map.entry("location", "San Francisco")))
                                                    .name("get_weather")
                                                    .toolId("call_abc123")
                                                    .type("function"))
                                            .toolCallResult(
                                                new LLMObsInferenceToolResult()
                                                    .name("get_weather")
                                                    .result(
                                                        "The weather in San Francisco is 68°F and"
                                                            + " sunny.")
                                                    .toolId("call_abc123")
                                                    .type("function")))))
                        .id("msg_001")
                        .role("user")
                        .toolCalls(
                            Collections.singletonList(
                                new LLMObsInferenceToolCall()
                                    .arguments(
                                        Map.ofEntries(Map.entry("location", "San Francisco")))
                                    .name("get_weather")
                                    .toolId("call_abc123")
                                    .type("function")))
                        .toolResults(
                            Collections.singletonList(
                                new LLMObsInferenceToolResult()
                                    .name("get_weather")
                                    .result("The weather in San Francisco is 68°F and sunny.")
                                    .toolId("call_abc123")
                                    .type("function")))))
            .modelId("gpt-4o")
            .openaiMetadata(
                new LLMObsOpenAIMetadata()
                    .reasoningEffort(LLMObsOpenAIReasoningEffort.MEDIUM)
                    .reasoningSummary(LLMObsOpenAIReasoningSummary.AUTO))
            .temperature(0.7)
            .tools(
                Collections.singletonList(
                    new LLMObsInferenceTool()
                        .function(
                            new LLMObsInferenceFunction()
                                .description("Get the current weather for a location.")
                                .name("get_weather")
                                .parameters(
                                    Map.ofEntries(
                                        Map.entry("properties", "{'location': {'type': 'string'}}"),
                                        Map.entry("type", "object"))))
                        .type("function")))
            .topK(50L)
            .topP(1.0)
            .vertexAiMetadata(
                new LLMObsVertexAIMetadata()
                    .location("us-central1")
                    .project("my-gcp-project")
                    .projectIds(Collections.singletonList("my-gcp-project")));

    try {
      LLMObsIntegrationInferenceResponse result =
          apiInstance.createLLMObsIntegrationInference(
              LLMObsIntegrationName.OPENAI, "account-abc123", body);
      System.out.println(result);
    } catch (ApiException e) {
      System.err.println(
          "Exception when calling LlmObservabilityApi#createLLMObsIntegrationInference");
      System.err.println("Status code: " + e.getCode());
      System.err.println("Reason: " + e.getResponseBody());
      System.err.println("Response headers: " + e.getResponseHeaders());
      e.printStackTrace();
    }
  }
}
```

#### Instructions

First [install the library and its dependencies](https://docs.datadoghq.com/api/latest.md?code-lang=java) and then save the example to `Example.java` and run following commands:
    DD_SITE="datadoghq.com" DD_API_KEY="<DD_API_KEY>" DD_APP_KEY="<DD_APP_KEY>" java "Example.java"
##### 

```rust
// Run an LLM inference returns "OK" response
use datadog_api_client::datadog;
use datadog_api_client::datadogV2::api_llm_observability::LLMObservabilityAPI;
use datadog_api_client::datadogV2::model::LLMObsAnthropicEffort;
use datadog_api_client::datadogV2::model::LLMObsAnthropicMetadata;
use datadog_api_client::datadogV2::model::LLMObsAnthropicThinkingConfig;
use datadog_api_client::datadogV2::model::LLMObsAnthropicThinkingType;
use datadog_api_client::datadogV2::model::LLMObsAzureOpenAIMetadata;
use datadog_api_client::datadogV2::model::LLMObsBedrockMetadata;
use datadog_api_client::datadogV2::model::LLMObsInferenceContent;
use datadog_api_client::datadogV2::model::LLMObsInferenceContentValue;
use datadog_api_client::datadogV2::model::LLMObsInferenceFunction;
use datadog_api_client::datadogV2::model::LLMObsInferenceMessage;
use datadog_api_client::datadogV2::model::LLMObsInferenceTool;
use datadog_api_client::datadogV2::model::LLMObsInferenceToolCall;
use datadog_api_client::datadogV2::model::LLMObsInferenceToolResult;
use datadog_api_client::datadogV2::model::LLMObsIntegrationInferenceRequest;
use datadog_api_client::datadogV2::model::LLMObsIntegrationName;
use datadog_api_client::datadogV2::model::LLMObsOpenAIMetadata;
use datadog_api_client::datadogV2::model::LLMObsOpenAIReasoningEffort;
use datadog_api_client::datadogV2::model::LLMObsOpenAIReasoningSummary;
use datadog_api_client::datadogV2::model::LLMObsVertexAIMetadata;
use serde_json::Value;
use std::collections::BTreeMap;

#[tokio::main]
async fn main() {
    let body = LLMObsIntegrationInferenceRequest::new(
        vec![LLMObsInferenceMessage::new()
            .content("What is the capital of France?".to_string())
            .contents(vec![LLMObsInferenceContent::new(
                "text".to_string(),
                LLMObsInferenceContentValue::new()
                    .text("Hello, how can I help you?".to_string())
                    .tool_call(
                        LLMObsInferenceToolCall::new()
                            .arguments(BTreeMap::from([(
                                "location".to_string(),
                                Value::from("San Francisco"),
                            )]))
                            .name("get_weather".to_string())
                            .tool_id("call_abc123".to_string())
                            .type_("function".to_string()),
                    )
                    .tool_call_result(
                        LLMObsInferenceToolResult::new()
                            .name("get_weather".to_string())
                            .result("The weather in San Francisco is 68°F and sunny.".to_string())
                            .tool_id("call_abc123".to_string())
                            .type_("function".to_string()),
                    ),
            )])
            .id("msg_001".to_string())
            .role("user".to_string())
            .tool_calls(vec![LLMObsInferenceToolCall::new()
                .arguments(BTreeMap::from([(
                    "location".to_string(),
                    Value::from("San Francisco"),
                )]))
                .name("get_weather".to_string())
                .tool_id("call_abc123".to_string())
                .type_("function".to_string())])
            .tool_results(vec![LLMObsInferenceToolResult::new()
                .name("get_weather".to_string())
                .result("The weather in San Francisco is 68°F and sunny.".to_string())
                .tool_id("call_abc123".to_string())
                .type_("function".to_string())])],
        "gpt-4o".to_string(),
    )
    .anthropic_metadata(
        LLMObsAnthropicMetadata::new()
            .effort(Some(LLMObsAnthropicEffort::MEDIUM))
            .thinking(
                LLMObsAnthropicThinkingConfig::new(LLMObsAnthropicThinkingType::ENABLED)
                    .budget_tokens(Some(1024)),
            ),
    )
    .azure_openai_metadata(
        LLMObsAzureOpenAIMetadata::new()
            .deployment_id("my-gpt4-deployment".to_string())
            .model_version("0613".to_string())
            .resource_name("my-azure-resource".to_string()),
    )
    .bedrock_metadata(LLMObsBedrockMetadata::new().region("us-east-1".to_string()))
    .frequency_penalty(Some(0.0 as f64))
    .json_schema(Some(
        r#"{"type":"object","properties":{"answer":{"type":"string"}}}"#.to_string(),
    ))
    .max_completion_tokens(Some(1024))
    .max_tokens(Some(1024))
    .openai_metadata(
        LLMObsOpenAIMetadata::new()
            .reasoning_effort(Some(LLMObsOpenAIReasoningEffort::MEDIUM))
            .reasoning_summary(Some(LLMObsOpenAIReasoningSummary::AUTO)),
    )
    .presence_penalty(Some(0.0 as f64))
    .temperature(Some(0.7 as f64))
    .tools(vec![LLMObsInferenceTool::new(
        LLMObsInferenceFunction::new(
            "get_weather".to_string(),
            BTreeMap::from([("type".to_string(), Value::from("object"))]),
        )
        .description("Get the current weather for a location.".to_string()),
        "function".to_string(),
    )])
    .top_k(Some(50))
    .top_p(Some(1.0 as f64))
    .vertex_ai_metadata(
        LLMObsVertexAIMetadata::new()
            .location("us-central1".to_string())
            .project("my-gcp-project".to_string())
            .project_ids(vec!["my-gcp-project".to_string()]),
    );
    let mut configuration = datadog::Configuration::new();
    configuration.set_unstable_operation_enabled("v2.CreateLLMObsIntegrationInference", true);
    let api = LLMObservabilityAPI::with_config(configuration);
    let resp = api
        .create_llm_obs_integration_inference(
            LLMObsIntegrationName::OPENAI,
            "account_id".to_string(),
            body,
        )
        .await;
    if let Ok(value) = resp {
        println!("{:#?}", value);
    } else {
        println!("{:#?}", resp.unwrap_err());
    }
}
```

#### Instructions

First [install the library and its dependencies](https://docs.datadoghq.com/api/latest.md?code-lang=rust) and then save the example to `src/main.rs` and run following commands:
    DD_SITE="datadoghq.com" DD_API_KEY="<DD_API_KEY>" DD_APP_KEY="<DD_APP_KEY>" cargo run
##### 

```typescript
/**
 * Run an LLM inference returns "OK" response
 */

import { client, v2 } from "@datadog/datadog-api-client";

const configuration = client.createConfiguration();
configuration.unstableOperations["v2.createLLMObsIntegrationInference"] = true;
const apiInstance = new v2.LLMObservabilityApi(configuration);

const params: v2.LLMObservabilityApiCreateLLMObsIntegrationInferenceRequest = {
  body: {
    anthropicMetadata: {
      effort: "medium",
      thinking: {
        budgetTokens: 1024,
        type: "enabled",
      },
    },
    azureOpenaiMetadata: {
      deploymentId: "my-gpt4-deployment",
      modelVersion: "0613",
      resourceName: "my-azure-resource",
    },
    bedrockMetadata: {
      region: "us-east-1",
    },
    frequencyPenalty: 0.0,
    jsonSchema: `{"type":"object","properties":{"answer":{"type":"string"}}}`,
    maxCompletionTokens: 1024,
    maxTokens: 1024,
    messages: [
      {
        content: "What is the capital of France?",
        contents: [
          {
            type: "text",
            value: {
              text: "Hello, how can I help you?",
              toolCall: {
                arguments: {
                  location: "San Francisco",
                },
                name: "get_weather",
                toolId: "call_abc123",
                type: "function",
              },
              toolCallResult: {
                name: "get_weather",
                result: "The weather in San Francisco is 68°F and sunny.",
                toolId: "call_abc123",
                type: "function",
              },
            },
          },
        ],
        id: "msg_001",
        role: "user",
        toolCalls: [
          {
            arguments: {
              location: "San Francisco",
            },
            name: "get_weather",
            toolId: "call_abc123",
            type: "function",
          },
        ],
        toolResults: [
          {
            name: "get_weather",
            result: "The weather in San Francisco is 68°F and sunny.",
            toolId: "call_abc123",
            type: "function",
          },
        ],
      },
    ],
    modelId: "gpt-4o",
    openaiMetadata: {
      reasoningEffort: "medium",
      reasoningSummary: "auto",
    },
    presencePenalty: 0.0,
    temperature: 0.7,
    tools: [
      {
        _function: {
          description: "Get the current weather for a location.",
          name: "get_weather",
          parameters: {
            properties: "{'location': {'type': 'string'}}",
            type: "object",
          },
        },
        type: "function",
      },
    ],
    topK: 50,
    topP: 1.0,
    vertexAiMetadata: {
      location: "us-central1",
      project: "my-gcp-project",
      projectIds: ["my-gcp-project"],
    },
  },
  integration: "openai",
  accountId: "account_id",
};

apiInstance
  .createLLMObsIntegrationInference(params)
  .then((data: v2.LLMObsIntegrationInferenceResponse) => {
    console.log(
      "API called successfully. Returned data: " + JSON.stringify(data)
    );
  })
  .catch((error: any) => console.error(error));
```

#### Instructions

First [install the library and its dependencies](https://docs.datadoghq.com/api/latest.md?code-lang=typescript) and then save the example to `example.ts` and run following commands:
    DD_SITE="datadoghq.com" DD_API_KEY="<DD_API_KEY>" DD_APP_KEY="<DD_APP_KEY>" tsc "example.ts"
{% /tab %}