---
title: Experiments
description: An overview of Datadog's LLM Observability Experiments feature.
breadcrumbs: Docs > LLM Observability > Experiments
---

# Experiments

{% callout %}
# Important note for users on the following Datadog sites: app.ddog-gov.com

{% alert level="danger" %}
This product is not supported for your selected [Datadog site](https://docs.datadoghq.com/getting_started/site.md). ().
{% /alert %}

{% /callout %}

{% image
   source="https://docs.dd-static.net/images/llm_observability/experiments/Experiments_LLMO.9a2c7ef62909cb6beb323f21a06d6ead.png?auto=format&fit=max&w=850 1x, https://docs.dd-static.net/images/llm_observability/experiments/Experiments_LLMO.9a2c7ef62909cb6beb323f21a06d6ead.png?auto=format&fit=max&w=850&dpr=2 2x"
   alt="LLM Observability, Experiment view. Heading: 'Comparing 6 experiments across 9 fields'. Line graph visualization charting the accuracy, correctness, duration, estimated cost, and other metrics of various experiments." /%}

LLM Observability [Experiments](https://app.datadoghq.com/llm/testing/experiments) supports the entire lifecycle of building LLM applications and agents. It helps you understand how changes to prompts, models, providers, or system architecture affect performance. With this feature, you can:

- Create and version datasets
- Run and manage experiments
- Compare results to evaluate impact

## Cookbooks{% #cookbooks %}

For in-depth examples of what you can do with LLM Experiments, see Datadog's provided [Jupyter notebooks](https://github.com/DataDog/llm-observability/tree/main/experiments/notebooks).

## Further reading{% #further-reading %}

- [Set up and use LLM Observability Experiments](https://docs.datadoghq.com/llm_observability/experiments/setup.md)
- [Create and monitor LLM experiments with Datadog](https://www.datadoghq.com/blog/llm-experiments/)
- [Datadog LLM Observability natively supports OpenTelemetry GenAI Semantic Conventions](https://www.datadoghq.com/blog/llm-otel-semantic-convention)
- [Driving AI ROI: How Datadog connects cost, performance, and infrastructure so you can scale responsibly](https://www.datadoghq.com/blog/manage-ai-cost-and-performance-with-datadog/)
- [How we cut our NLQ agent debugging time from hours to minutes with LLM Observability](https://www.datadoghq.com/blog/llm-observability-at-datadog-nlq)
