This product is not supported for your selected 
Datadog site. (
).
Cette page n'est pas encore disponible en français, sa traduction est en cours.
Si vous avez des questions ou des retours sur notre projet de traduction actuel, 
n'hésitez pas à nous contacter.
Overview
LLM Observability offers several ways to support evaluations. They can be configured by navigating to AI Observability > Settings > Evaluations.
Custom LLM-as-a-judge evaluations
Custom LLM-as-a-judge evaluations allow you to define your own evaluation logic using natural language prompts. You can create custom evaluations to assess subjective or objective criteria (like tone, helpfulness, or factuality) and run them at scale across your traces and spans.
Managed evaluations
Datadog builds and supports managed evaluations to support common use cases. You can enable and configure them within the LLM Observability application.
Submit external evaluations
You can also submit external evaluations using Datadog’s API. This mechanism is great if you have your own evaluation system, but would like to centralize that information within Datadog.
Evaluation integrations
Datadog also supports integrations with some 3rd party evaluation frameworks, such as Ragas and NeMo.
Sensitive Data Scanner integration
In addition to evaluating the input and output of LLM requests, agents, workflows, or the application, LLM Observability integrates with Sensitive Data Scanner, which helps prevent data leakage by identifying and redacting any sensitive information (such as personal data, financial details, or proprietary information) that may be present in any step of your LLM application.
By proactively scanning for sensitive data, LLM Observability ensures that conversations remain secure and compliant with data protection regulations. This additional layer of security reinforces Datadog’s commitment to maintaining the confidentiality and integration of user interactions with LLMs.
Permissions
LLM Observability Write permissions are necessary to configure evaluations.