Evaluations

Overview

LLM Observability offers several ways to run evaluations. You can configure them by navigating to AI Observability > Evaluations.

Custom LLM-as-a-judge evaluations

Custom LLM-as-a-judge evaluations allow you to define your own evaluation logic using natural language prompts. You can create custom evaluations to assess subjective or objective criteria (like tone, helpfulness, or factuality) and run them at scale across your traces and spans.

Managed evaluations

Datadog builds and maintains managed evaluations that cover common use cases. You can enable and configure them within the LLM Observability application.

Submit external evaluations

You can also submit external evaluations using Datadog’s API. This mechanism is useful if you have your own evaluation system but want to centralize that information in Datadog.
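
For example, here is a minimal sketch of submitting an external evaluation with the ddtrace Python SDK, using LLMObs.export_span to capture the span context and LLMObs.submit_evaluation to attach the result. The application name, model details, evaluation label, score, and tags are illustrative placeholders.

```python
# Minimal sketch using the ddtrace Python SDK. The ml_app name, model details,
# evaluation label, score value, and tags are illustrative placeholders.
from ddtrace.llmobs import LLMObs
from ddtrace.llmobs.decorators import llm

LLMObs.enable(ml_app="my-chatbot")  # placeholder application name


@llm(model_name="gpt-4o", model_provider="openai")
def answer(question: str) -> str:
    # Call your model here; a hard-coded response stands in for it.
    response = "Paris is the capital of France."
    LLMObs.annotate(input_data=question, output_data=response)

    # Export the active span's context so the evaluation joins to this span.
    span_context = LLMObs.export_span(span=None)

    # Attach a result produced by your own evaluation system.
    LLMObs.submit_evaluation(
        span_context=span_context,
        label="factuality",       # evaluation name shown in Datadog
        metric_type="score",      # "score" or "categorical"
        value=0.95,
        tags={"evaluation_provider": "in_house_eval"},  # placeholder tag
    )
    return response


answer("What is the capital of France?")
```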

Evaluation integrations

Datadog also supports integrations with third-party evaluation frameworks such as Ragas and NeMo.

Sensitive Data Scanner integration

In addition to evaluating the input and output of LLM requests, agents, workflows, or the application, LLM Observability integrates with Sensitive Data Scanner, which helps prevent data leakage by identifying and redacting any sensitive information.

Security

AI Guard provides real-time security guardrails for your AI apps and agents, securing them against prompt injection, jailbreaking, tool misuse, and sensitive data exfiltration attacks. AI Guard is in Preview.

Permissions

LLM Observability Write permissions are necessary to configure evaluations.

Retrieving spans

LLM Observability offers an Export API that you can use to retrieve spans for running external evaluations. This removes the need to track evaluation-relevant data at execution time.
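
A rough sketch of calling the API from Python is shown below. The endpoint path and query fields are placeholders rather than the real Export API route; check the Export API reference for the exact request format for your Datadog site. The standard DD-API-KEY and DD-APPLICATION-KEY headers are assumed.

```python
# Rough sketch only: the endpoint path and request body are placeholders, not the
# actual Export API contract. Standard Datadog API/application key headers assumed.
import os

import requests

DD_SITE = os.environ.get("DD_SITE", "datadoghq.com")
URL = f"https://api.{DD_SITE}/api/<llm-obs-export-path>"  # placeholder path

headers = {
    "DD-API-KEY": os.environ["DD_API_KEY"],
    "DD-APPLICATION-KEY": os.environ["DD_APP_KEY"],
    "Content-Type": "application/json",
}

# Hypothetical query: pull the last hour of spans for one ML app so an
# external evaluation job can score them offline.
payload = {
    "filter": {
        "query": "@ml_app:my-chatbot",
        "from": "now-1h",
        "to": "now",
    },
}

response = requests.post(URL, headers=headers, json=payload)
response.raise_for_status()
for span in response.json().get("data", []):
    print(span.get("id"))
```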