Evaluation compatibility

このページは日本語には対応しておりません。随時翻訳に取り組んでいます。
翻訳に関してご質問やご意見ございましたら、お気軽にご連絡ください。

The supported third party LLM providers are OpenAI, Azure OpenAI, Anthropic, Amazon Bedrock, Vertex AI, and AI Gateway.

Managed evaluations are supported for the following configurations.

Evaluation	DD-trace version	LLM Provider	Applicable span
Language Mismatch	Fully supported	Self hosted	All span kinds

Custom LLM-as-a-judge evaluations are supported for the following configurations.

Evaluation	DD-trace version	LLM Provider	Applicable span
Boolean	Fully supported	All third party LLM providers	All span kinds
Score	Fully supported	All third party LLM providers	All span kinds
Categorical	Fully supported	All third party LLM providers	All span kinds
JSON	Fully supported	All third party LLM providers	All span kinds

Existing templates for custom LLM-as-a-judge evaluations are supported for the following configurations.

Evaluation	DD-trace version	LLM Provider	Applicable span
Failure to Answer	Fully supported	All third party LLM providers	All span kinds
Hallucination	Fully supported	All third party LLM providers	LLM only
Sentiment	Fully supported	All third party LLM providers	All span kinds
Toxicity	Fully supported	All third party LLM providers	All span kinds
Prompt Injection	Fully supported	All third party LLM providers	All span kinds
Topic Relevancy	Fully supported	All third party LLM providers	All span kinds
Tool Selection	Fully supported	All third party LLM providers	LLM only
Tool Argument Correctness	Fully supported	All third party LLM providers	LLM only
Goal Completeness	Fully supported	All third party LLM providers	LLM only