Evaluation compatibility

Docs > LLM Observability > Evaluations > Evaluation compatibility

Cette page n'est pas encore disponible en français, sa traduction est en cours.
Si vous avez des questions ou des retours sur notre projet de traduction actuel, n'hésitez pas à nous contacter.

Evaluation compatibility

The supported third party LLM providers are OpenAI, Azure OpenAI, Anthropic, Amazon Bedrock, Vertex AI, and AI Gateway.

Managed evaluations

Managed evaluations are supported for the following configurations.

Evaluation	DD-trace version	LLM Provider	Applicable span
Language Mismatch	Fully supported	Self hosted	All span kinds

Custom LLM-as-a-judge evaluations

Custom LLM-as-a-judge evaluations are supported for the following configurations.

Evaluation	DD-trace version	LLM Provider	Applicable span
Boolean	Fully supported	All third party LLM providers	All span kinds
Score	Fully supported	All third party LLM providers	All span kinds
Categorical	Fully supported	All third party LLM providers	All span kinds
JSON	Fully supported	All third party LLM providers	All span kinds

Template LLM-as-a-judge evaluations

Existing templates for custom LLM-as-a-judge evaluations are supported for the following configurations.

Evaluation	DD-trace version	LLM Provider	Applicable span
Failure to Answer	Fully supported	All third party LLM providers	All span kinds
Hallucination	Fully supported	All third party LLM providers	LLM only
Sentiment	Fully supported	All third party LLM providers	All span kinds
Toxicity	Fully supported	All third party LLM providers	All span kinds
Prompt Injection	Fully supported	All third party LLM providers	All span kinds
Topic Relevancy	Fully supported	All third party LLM providers	All span kinds
Tool Selection	Fully supported	All third party LLM providers	LLM only
Tool Argument Correctness	Fully supported	All third party LLM providers	LLM only
Goal Completeness	Fully supported	All third party LLM providers	LLM only

Evaluation compatibility