AI Guard Security Signals

이 페이지는 아직 한국어로 제공되지 않습니다. 번역 작업 중입니다.
현재 번역 프로젝트에 대한 질문이나 피드백이 있으신 경우 언제든지 연락주시기 바랍니다.

AI Guard isn't available in the site.

AI Guard security signals provide visibility into threats and attacks AI Guard detects in your applications. These signals are built on top of AAP (Application and API Protection) security signals and integrate with Datadog’s security monitoring workflows.

Understand AI Guard signals

Datadog creates AI Guard security signals when it detects a threat based on a configured detection rule. Signals indicating threats such as prompt injection, jailbreaking, or tool misuse appear in the Datadog Security Signals explorer. These signals can provide:

  • Threat detection: Attack context based on your configured detection rules
  • Action insights: Blocked or allowed actions information according to your rule settings
  • Rich investigation context: Attack categories detected, AI Guard evaluation results, and links to related AI Guard spans for comprehensive analysis
  • Custom runbooks: Custom remediation guidance and response procedures for specific threat scenarios

To help you prioritize your remediation efforts, AI Guard automatically assigns a severity level to every security signal. You can create custom detection rules to customize severity levels and define specific security responses.

Create detection rules

You can create custom detection rules by defining thresholds for when you want to receive notifications; for example, more than 5 DENY actions in 10 minutes. When AI Guard evaluations exceed those thresholds, it generates security signals.

To create AI Guard detection rules:

  1. In Datadog, go to the AI Guard detection rule explorer, then click New Rule.
    AI Guard Detection Rules Explorer
  2. Under Define your Real-time rule, choose the type of rule to create.
  3. Under Define Search Queries, define the types of tags you want to create signals for. You can use the following AI Guard attributes to filter and target specific threat patterns:
    TagDescriptionPossible values
    @ai_guard.actionFilter by AI Guard's evaluation resultALLOW or DENY
    @ai_guard.attack_categoriesTarget specific attack types
    • jailbreak
    • indirect-prompt-injection
    • destructive-tool-call
    • denial-of-service-tool-call
    • security-exploit
    • authority-override
    • role-play
    • instruction-override
    • obfuscation
    • system-prompt-extraction
    • data-exfiltration
    @ai_guard.blockedFilter based on whether an action in the trace was blockedtrue or false
    @ai_guard.toolsFilter by specific tool names involved in the evaluationget_user_profile, user_recent_transactions, etc.
    @ai_guard.sds.categoriesFilter by sensitive data categories detected by Sensitive Data Scannercredentials, email_address, etc.
    @ai_guard.sds.rule_tagsFilter by specific sensitive data rule tagsaws_access_key_id, aws_secret_access_key, claude_api_key, email_address, etc.
  4. Under Define Rule Conditions:
    1. Define your threshold conditions, if applicable to the type of rule you chose.
    2. Set the severity level of the security signals AI Guard generates with this rule.
    3. Choose who should get notifications for new signals and how often.
    4. Choose security responses to take, such as automated IP or user blocking, and IP flagging.
    5. Configure additional settings, such as updating the same signal instead of creating a new one if AI Guard detects new values within a set amount of time, and decreasing signal severity for non-production environments.
  5. Under Describe your Playbook, customize the notification and define tags to send with the signals.
  6. Click Save Rule.

For more comprehensive detection rule capabilities, see detection rules.

Investigate signals

To view and investigate AI Guard security signals, and correlate them with other security events, you can view signals in two places:

The Security Signals explorers allow you to filter, prioritize, and investigate AI Guard signals alongside other application security threats, providing a unified view of your security posture.

You can create or link cases directly from an AI Guard security signal, and click any signal to open a side panel containing additional context.

Get additional context with spans

AI Guard spans offer detailed information about the assessments it made and why. When you open a span from the Investigate page or from a signal, you can get context on the specific prompts your AI agent used, read exact inputs and outputs, and see any attack categories that contributed to AI Guard assessing a tool call as unsafe.

Get context on a span

When you click on a span in the explorer, you can see:

  • The service and environment the requests occurred in
  • The blocking policy configured for that service, which determines whether AI Guard blocks unsafe requests, or detects and tags them without blocking them
  • The user who interacted with the agent
  • The specific inputs and outputs from your agent, and whether they came from LLMs or external tools
  • Whether AI Guard assessed each request as safe or unsafe
  • Whether AI Guard blocked the request
  • If AI Guard assessed the call as unsafe, which attack categories it included
  • Whether the request included sensitive data, and if so, what type of sensitive data
  • Additional tags, which you can use to filter spans in the explorer

Additionally, you can click Explore in graph view to see the requests in the conversation graphed out, or view the span in APM or LLM Observability.

Further reading