Data exfiltration successful
이 페이지는 아직 한국어로 제공되지 않습니다. 번역 작업 중입니다.
현재 번역 프로젝트에 대한 질문이나 피드백이 있으신 경우
언제든지 연락주시기 바랍니다.Goal
Detect successful data exfiltration from AI-enabled services. This detection identifies when an attacker has successfully manipulated an LLM to leak sensitive information, including PII, credentials, or other confidential data.
Strategy
Monitor application security events for successful (unblocked) data exfiltration using @ai_guard.attack_categories:data-exfiltration and -@ai_guard.blocked:true. Integration with Sensitive Data Scanner (@ai_guard.sds.categories) enables precise classification of the leaked data type.
Signal severity is determined as follows:
CRITICAL Data exfiltration was not blocked and included PII or credentials (@ai_guard.sds.categories:(pii OR credentials)). This represents a confirmed data breach with potential regulatory implications.HIGH Data exfiltration was not blocked, either containing other sensitive data categories or unclassified data. This represents a security incident requiring investigation.
Triage and response
- Enable AI Guard in blocking mode for the affected service or tool to prevent further exfiltration.
- Block the attacking IP addresses to interrupt ongoing exploitation.
- Review the flagged requests and LLM responses to identify exactly what data was exfiltrated.
- Determine the breach scope — identify all affected users, sessions, and data records that may have been exposed.
- For critical signals with PII or credentials, assess reporting obligations under GDPR, CCPA, or other applicable data protection regulations.
- Review system prompts, input sanitization, and output filtering to harden defenses against future attempts.