Manage Sensitive Logs Data Access
このページは日本語には対応しておりません。随時翻訳に取り組んでいます。
翻訳に関してご質問やご意見ございましたら、
お気軽にご連絡ください。
Overview
Logs may contain sensitive data, and should be handled carefully. If you are ingesting sensitive data into Datadog, consider the following:
- If you have intentionally set up your logs to have sensitive data for legitimate troubleshooting and auditing purposes, use the Roles Based Access Control to ensure that you have set up appropriate restrictions so only authorized users who have access to your Datadog account can access this data. For more information, see the Logs Roles Based Access Control (RBAC) User’s Guide to learn how to configure it for your organization.
- Address any unintentional logging of sensitive data to preemptively address any concerns down the road. Continue with this guide to learn more.
Controlling all of your data can be challenging, especially on a large and collaborative platform. This guide walks you through the different options to discover and manage sensitive data that is ingested into Datadog.
Sensitive Data Scanner
Sensitive Data Scanner is a stream-based, pattern matching service that you can use to identify, tag, and optionally redact or hash sensitive data. With this capability, your security and compliance teams can introduce a line of defense in preventing sensitive data from leaking outside your organization. Sensitive Data Scanner is available in your Organization Settings.
If you have already indexed logs that contain sensitive data, then follow these three steps:
- Determine the scope of the data being sent
- Fix the source of the data upstream
- Handle data already sent to Datadog
Mask action
The mask action is only available for logs.
When you set up or edit a scanner rule, there is an Action on Match section where you can set the rule to use the mask action for matched sensitive data. The mask action obfuscates the sensitive data, but users with the Data Scanner Unmask permission can de-obfuscate (unmask) and view the data in Datadog.
Notes:
- Unmasking can only be performed on indexed logs within Datadog.
- Masked data that is accessed programmatically, such as by using the API or Terraform, or within archives, always appears obfuscated and cannot be unmasked.
- Unmasking does not work on rehydrated logs.
- Datadog does not recommend using the mask action for credentials, unless you have a plan to respond to and rotate all leaked credentials.
To unmask sensitive data, navigate to the Summary page, click on a scanning rule, and then click on a log. If you have permission to see masked data, that data has an eye icon next to it. Click the eye icon to reveal the data. You can also use the Log Explorer to view your masked log data.
Determine the scope of the data being sent
What log query defines sensitive data?
First, define a query that outlines the sensitive data. That query will return all logs with sensitive data.
Chances are that queries such as version:x.y.z source:python status:debug match that expectation. Refer to the Log Search Syntax documentation if you need to use more advanced operators (wildcards, boolean operators, etc.).
This guide refers to this example query as the sensitive outline query.
Where does sensitive data live in Datadog?
Once sensitive data in logs is sent to the Datadog platform, it may exist in a number of places. As a result, check each of the following (ordered from most likely to have sensitive data to least likely):
- Datadog Indexes are where logs are stored in Datadog until they age out according to index retention. Focus should be on Datadog Indexes as other locations are less likely to be a compliance concern. Check indexes filters and exclusion filters to see if logs with sensitive data are indexed. 
- Log Archives, which is where Datadog sends logs to be stored. Set up Archive Filters to see if your archive contains sensitive logs. 
- Metrics generated from logs, which stores aggregated metrics. Sensitive data may have been discarded with this process. Check custom metrics filters to see if logs with sensitive data are processed. 
- Log Monitors notifications when they include Log Samples. Check specifically for monitors triggered during the period when sensitive data was being sent. 
- Live Tail, where logs are viewed in real-time by your organization’s users. There is no persistence beyond the 50 logs cached in browsers, and for broader queries, the result can be extremely sampled. 
Fix the source of the data upstream
Redact sensitive data in streaming logs using Sensitive Data Scanner
Use out-of-the-box or custom rules to identify and redact other kinds of sensitive data still coming in your logs.
Stop indexing sensitive logs
If you’re not using Sensitive Data Scanner, determine whether you want to exclude any new logs containing sensitive data from being indexed entirely. You’ll still need to address the logs containing sensitive data already indexed in Datadog.
- Find which index(es) hold logs with sensitive data.
- For each index, add an exclusion filter based on the sensitive outline query.
Stop sending sensitive data to Datadog
If certain kinds of sensitive data are prohibited from leaving your environment and being ingested into Datadog, then add scrubbing rules at source collection.
If you have way to change the loggers themselves, Datadog provides you with solutions to prevent compliance-sensitive data from being sent outside of your platform when using the Datadog Agent for Log Collection:
Similar scrubbing capabilities exist for the Serverless Forwarder.
Handle data already sent to and indexed in Datadog
Take the following steps according to your compliance requirements. You might not need to complete all steps.
Make sensitive logs un-queryable in Datadog (until they age-out)
This step makes logs with sensitive data, both logs that already sent and logs that might keep flowing in, not queryable in Datadog (Explorer, Dashboards, and Livetail).
Use the Data Access configuration page and a sensitive outline query to define a restriction and apply it to roles in your organization. For example, version:x.y.z source:python status:debug. You can also restrict over a time period with the @timestamp attribute. For example, @timestamp:[1731597125165 TO 1731597125200].
Note: Using NOT in the sensitive outline query restricts users from the logs matching the query and allows users to see logs that do not match the query.
Patch your archives
If you have to patch your archives to remove sensitive data, refer to the format of archives generated by Datadog documentation.
Support
If you have a specific compliance questions or need help, contact Datadog support. When you contact support, it is helpful for you to have the following information available:
- The sensitive outline query or anything that could define sensitive data such as a timerange, a service, or an environment.
- Whether you use Sensitive Data Scanner.
- Whether the sensitive data is still being sent into Datadog.
- Whether sensitive data has been indexed (in which index(es)) or turned into metrics.
- Whether you have already made sensitive data unqueryable.
Further Reading