Best Practices for RUM Sampling

문서 > RUM & 세션 재생 > 실제 사용자 모니터링 및 세션 재생 가이드 > Best Practices for RUM Sampling

이 페이지는 아직 한국어로 제공되지 않습니다. 번역 작업 중입니다.
현재 번역 프로젝트에 대한 질문이나 피드백이 있으신 경우 언제든지 연락주시기 바랍니다.

Overview

Sampling in Datadog’s Real User Monitoring product enables you to collect data from a certain percentage of user traffic.

There are two different ways of sampling, which control the data you send to Datadog:

Client-side (head-based) sampling: Makes the sampling decision at the beginning of a user session, before any data is collected. The RUM SDK in your application determines whether to track the entire session or not, reducing data collection and ingestion of sessions that aren’t analyzed.
Server-side (tail-based) sampling: Makes the sampling decision after data has been collected and sent to Datadog. It allows you to filter and retain specific sessions based on their characteristics (like errors or user attributes) using retention filters.
Note: Server-side sampling is only possible with the retention filters provided by RUM without Limits. If you need to use this but are on the legacy, client-side-only model, reach out to your account team.

This guide walks you through best practices for RUM sampling so you can capture sessions and collect data based on your monitoring needs. Learn more about how sessions are defined in RUM.

Sampling configuration

Configure the sampling rate

Client-side (head-based) sampling rate

With RUM without Limits, client-side sampling rate helps you control how many sessions you send from your applications to Datadog.

Before each new user session, the SDK draws a random floating-point number between 0 and 100, which is then compared to the value set in the SDK configuration. If the random number is lower than the value set in the SDK configuration, the session is kept and events start being collected. If the random number is higher, the session is not kept and events are not collected until the end of the session.

You can set the sampling rate with the SDK (Browser, Android, iOS, Flutter, Kotlin Multiplatform, React Native, Roku, Unity), then deploy it in the application code.

Server-side (tail-based) sampling rate

With RUM without Limits, server-side sampling rate defines which sessions you want to keep in Datadog (see details about the retention period).

The server-side sampling rate is defined as part of the retention filters for your sessions. When a retention filter matches a session or matches one of the events making up the sessions (view/action/error/resource, and so on), the whole session is stored alongside all its events (and including the ones that preceded the sampling decision). The retention rate allows you to store only a specific percentage of sessions that meet the filter criteria and discard the rest. Learn more about how retention filters work.

The effect of sampling on data and metrics that are available in RUM

RUM metrics, including the ones that come out-of-the-box with RUM without Limits (such as Core Web Vitals and usage numbers) and the custom ones that you can create yourself, are all calculated based on sessions that are ingested on Datadog. For example, if the client-sampling rate is set to capture 60% of sessions, then the Core Web Vitals and total number of sessions are calculated based on 60% of those sessions.

Note: With RUM without Limits, those metrics are computed before the retention filters - in other words, before server-side sampling.

Recommended sampling rate

Client-side (head-based) sampling rate

For optimal monitoring, Datadog recommends sending 100% of your sessions to Datadog. This ensures accurate out-of-the-box custom metrics, and complete visibility into your user experience.

However, if your application experiences high traffic and ingestion costs are a concern, you can reduce the sampling rate. Keep in mind that lower sampling rates affect the accuracy of your metrics proportionally.

Server-side (tail-based) sampling rate

For server-side sampling, Datadog recommends a two-step approach:

Start with basic retention filters to capture sessions with critical user paths, such as errors or from specific users.
Adjust the sampling rate based on your needs:
- Ensure you have enough sessions for troubleshooting
- Maintain sufficient data for APM correlation
- Keep enough samples for performance analysis (waterfall views, long tasks)

With RUM without Limits, your server-side sampling should provide enough data for both troubleshooting and performance analysis while managing your data volume effectively.

Sampling based on specific attributes

Configuring sampling based on specific attributes, such as sampling 100% of sessions with errors and 5% otherwise, or only sampling sessions that go through the checkout flow, is supported using retention filters. See the Retention Filters Best Practices guide to understand common retention filter types.

Changing the sampling rate in the Datadog RUM UI

During live outages, incidents, or bug investigations, and for customers that are not yet on RUM without Limits you can increase client-side (head-based) sampling to collect 100% of your sessions to ensure nothing is missed, or to have more examples of a particular issue.

You can only change the head-based sampling rate from the Datadog UI if you use the server-side injection method to add the Browser RUM SDK to your web application. To do this, modify the sampling rate on the SDK Configuration page.

Session Sampling and Session Replay Sampling sliders visible from the RUM SDK Configuration page.

For other instrumentation methods (such as npm or CDN), to modify the head-based sampling rate:

Deploy a new version of your application with an updated sessionSampleRate value
Use a feature flag or remote configuration service to dynamically set the rate when the SDK initializes

To modify the head-based sampling rate for mobile SDKs, redeploy your application with an updated sessionSampleRate value.