Cluster Sizing

Ce produit n'est pas pris en charge par le site Datadog que vous avez sélectionné. ().
Cette page n'est pas encore disponible en français, sa traduction est en cours.
Si vous avez des questions ou des retours sur notre projet de traduction actuel, n'hésitez pas à nous contacter.
CloudPrem is in Preview

Join the CloudPrem Preview to access new self-hosted log management features.

Request Access

Overview

Proper cluster sizing ensures optimal performance, cost efficiency, and reliability for your CloudPrem deployment. Your sizing requirements depend on several factors including log ingestion volume, query patterns, and the complexity of your log data.

This guide provides baseline recommendations for dimensioning your CloudPrem cluster components—indexers, searchers, supporting services, and the PostgreSQL database.

Use your expected daily log volume and peak ingestion rates as starting points, then monitor your cluster's performance and adjust sizing as needed.

Indexers

Indexers receive logs from Datadog Agents, then process, index, and store them as index files (called splits) in object storage. Proper sizing is critical for maintaining ingestion throughput and ensuring your cluster can handle your log volume.

SpecificationRecommendationNotes
Performance5 MB/s per vCPUBaseline throughput to determine initial sizing. Actual performance depends on log characteristics (size, number of attributes, nesting level)
Memory4 GB RAM per vCPU
Minimum Pod Size2 vCPUs, 8 GB RAMRecommended minimum for indexer pods
Storage CapacityAt least 200 GBRequired for temporary data while creating and merging index files
Storage TypeLocal SSDs (preferred)Local HDDs or network-attached block storage (Amazon EBS, Azure Managed Disks) can also be used
Disk I/O~20 MB/s per vCPUEquivalent to 320 IOPS per vCPU for Amazon EBS (assuming 64 KB IOPS)

To index 1 TB of logs per day (~11.6 MB/s), follow these steps:

  1. Calculate vCPUs: 11.6 MB/s ÷ 5 MB/s per vCPU ≈ 2.3 vCPUs
  2. Calculate RAM: 2.3 vCPUs × 4 GB RAM ≈ 9 GB RAM
  3. Add headroom: Start with one indexer pod configured with 3 vCPUs, 12 GB RAM, and a 200 GB disk. Adjust these values based on observed performance and redundancy needs.

Searchers

Searchers handle search queries from the Datadog UI, reading metadata from the Metastore and fetching data from object storage.

A general starting point is to provision roughly double the total number of vCPUs allocated to Indexers.

  • Performance: Search performance depends heavily on the workload (query complexity, concurrency, amount of data scanned). For instance, term queries (status:error AND message:exception) are usually computationally less expensive than aggregations.
  • Memory: 4 GB of RAM per searcher vCPU. Provision more RAM if you expect many concurrent aggregation requests.

Other services

Allocate the following resources for these lightweight components:

ServicevCPUsRAMReplicas
Control Plane24 GB1
Metastore24 GB2
Janitor24 GB1

PostgreSQL database

  • Instance Size: For most use cases, a PostgreSQL instance with 1 vCPU and 4 GB of RAM is sufficient
  • AWS RDS Recommendation: If using AWS RDS, the t4g.medium instance type is a suitable starting point
  • High Availability: Enable Multi-AZ deployment with one standby replica for high availability

Helm chart sizing tiers

The CloudPrem Helm chart provides predefined sizing tiers through the indexer.podSize and searcher.podSize parameters. Each tier sets the vCPU and memory resource limits for a pod, and automatically configures component-specific settings.

SizevCPUsMemory
medium14 GB
large28 GB
xlarge416 GB
2xlarge832 GB
4xlarge1664 GB
6xlarge2496 GB
8xlarge32128 GB

The following values are automatically applied when you set indexer.podSize in the Helm chart. For more details on each parameter, see the Quickwit Indexer configuration.

Sizesplit_store_max_num_bytessplit_store_max_num_splits
medium200G10000
large200G10000
xlarge200G10000
2xlarge200G10000
4xlarge200G10000
6xlarge200G10000
8xlarge200G10000

The following values are automatically applied when you set indexer.podSize in the Helm chart. For more details on each parameter, see the Quickwit Ingest API configuration.

Sizemax_queue_memory_usagemax_queue_disk_usage
medium2GiB4GiB
large4GiB8GiB
xlarge8GiB16GiB
2xlarge16GiB32GiB
4xlarge32GiB64GiB
6xlarge48GiB96GiB
8xlarge64GiB128GiB

The following values are automatically applied to searcher configuration when you set searcher.podSize in the Helm chart. For more details on each parameter, see the Quickwit Searcher configuration.

Sizefast_field_cache_capacitysplit_footer_cache_capacitypartial_request_cache_capacitymax_num_concurrent_split_searchesaggregation_memory_limit
medium1GiB500MiB64MiB2500MiB
large2GiB1GiB128MiB41GiB
xlarge4GiB2GiB256MiB82GiB
2xlarge8GiB4GiB512MiB164GiB
4xlarge16GiB8GiB1GiB328GiB
6xlarge24GiB12GiB1536MiB4812GiB
8xlarge32GiB16GiB2GiB6416GiB

Further reading