Overview

[Figure: OpenTelemetry Apache Spark metrics in a Spark dashboard]

The Apache Spark receiver collects Apache Spark metrics and makes them available in the Spark Overview dashboard. Configure the receiver as specified for the latest version of the apachesparkreceiver.

For more information, see the OpenTelemetry project documentation for the Apache Spark receiver.

Setup

To collect Apache Spark metrics with OpenTelemetry for use with Datadog:

  1. Configure the Apache Spark receiver in your OpenTelemetry Collector configuration.
  2. Ensure the OpenTelemetry Collector is configured to export to Datadog.
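
The two steps above can be sketched as a single Collector configuration. This is a minimal illustration, not a definitive setup: the endpoint, collection interval, and the DD_SITE and DD_API_KEY environment variable names are assumptions to adapt to your environment, and the options available depend on the receiver version you run.

```yaml
receivers:
  apachespark:
    # Assumed values: point the endpoint at your Spark application's REST API.
    endpoint: http://localhost:4040
    collection_interval: 60s

exporters:
  datadog:
    api:
      # Assumed environment variable names; supply your own site and API key.
      site: ${env:DD_SITE}
      key: ${env:DD_API_KEY}

service:
  pipelines:
    metrics:
      # Route the Spark metrics from the receiver to the Datadog exporter.
      receivers: [apachespark]
      exporters: [datadog]
```

A single metrics pipeline keeps the example small. In practice you may want to add processors, such as batching, between the receiver and the exporter.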

See the Apache Spark receiver documentation for detailed configuration options and requirements.

Data collected

| OTEL | DESCRIPTION | FILTER |
|---|---|---|
| spark.driver.block_manager.disk.usage | Disk space used by the BlockManager. | |
| spark.driver.block_manager.memory.usage | Memory usage for the driver’s BlockManager. | |
| spark.driver.dag_scheduler.stage.count | Number of stages the DAGScheduler is either running or needs to run. | |
| spark.executor.disk.usage | Disk space used by this executor for RDD storage. | |
| spark.executor.memory.usage | Storage memory used by this executor. | |
| spark.job.stage.active | Number of active stages in this job. | |
| spark.job.stage.result | Number of stages with a specific result in this job. | job_result: failed |
| spark.job.stage.result | Number of stages with a specific result in this job. | job_result: skipped |
| spark.job.stage.result | Number of stages with a specific result in this job. | job_result: completed |
| spark.job.task.active | Number of active tasks in this job. | |
| spark.job.task.result | Number of tasks with a specific result in this job. | job_result: completed |
| spark.job.task.result | Number of tasks with a specific result in this job. | job_result: failed |
| spark.job.task.result | Number of tasks with a specific result in this job. | job_result: skipped |
| spark.stage.io.records | Number of records written and read in this stage. | direction: in |
| spark.stage.io.records | Number of records written and read in this stage. | direction: out |
| spark.stage.io.size | Amount of data written and read at this stage. | direction: in |
| spark.stage.io.size | Amount of data written and read at this stage. | direction: out |
| spark.stage.shuffle.io.read.size | Amount of data read in shuffle operations in this stage. | |
| spark.stage.shuffle.io.records | Number of records written or read in shuffle operations in this stage. | direction: in |
| spark.stage.shuffle.io.records | Number of records written or read in shuffle operations in this stage. | direction: out |

For the full mapping between OpenTelemetry and Datadog metric names, see OpenTelemetry Metrics Mapping.

Further reading

Additional helpful documentation, links, and articles: