Presto

Agent Check

Supported OS: Linux, Mac OS, Windows

Overview

This check collects Presto metrics such as:

  • Overall activity metrics (completed/failed queries, data input/output size, execution time)
  • Performance metrics (cluster memory, CPU input, CPU execution time)

Setup

Follow the instructions below to install and configure this check for an Agent running on a host. For containerized environments, see the Autodiscovery Integration Templates documentation to learn how to apply these instructions.

Installation

The Presto check is included in the Datadog Agent package. No additional installation is needed on your servers. Install the Agent on each coordinator and worker node from which you want to collect usage and performance metrics.

Configuration

  1. Edit the presto.d/conf.yaml file, in the conf.d/ folder at the root of your Agent's configuration directory, to start collecting your Presto performance data. See the sample presto.d/conf.yaml for all available configuration options.

    This check has a limit of 350 metrics per instance. The number of returned metrics is indicated on the info page. You can select the metrics that matter to you by editing the configuration. To learn how to customize the list of metrics to collect, see the JMX Checks documentation for detailed instructions. If you need to monitor more than 350 metrics, contact Datadog support.

  2. Restart the Agent.
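
As a rough illustration of how the metric list can be narrowed to stay under the 350-metric limit, the sketch below restricts one instance to two attributes of a single bean. It assumes the `conf`/`include` filter syntax described in the JMX Checks documentation. The host, port, `collect_default_metrics` setting, JMX domain, bean name, and attribute names are all assumptions to check against your own Presto JMX tree and the sample presto.d/conf.yaml; none of them come from this page.

```yaml
# presto.d/conf.yaml (illustrative sketch, not the shipped sample file)
init_config:
  is_jmx: true
  # Assumption: turning this off skips the integration's bundled metric list,
  # so only the filters declared below are applied.
  collect_default_metrics: false

instances:
  - host: localhost   # assumed JMX host
    port: 9999        # assumed JMX port
    conf:
      - include:
          # Assumed JMX domain and bean name; verify them with a JMX browser.
          domain: presto.execution
          bean: presto.execution:name=QueryManager
          attribute:
            # Assumed attribute names, mapped to metric names listed on this page.
            RunningQueries:
              alias: presto.execution.running_queries
              metric_type: gauge
            FailedQueries.TotalCount:
              alias: presto.execution.failed_queries.total_count
              metric_type: gauge
```

Each `include` entry acts as an allow-list, so adding or removing attributes is how the number of collected metrics would be tuned.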

Metric collection

Use the default configuration of your presto.d/conf.yaml file to enable collection of your Presto metrics. See the sample presto.d/conf.yaml for all available configuration options.
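
For orientation, a default configuration for a JMX-based check like this one typically looks like the minimal sketch below. The host and port values are assumptions; the sample presto.d/conf.yaml shipped with your Agent is the authoritative reference.

```yaml
# presto.d/conf.yaml (minimal sketch; values are assumptions)
init_config:
  is_jmx: true                    # metrics are read over JMX
  collect_default_metrics: true   # use the metric list bundled with the check

instances:
  - host: localhost   # assumed: address where Presto exposes JMX
    port: 9999        # assumed: JMX port configured on the Presto node
```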

Log collection

Available for Agent versions >6.0

  1. Log collection is disabled by default in the Datadog Agent. Enable it in datadog.yaml:

    logs_enabled: true
    
  2. Add this configuration block to your presto.d/conf.yaml file to start collecting your Presto logs:

    logs:
      - type: file
        path: /var/log/presto/*.log
        source: presto
        service: "<SERVICE_NAME>"
    

    Change the path and service parameter values to match your environment. See the sample presto.d/conf.yaml for all available configuration options.

  3. Restart the Agent.
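
If your Presto logs contain entries that span several lines (Java stack traces, for example), the log block above could be extended with a multi-line aggregation rule. This is a sketch only: it assumes every new log entry starts with a `YYYY-MM-DD` date, and the rule name is hypothetical.

```yaml
logs:
  - type: file
    path: /var/log/presto/*.log
    source: presto
    service: "<SERVICE_NAME>"
    log_processing_rules:
      - type: multi_line
        # Hypothetical rule name; any descriptive label works.
        name: new_log_start_with_date
        # Assumption: each new entry begins with a date such as 2024-01-31.
        # Lines that do not match are appended to the previous entry.
        pattern: \d{4}-\d{2}-\d{2}
```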

Validation

Run the Agent's status subcommand and look for presto under the Checks section.

Data Collected

Metrics

presto.execution.abandoned_queries.one_minute.count
(gauge)
Abandoned queries - one minute count.
Shown as query
presto.execution.abandoned_queries.one_minute.rate
(gauge)
Abandoned queries - one minute rate.
Shown as query
presto.execution.abandoned_queries.total_count
(gauge)
Abandoned queries - total count.
Shown as query
presto.execution.canceled_queries.one_minute.count
(gauge)
Canceled queries - one minute count.
Shown as query
presto.execution.canceled_queries.one_minute.rate
(gauge)
Canceled queries - one minute queries per second.
Shown as query
presto.execution.canceled_queries.total_count
(gauge)
Canceled queries - total count.
Shown as query
presto.execution.completed_queries.one_minute.count
(gauge)
Completed queries - one minute count.
Shown as query
presto.execution.completed_queries.one_minute.rate
(gauge)
Completed queries - one minute queries per second.
Shown as query
presto.execution.completed_queries.total_count
(gauge)
Completed queries - total count.
Shown as query
presto.execution.consumed_cpu_time_secs.one_minute.count
(gauge)
CPU (processing) time consumed - one minute count (seconds).
Shown as second
presto.execution.consumed_cpu_time_secs.one_minute.rate
(gauge)
CPU (processing) time consumed - one minute rate.
Shown as second
presto.execution.consumed_cpu_time_secs.total_count
(gauge)
CPU (processing) time consumed - total count (seconds).
Shown as second
presto.execution.cpu_input_byte_rate.all_time.avg
(gauge)
Distribution of query input data rates (cpu) - all time average bytes per second.
Shown as byte
presto.execution.cpu_input_byte_rate.all_time.p75
(gauge)
Distribution of query input data rates (cpu) - all time bytes per second - p75.
Shown as byte
presto.execution.cpu_input_byte_rate.all_time.p95
(gauge)
Distribution of query input data rates (cpu) - all time bytes per second - p95.
Shown as byte
presto.execution.cpu_input_byte_rate.one_minute.avg
(gauge)
Distribution of query input data rates (cpu) - one minute average bytes per second.
Shown as byte
presto.execution.cpu_input_byte_rate.one_minute.count
(gauge)
Distribution of query input data rates (cpu) - one minute count.
Shown as byte
presto.execution.cpu_input_byte_rate.one_minute.max
(gauge)
Distribution of query input data rates (cpu) - one minute max bytes per second.
Shown as byte
presto.execution.cpu_input_byte_rate.one_minute.min
(gauge)
Distribution of query input data rates (cpu) - one minute min bytes per second.
Shown as byte
presto.execution.cpu_input_byte_rate.one_minute.p75
(gauge)
Distribution of query input data rates (cpu) - one minute bytes per second - p75.
Shown as byte
presto.execution.cpu_input_byte_rate.one_minute.p95
(gauge)
Distribution of query input data rates (cpu) - one minute bytes per second - p95.
Shown as byte
presto.execution.cpu_input_byte_rate.one_minute.total
(gauge)
Distribution of query input data rates (cpu) - one minute total bytes per second.
Shown as byte
presto.execution.execution_time.all_time.avg
(gauge)
Query execution time (millisecond) - all time average.
Shown as millisecond
presto.execution.execution_time.all_time.count
(gauge)
Query execution time (millisecond) - all time count.
Shown as millisecond
presto.execution.execution_time.all_time.max
(gauge)
Query execution time (millisecond) - all time max.
Shown as millisecond
presto.execution.execution_time.all_time.min
(gauge)
Query execution time (millisecond) - all time min.
Shown as millisecond
presto.execution.execution_time.all_time.p75
(gauge)
Query execution time (millisecond) - all time - p75.
Shown as millisecond
presto.execution.execution_time.all_time.p95
(gauge)
Query execution time (millisecond) - all time - p95.
Shown as millisecond
presto.execution.execution_time.one_minute.avg
(gauge)
Query execution time (millisecond) - one minute average.
Shown as millisecond
presto.execution.execution_time.one_minute.max
(gauge)
Query execution time (millisecond) - one minute max.
Shown as millisecond
presto.execution.execution_time.one_minute.min
(gauge)
Query execution time (millisecond) - one minute min.
Shown as millisecond
presto.execution.execution_time.one_minute.p75
(gauge)
Query execution time (millisecond) - one minute p75.
Shown as millisecond
presto.execution.execution_time.one_minute.p95
(gauge)
Query execution time (millisecond) - one minute p95.
Shown as millisecond
presto.execution.executor.active_count
(gauge)
presto.execution.executor.completed_task_count
(gauge)

Shown as task
presto.execution.executor.core_pool_size
(gauge)
presto.execution.executor.task_count
(gauge)

Shown as task
presto.execution.executor.pool_size
(gauge)
presto.execution.executor.queued_task_count
(gauge)
presto.execution.executor.blocked_splits
(gauge)
Blocked splits count.
Shown as split
presto.execution.executor.running_splits
(gauge)
Running splits count.
Shown as split
presto.execution.executor.total_splits
(gauge)
Total splits count.
Shown as split
presto.execution.executor.waiting_splits
(gauge)
Waiting splits count.
Shown as split
presto.execution.executor.processor_executor.queued_task_count
(gauge)
Queued task count.
Shown as task
presto.execution.external_failures.one_minute.count
(gauge)
Failed queries (external) - one minute count.
Shown as query
presto.execution.external_failures.one_minute.rate
(gauge)
Failed queries (external) - one minute failures per second.
Shown as query
presto.execution.external_failures.total_count
(gauge)
Failed queries (external) - total count.
Shown as query
presto.execution.failed_queries.one_minute.count
(gauge)
Failed queries - one minute count.
Shown as query
presto.execution.failed_queries.one_minute.rate
(gauge)
Failed queries - one minute queries per second.
Shown as query
presto.execution.failed_queries.total_count
(gauge)
Failed queries - total count.
Shown as query
presto.execution.input_data_size.one_minute.count
(gauge)
Input data (bytes) - one minute count.
Shown as byte
presto.execution.input_data_size.one_minute.rate
(gauge)
Input data (bytes) - one minute bytes per second.
Shown as byte
presto.execution.input_data_size.total_count
(gauge)
Input data (bytes) - total count.
Shown as byte
presto.execution.input_positions.one_minute.count
(gauge)
Input positions (rows) - one minute count.
Shown as row
presto.execution.input_positions.one_minute.rate
(gauge)
Input positions (rows) - one minute rows per second.
Shown as row
presto.execution.input_positions.total_count
(gauge)
Input positions (rows) - total count.
Shown as row
presto.execution.internal_failures.one_minute.count
(gauge)
Failed queries (internal) - one minute count.
Shown as query
presto.execution.internal_failures.one_minute.rate
(gauge)
Failed queries (internal) - one minute queries per second.
Shown as query
presto.execution.internal_failures.total_count
(gauge)
Failed queries (internal) - total count.
Shown as query
presto.execution.insufficient_resources_failures.one_minute.count
(gauge)
Insufficient resources failures one minute count.
presto.execution.insufficient_resources_failures.one_minute.rate
(gauge)
Insufficient resources failures one minute failures per second.
presto.execution.insufficient_resources_failures.total_count
(gauge)
Insufficient resources failures total count.
presto.execution.management_executor.active_count
(gauge)
presto.execution.management_executor.completed_task_count
(gauge)

Shown as task
presto.execution.management_executor.queued_task_count
(gauge)

Shown as task
presto.execution.output_data_size.one_minute.count
(gauge)
Output data (bytes) - one minute count.
Shown as byte
presto.execution.output_data_size.one_minute.rate
(gauge)
Output data (bytes) - one minute bytes per second.
Shown as byte
presto.execution.output_data_size.total_count
(gauge)
Output data (bytes) - total count.
Shown as byte
presto.execution.output_positions.one_minute.count
(gauge)
Output positions (rows) - one minute count.
Shown as row
presto.execution.output_positions.one_minute.rate
(gauge)
Output positions (rows) - one minute rows per second.
Shown as row
presto.execution.output_positions.total_count
(gauge)
Output positions (rows) - total count.
Shown as row
presto.execution.running_queries
(gauge)
Active queries.
Shown as query
presto.execution.started_queries.one_minute.count
(gauge)
Queries started - one minute count.
Shown as query
presto.execution.started_queries.one_minute.rate
(gauge)
Queries started - one minute queries per second.
Shown as query
presto.execution.started_queries.total_count
(gauge)
Queries started - total count.
Shown as query
presto.execution.task_notification_executor.active_count
(gauge)
presto.execution.task_notification_executor.completed_task_count
(gauge)

Shown as task
presto.execution.task_notification_executor.pool_size
(gauge)
presto.execution.task_notification_executor.queued_task_count
(gauge)

Shown as task
presto.execution.user_error_failures.one_minute.count
(gauge)
Failed queries (user error) - one minute count.
Shown as query
presto.execution.user_error_failures.one_minute.rate
(gauge)
Failed queries (user error) - one minute queries per second.
Shown as query
presto.execution.user_error_failures.total_count
(gauge)
Failed queries (user error) - total count.
Shown as query
presto.execution.wall_input_bytes_rate.one_minute.avg
(gauge)
Input data rate (bytes) - one minute average.
Shown as byte
presto.execution.wall_input_bytes_rate.one_minute.max
(gauge)
Input data rate (bytes) - one minute max.
Shown as byte
presto.execution.wall_input_bytes_rate.one_minute.min
(gauge)
Input data rate (bytes) - one minute min.
Shown as byte
presto.execution.wall_input_bytes_rate.one_minute.p75
(gauge)
Input data rate (bytes) - one minute p75.
Shown as byte
presto.execution.wall_input_bytes_rate.one_minute.p95
(gauge)
Input data rate (bytes) - one minute p95.
Shown as byte
presto.failure_detector.active_count
(gauge)
Active node count.
Shown as node
presto.memory.assigned_queries
(gauge)
Memory (assigned queries).
Shown as byte
presto.memory.cluster_memory_bytes
(gauge)
Cluster memory (bytes).
Shown as byte
presto.memory.blocked_nodes
(gauge)
Memory (blocked nodes).
Shown as byte
presto.memory.free_bytes
(gauge)
Memory (free bytes).
Shown as byte
presto.memory.free_distributed_bytes
(gauge)
Memory (free distributed bytes).
Shown as byte
presto.memory.max_bytes
(gauge)
Memory (max bytes).
Shown as byte
presto.memory.nodes
(gauge)
Memory (nodes).
Shown as byte
presto.memory.reserved_bytes
(gauge)
Memory (reserved bytes).
Shown as byte
presto.memory.reserved_distributed_bytes
(gauge)
Memory (reserved distributed bytes).
Shown as byte
presto.memory.reserved_revocable_bytes
(gauge)
Memory (reserved revocable bytes).
Shown as byte
presto.memory.reserved_revocable_distributed_bytes
(gauge)
Memory (reserved revocable distributed bytes).
Shown as byte
presto.memory.total_distributed_bytes
(gauge)
Memory (total distributed bytes).
Shown as byte

Events

Presto does not include any events.

Service Checks

presto.can_connect:
Returns CRITICAL if the Agent is unable to connect to the monitored Presto instance and collect metrics from it. Returns OK otherwise.

Troubleshooting

Need help? Contact Datadog support.