Presto
Dash が新機能を発表!インシデントマネジメント、Continuous Profiler など多数の機能が追加されました! Dash イベントで発表された新機能!

Presto

Agent Check Agentチェック

Supported OS: Linux Mac OS Windows

概要

このチェックは、次のような Presto メトリクスを収集します。

  • 全体的なアクティビティメトリクス: 完了/失敗したクエリ、データ入力/出力サイズ、実行時間。
  • パフォーマンスメトリクス: クラスターメモリ、入力 CPU 時間、実行 CPU 時間。

セットアップ

ホストで実行されている Agent 用にこのチェックをインストールおよび構成する場合は、以下の手順に従ってください。コンテナ環境の場合は、オートディスカバリーのインテグレーションテンプレートのガイドを参照してこの手順を行ってください。

インストール

Presto チェックは Datadog Agent パッケージに含まれています。 サーバーに追加でインストールする必要はありません。使用状況メトリクスとパフォーマンスメトリクスを収集するコーディネーターノードおよびワーカーノードごとに Agent をインストールします。

コンフィギュレーション

  1. Presto のパフォーマンスデータを収集するには、Agent のコンフィギュレーションディレクトリのルートにある conf.d/ フォルダーの presto.d/conf.yaml ファイルを編集します。使用可能なすべてのコンフィギュレーションオプションについては、サンプル presto.d/conf.yaml を参照してください。

このチェックでは、インスタンスあたりのメトリクス数が 350 に制限されています。返されたメトリクスの数は、情報ページに表示されます。以下で説明する構成を編集することで、関心があるメトリクスを指定できます。収集するメトリクスをカスタマイズする方法については、JMX チェックのドキュメントで詳細な手順を参照してください。制限以上のメトリクスを監視する必要がある場合は、Datadog のサポートチームまでお問い合わせください。

  1. Agent を再起動します

メトリクスの収集

presto.d/conf.yaml ファイルのデフォルトコンフィギュレーションを使用して、Presto メトリクスの収集を有効にします。使用可能なすべてのコンフィギュレーションオプションの詳細については、サンプル presto.d/conf.yaml を参照してください。

ログの収集

Agent バージョン 6.0 以降で利用可能

  1. Datadog Agent で、ログの収集はデフォルトで無効になっています。以下のように、datadog.yaml ファイルでこれを有効にします。

    logs_enabled: true
  2. Presto のログの収集を開始するには、次のコンフィギュレーションブロックを presto.d/conf.yaml ファイルに追加します。

    logs:
     - type: file
       path: /var/log/presto/*.log
       source: presto
       service: "<SERVICE_NAME>"

    path パラメーターと service パラメーターの値を変更し、環境に合わせて構成してください。使用可能なすべての構成オプションの詳細については、サンプル presto.d/conf.yaml を参照してください。

  3. Agent を再起動します

検証

Agent の status サブコマンドを実行し、Checks セクションで presto を探します。

収集データ

メトリクス

presto.execution.abandoned_queries.one_minute.count
(gauge)
Abandoned queries - one minute count.
Shown as query
presto.execution.abandoned_queries.one_minute.rate
(gauge)
Abandoned queries - one minute rate.
Shown as query
presto.execution.abandoned_queries.total_count
(gauge)
Abandoned queries - total count.
Shown as query
presto.execution.canceled_queries.one_minute.count
(gauge)
Canceled queries - one minute count.
Shown as query
presto.execution.canceled_queries.one_minute.rate
(gauge)
Canceled queries - one minute queries per second.
Shown as query
presto.execution.canceled_queries.total_count
(gauge)
Canceled queries - total count.
Shown as query
presto.execution.completed_queries.one_minute.count
(gauge)
Completed queries - one minute count.
Shown as query
presto.execution.completed_queries.one_minute.rate
(gauge)
Completed queries - one minute queries per second.
Shown as query
presto.execution.completed_queries.total_count
(gauge)
Completed queries - total count.
Shown as query
presto.execution.consumed_cpu_time_secs.one_minute.count
(gauge)
CPU (processing) time consumed - one minute count (seconds).
Shown as second
presto.execution.consumed_cpu_time_secs.one_minute.rate
(gauge)
CPU (processing) time consumed - one minute rate.
Shown as second
presto.execution.consumed_cpu_time_secs.total_count
(gauge)
CPU (processing) time consumed - total count (seconds).
Shown as second
presto.execution.cpu_input_byte_rate.all_time.avg
(gauge)
Distribution of query input data rates (cpu) - all time average bytes per second.
Shown as byte
presto.execution.cpu_input_byte_rate.all_time.p75
(gauge)
Distribution of query input data rates (cpu) - all time bytes per second - p75.
Shown as byte
presto.execution.cpu_input_byte_rate.all_time.p95
(gauge)
Distribution of query input data rates (cpu) - all time bytes per second - p95.
Shown as byte
presto.execution.cpu_input_byte_rate.one_minute.avg
(gauge)
Distribution of query input data rates (cpu) - one minute average bytes per second.
Shown as byte
presto.execution.cpu_input_byte_rate.one_minute.count
(gauge)
Distribution of query input data rates (cpu) - one minute count.
Shown as byte
presto.execution.cpu_input_byte_rate.one_minute.max
(gauge)
Distribution of query input data rates (cpu) - one minute max bytes per second.
Shown as byte
presto.execution.cpu_input_byte_rate.one_minute.min
(gauge)
Distribution of query input data rates (cpu) - one minute min bytes per second.
Shown as byte
presto.execution.cpu_input_byte_rate.one_minute.p75
(gauge)
Distribution of query input data rates (cpu) - one minute bytes per second - p75.
Shown as byte
presto.execution.cpu_input_byte_rate.one_minute.p95
(gauge)
Distribution of query input data rates (cpu) - one minute bytes per second - p95.
Shown as byte
presto.execution.cpu_input_byte_rate.one_minute.total
(gauge)
Distribution of query input data rates (cpu) - one minute total bytes per second.
Shown as byte
presto.execution.execution_time.all_time.avg
(gauge)
Query execution time (millisecond) - all time average.
Shown as millisecond
presto.execution.execution_time.all_time.count
(gauge)
Query execution time (millisecond) - all time count.
Shown as millisecond
presto.execution.execution_time.all_time.max
(gauge)
Query execution time (millisecond) - all time max.
Shown as millisecond
presto.execution.execution_time.all_time.min
(gauge)
Query execution time (millisecond) - all time min.
Shown as millisecond
presto.execution.execution_time.all_time.p75
(gauge)
Query execution time (millisecond) - all time - p75.
Shown as millisecond
presto.execution.execution_time.all_time.p95
(gauge)
Query execution time (millisecond) - all time - p95.
Shown as millisecond
presto.execution.execution_time.one_minute.avg
(gauge)
Query execution time (millisecond) - one minute average.
Shown as millisecond
presto.execution.execution_time.one_minute.max
(gauge)
Query execution time (millisecond) - one minute max.
Shown as millisecond
presto.execution.execution_time.one_minute.min
(gauge)
Query execution time (millisecond) - one minute min.
Shown as millisecond
presto.execution.execution_time.one_minute.p75
(gauge)
Query execution time (millisecond) - one minute p75.
Shown as millisecond
presto.execution.execution_time.one_minute.p95
(gauge)
Query execution time (millisecond) - one minute p95.
Shown as millisecond
presto.execution.executor.active_count
(gauge)
presto.execution.executor.completed_task_count
(gauge)

Shown as task
presto.execution.executor.core_pool_size
(gauge)
presto.execution.executor.task_count
(gauge)

Shown as task
presto.execution.executor.pool_size
(gauge)
presto.execution.executor.queued_task_count
(gauge)
presto.execution.executor.blocked_splits
(gauge)
Blocked splits count.
Shown as split
presto.execution.executor.running_splits
(gauge)
Running splits count.
Shown as split
presto.execution.executor.total_splits
(gauge)
Total splits count.
Shown as split
presto.execution.executor.waiting_splits
(gauge)
Waiting splits count.
Shown as split
presto.execution.executor.processor_executor.queued_task_count
(gauge)
Queued task count.
Shown as task
presto.execution.external_failures.one_minute.count
(gauge)
Failed queries (external) - one minute count.
Shown as query
presto.execution.external_failures.one_minute.rate
(gauge)
Failed queries (external) - one minute failures per second.
Shown as query
presto.execution.external_failures.total_count
(gauge)
Failed queries (external) - total count.
Shown as query
presto.execution.failed_queries.one_minute.count
(gauge)
Failed queries - one minute count.
Shown as query
presto.execution.failed_queries.one_minute.rate
(gauge)
Failed queries - one minute queries per second.
Shown as query
presto.execution.failed_queries.total_count
(gauge)
Failed queries - total count.
Shown as query
presto.execution.input_data_size.one_minute.count
(gauge)
Input data (bytes) - one minute count.
Shown as byte
presto.execution.input_data_size.one_minute.rate
(gauge)
Input data (bytes) - one minute bytes per second.
Shown as byte
presto.execution.input_data_size.total_count
(gauge)
Input data (bytes) - total count.
Shown as byte
presto.execution.input_positions.one_minute.count
(gauge)
Input positions (rows) - one minute count.
Shown as row
presto.execution.input_positions.one_minute.rate
(gauge)
Input positions (rows) - one minute rows per second.
Shown as row
presto.execution.input_positions.total_count
(gauge)
Input positions (rows) - total count.
Shown as row
presto.execution.internal_failures.one_minute.count
(gauge)
Failed queries (internal) - one minute count.
Shown as query
presto.execution.internal_failures.one_minute.rate
(gauge)
Failed queries (internal) - one minute queries per second.
Shown as query
presto.execution.internal_failures.total_count
(gauge)
Failed queries (internal) - total count.
Shown as query
presto.execution.insufficient_resources_failures.one_minute.count
(gauge)
Insufficient resources failures one minute count.
presto.execution.insufficient_resources_failures.one_minute.rate
(gauge)
Insufficient resources failures one minute failures per second.
presto.execution.insufficient_resources_failures.total_count
(gauge)
Insufficient resources failures total count.
presto.execution.management_executor.active_count
(gauge)
presto.execution.management_executor.completed_task_count
(gauge)

Shown as task
presto.execution.management_executor.queued_task_count
(gauge)

Shown as task
presto.execution.output_data_size.one_minute.count
(gauge)
Output data (bytes) - one minute count.
Shown as byte
presto.execution.output_data_size.one_minute.rate
(gauge)
Output data (bytes) - one minute bytes per second.
Shown as byte
presto.execution.output_data_size.total_count
(gauge)
Output data (bytes) - total count.
Shown as byte
presto.execution.output_positions.one_minute.count
(gauge)
Output positions (rows) - one minute count.
Shown as row
presto.execution.output_positions.one_minute.rate
(gauge)
Output positions (rows) - one minute rows per second.
Shown as row
presto.execution.output_positions.total_count
(gauge)
Output positions (rows) - total count.
Shown as row
presto.execution.running_queries
(gauge)
Active queries.
Shown as query
presto.execution.started_queries.one_minute.count
(gauge)
Queries started - one minute count.
Shown as query
presto.execution.started_queries.one_minute.rate
(gauge)
Queries started - one minute queries per second.
Shown as query
presto.execution.started_queries.total_count
(gauge)
Queries started - total count.
Shown as query
presto.execution.task_notification_executor.active_count
(gauge)
presto.execution.task_notification_executor.completed_task_count
(gauge)

Shown as task
presto.execution.task_notification_executor.pool_size
(gauge)
presto.execution.task_notification_executor.queued_task_count
(gauge)

Shown as task
presto.execution.user_error_failures.one_minute.count
(gauge)
Failed queries (user error) - one minute count.
Shown as query
presto.execution.user_error_failures.one_minute.rate
(gauge)
Failed queries (user error) - one minute queries per second.
Shown as query
presto.execution.user_error_failures.total_count
(gauge)
Failed queries (user error) - total count.
Shown as query
presto.execution.wall_input_bytes_rate.one_minute.avg
(gauge)
Input data rate (bytes) - one minute average.
Shown as byte
presto.execution.wall_input_bytes_rate.one_minute.max
(gauge)
Input data rate (bytes) - one minute max.
Shown as byte
presto.execution.wall_input_bytes_rate.one_minute.min
(gauge)
Input data rate (bytes) - one minute min.
Shown as byte
presto.execution.wall_input_bytes_rate.one_minute.p75
(gauge)
Input data rate (bytes) - one minute p75.
Shown as byte
presto.execution.wall_input_bytes_rate.one_minute.p95
(gauge)
Input data rate (bytes) - one minute p95.
Shown as byte
presto.failure_detector.active_count
(gauge)
Active node count.
Shown as node
presto.memory.assigned_queries
(gauge)
Memory (assigned queries).
Shown as byte
presto.memory.cluster_memory_bytes
(gauge)
Cluster memory (bytes).
Shown as byte
presto.memory.blocked_nodes
(gauge)
Memory (blocked nodes).
Shown as byte
presto.memory.free_bytes
(gauge)
Memory (free bytes).
Shown as byte
presto.memory.free_distributed_bytes
(gauge)
Memory (free distributed bytes).
Shown as byte
presto.memory.max_bytes
(gauge)
Memory (max bytes).
Shown as byte
presto.memory.nodes
(gauge)
Memory (nodes).
Shown as byte
presto.memory.reserved_bytes
(gauge)
Memory (reserved bytes).
Shown as byte
presto.memory.reserved_distributed_bytes
(gauge)
Memory (reserved distributed bytes).
Shown as byte
presto.memory.reserved_revocable_bytes
(gauge)
Memory (reserved revocable bytes).
Shown as byte
presto.memory.reserved_revocable_distributed_bytes
(gauge)
Memory (reserved revocable distributed bytes).
Shown as byte
presto.memory.total_distributed_bytes
(gauge)
Memory (total distributed bytes).
Shown as byte

イベント

Presto には、イベントは含まれません。

サービスのチェック

presto.can_connect:
Agent が監視対象の Presto インスタンスに接続できず、メトリクスを収集できない場合は、CRITICAL を返します。それ以外の場合は、OK を返します。

トラブルシューティング

ご不明な点は、Datadog のサポートチームまでお問合せください。