Amazon ECS on EC2

To deploy the containerized Datadog Agent to your ECS cluster, see the dedicated Amazon ECS Agent documentation.

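Overview

Amazon ECS on EC2 is a highly scalable, high-performance container management service for Docker containers running on EC2 instances.

The Amazon ECS Datadog integration automatically collects ECS metrics from CloudWatch. It extends these metrics by querying the ECS API for ECS events, tags, and the status of container instances, tasks, and services.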
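
The status query mentioned above can be pictured with a short sketch. This is not Datadog's crawler. `summarize_clusters` is a hypothetical helper, and it assumes a client that behaves like `boto3.client("ecs")`, exposing `list_clusters` and `describe_clusters`. Pagination is ignored for brevity.

```python
from typing import Any, Dict, List


def summarize_clusters(ecs_client: Any) -> List[Dict[str, Any]]:
    """Return per-cluster task and service counts from the ECS API.

    `ecs_client` is assumed to behave like boto3.client("ecs").
    Pagination (nextToken) is ignored to keep the sketch short.
    """
    arns = ecs_client.list_clusters().get("clusterArns", [])
    if not arns:
        return []
    clusters = ecs_client.describe_clusters(clusters=arns).get("clusters", [])
    return [
        {
            "cluster": c.get("clusterName"),
            "status": c.get("status"),
            "running_tasks": c.get("runningTasksCount", 0),
            "pending_tasks": c.get("pendingTasksCount", 0),
            "active_services": c.get("activeServicesCount", 0),
        }
        for c in clusters
    ]
```

With boto3 installed and AWS credentials configured, calling `summarize_clusters(boto3.client("ecs"))` would return one dictionary per cluster. Passing the client in as a parameter keeps the helper easy to exercise with a stub.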

Setup

Installation

If you haven't already, set up the Amazon Web Services integration first.

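Metric collection

  1. Follow the role delegation setup instructions for the AWS integration.

  2. In the AWS tile, enter the IAM role name and check the ECS box under Limit metric collection.

When metric collection is enabled, a dashboard that provides detailed information about your ECS metrics is available for this integration. See the dashboard documentation for more details.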

Data Collected

Metrics

aws.ecs.cluster.cpureservation
(gauge)
The percentage of CPU units that are reserved by running tasks in the cluster.
Shown as percent
aws.ecs.cluster.cpureservation.maximum
(gauge)
The maximum percentage of CPU units that are reserved by running tasks in the cluster.
Shown as percent
aws.ecs.cluster.cpureservation.minimum
(gauge)
The minimum percentage of CPU units that are reserved by running tasks in the cluster.
Shown as percent
aws.ecs.cluster.cpuutilization
(gauge)
The percentage of CPU units that are used in the cluster or service.
Shown as percent
aws.ecs.cluster.cpuutilization.maximum
(gauge)
The maximum percentage of CPU units that are used in the cluster or service.
Shown as percent
aws.ecs.cluster.cpuutilization.minimum
(gauge)
The minimum percentage of CPU units that are used in the cluster or service.
Shown as percent
aws.ecs.cluster.memory_reservation
(gauge)
The percentage of memory that is reserved by running tasks in the cluster.
Shown as percent
aws.ecs.cluster.memory_reservation.maximum
(gauge)
The maximum percentage of memory that is reserved by running tasks in the cluster.
Shown as percent
aws.ecs.cluster.memory_reservation.minimum
(gauge)
The minimum percentage of memory that is reserved by running tasks in the cluster.
Shown as percent
aws.ecs.cluster.memory_utilization
(gauge)
The percentage of memory that is used in the cluster or service.
Shown as percent
aws.ecs.cluster.memory_utilization.maximum
(gauge)
The maximum percentage of memory that is used in the cluster or service.
Shown as percent
aws.ecs.cluster.memory_utilization.minimum
(gauge)
The minimum percentage of memory that is used in the cluster or service.
Shown as percent
aws.ecs.cpureservation
(gauge)
The percentage of CPU units that are reserved by running tasks in the cluster.
Shown as percent
aws.ecs.cpureservation.maximum
(gauge)
The maximum percentage of CPU units that are reserved by running tasks in the cluster.
Shown as percent
aws.ecs.cpureservation.minimum
(gauge)
The minimum percentage of CPU units that are reserved by running tasks in the cluster.
Shown as percent
aws.ecs.cpuutilization
(gauge)
The percentage of CPU units that are used in the cluster or service.
Shown as percent
aws.ecs.cpuutilization.maximum
(gauge)
The maximum percentage of CPU units that are used in the cluster or service.
Shown as percent
aws.ecs.cpuutilization.minimum
(gauge)
The minimum percentage of CPU units that are used in the cluster or service.
Shown as percent
aws.ecs.memory_reservation
(gauge)
The percentage of memory that is reserved by running tasks in the cluster.
Shown as percent
aws.ecs.memory_reservation.maximum
(gauge)
The maximum percentage of memory that is reserved by running tasks in the cluster.
Shown as percent
aws.ecs.memory_reservation.minimum
(gauge)
The minimum percentage of memory that is reserved by running tasks in the cluster.
Shown as percent
aws.ecs.memory_utilization
(gauge)
The percentage of memory that is used in the cluster or service.
Shown as percent
aws.ecs.memory_utilization.maximum
(gauge)
The maximum percentage of memory that is used in the cluster or service.
Shown as percent
aws.ecs.memory_utilization.minimum
(gauge)
The minimum percentage of memory that is used in the cluster or service.
Shown as percent
aws.ecs.pending_tasks_count
(gauge)
The number of tasks on the container instance that are in the PENDING status.
Shown as task
aws.ecs.registered_cpu
(gauge)
The number of CPU units registered on the container instance.
aws.ecs.registered_memory
(gauge)
The number of memory units registered on the container instance.
aws.ecs.remaining_cpu
(gauge)
The number of CPU units remaining on the container instance.
aws.ecs.remaining_memory
(gauge)
The number of memory units remaining on the container instance.
aws.ecs.running_tasks_count
(gauge)
The number of tasks on the container instance that are in the RUNNING status.
Shown as task
aws.ecs.service.cpuutilization
(gauge)
Average percentage of CPU units that are used in the service.
Shown as percent
aws.ecs.service.cpuutilization.maximum
(gauge)
Maximum percentage of CPU units that are used in the service.
Shown as percent
aws.ecs.service.cpuutilization.minimum
(gauge)
Minimum percentage of CPU units that are used in the service.
Shown as percent
aws.ecs.service.desired
(gauge)
The number of tasks in the cluster that are in the desired state.
aws.ecs.service.memory_utilization
(gauge)
Average percentage of memory that is used in the service.
Shown as percent
aws.ecs.service.memory_utilization.maximum
(gauge)
Maximum percentage of memory that is used in the service.
Shown as percent
aws.ecs.service.memory_utilization.minimum
(gauge)
Minimum percentage of memory that is used in the service.
Shown as percent
aws.ecs.service.pending
(gauge)
The number of tasks in the cluster that are in the pending state.
Shown as task
aws.ecs.service.running
(gauge)
The number of tasks in the cluster that are in the running state.
Shown as task
aws.ecs.services
(gauge)
The number of services running per cluster.

Each of the metrics retrieved from AWS is assigned the same tags that appear in the AWS console, including but not limited to host name and security groups.
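
The container-instance metrics in the list above relate to each other arithmetically: the registered amount minus the remaining amount is what has been allocated on the instance. The following is a minimal sketch, assuming both values are reported in the same units. `reserved_percent` is a hypothetical helper, not part of any Datadog or AWS library.

```python
def reserved_percent(registered: float, remaining: float) -> float:
    """Percentage of registered units currently allocated on one container instance."""
    if registered <= 0:
        raise ValueError("registered must be positive")
    if not 0 <= remaining <= registered:
        raise ValueError("remaining must be between 0 and registered")
    return (registered - remaining) / registered * 100.0


# Example: 2048 CPU units registered (aws.ecs.registered_cpu)
# and 512 still free (aws.ecs.remaining_cpu).
cpu_reserved = reserved_percent(2048, 512)  # 75.0
```

The same function applies to `aws.ecs.registered_memory` and `aws.ecs.remaining_memory`.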

Events

To reduce noise, the Amazon ECS integration is automatically configured to collect only events that contain the following words: drain, error, fail, insufficient memory, pending, reboot, terminate.

To remove the whitelist and receive all events from the Datadog Amazon ECS integration, contact Datadog support.
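
The keyword filter described in this section can be approximated as follows. This is an illustrative sketch only: the keyword list comes from the text above, but the matching rule (case-insensitive substring match) and the names `keep_event` and `filter_events` are assumptions, not Datadog's implementation.

```python
from typing import Iterable, List

# Keywords listed in the Events section.
ECS_EVENT_KEYWORDS = (
    "drain",
    "error",
    "fail",
    "insufficient memory",
    "pending",
    "reboot",
    "terminate",
)


def keep_event(message: str, keywords: Iterable[str] = ECS_EVENT_KEYWORDS) -> bool:
    """Return True if the message contains any keyword.

    Assumption: matching is a case-insensitive substring test.
    """
    text = message.lower()
    return any(keyword in text for keyword in keywords)


def filter_events(messages: Iterable[str]) -> List[str]:
    """Keep only the messages that pass the keyword filter."""
    return [m for m in messages if keep_event(m)]
```

Under these assumptions, a message such as "unable to place a task: insufficient memory" is kept, while "service web has reached a steady state." is dropped.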

Service Checks

aws.ecs.agent_connected:
Returns CRITICAL if the Agent cannot connect, otherwise returns OK.
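
As a small sketch of the check's logic, the two outcomes map onto the numeric status codes that Datadog service checks use (0 for OK, 1 for WARNING, 2 for CRITICAL, 3 for UNKNOWN). The `agent_connected` boolean input and the function name are assumptions made for illustration.

```python
from enum import IntEnum


class ServiceCheckStatus(IntEnum):
    """Numeric status codes used by Datadog service checks."""

    OK = 0
    WARNING = 1
    CRITICAL = 2
    UNKNOWN = 3


def agent_connected_status(agent_connected: bool) -> ServiceCheckStatus:
    """CRITICAL when the Agent cannot connect, OK otherwise."""
    return ServiceCheckStatus.OK if agent_connected else ServiceCheckStatus.CRITICAL
```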

Troubleshooting

Need help? Contact Datadog support.