Monitoring Oracle Cloud Infrastructure (OCI) GPU instances helps ensure the performance and reliability of your high-performance computing workloads. This integration collects GPU metrics from the gpu_infrastructure_health namespace so that you can track GPU health and utilization.
This integration lets you monitor and alert on the health, capacity, throughput, status, and performance of your GPU instances.
GPU activity level, expressed as a percentage of total time. For instance pools, the value is averaged across all instances in the pool. Shown as percent.
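
To illustrate the pool-level averaging described above, here is a minimal sketch in Python. It assumes each instance reports a single utilization value between 0 and 100 and that the pool value is a plain arithmetic mean. The function name `pool_gpu_utilization` and the sample values are hypothetical and are not part of the integration or of any OCI API.

```python
from statistics import mean


def pool_gpu_utilization(per_instance_percent):
    """Average per-instance GPU utilization (0-100) across an instance pool.

    Hypothetical helper for illustration only; the integration itself
    reports the already-averaged value.
    """
    if not per_instance_percent:
        raise ValueError("no instances reported a utilization value")
    for value in per_instance_percent:
        if not 0 <= value <= 100:
            raise ValueError(f"utilization out of range: {value}")
    return mean(per_instance_percent)


# Assumed sample: three instances in one pool.
samples = [80.0, 60.0, 100.0]
print(pool_gpu_utilization(samples))  # 80.0
```

Because the pool value is an average, one saturated instance can be masked by idle neighbors, so per-instance values may still be worth checking when you tune alerts.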