MetricsHub Enterprise

Supported OS: HP-UX, Linux, Solaris, Windows

marketplace
Integration version: 1.0.0

Overview

MetricsHub® is a universal infrastructure monitoring agent designed to give Datadog deep visibility across your entire IT stack.

It captures and forwards comprehensive telemetry—from server hardware health and database performance to network devices, storage systems, energy usage, and sustainability KPIs. Its rich telemetry enables unified observability in Datadog, helping teams optimize operations, ensure uptime, and meet sustainability goals.

Key features

  • Remote Monitoring: MetricsHub can monitor thousands of systems remotely through protocols such as REST APIs, SNMP, WBEM, WMI, SSH, IPMI, and more.
  • 250+ Connectors: Ready-to-use connectors for monitoring a wide variety of platforms. The MetricsHub agent is vendor-neutral, providing consistent coverage across manufacturers (for example, Cisco, Dell EMC, Huawei, HP, IBM, Lenovo, Pure, and more).
  • OpenTelemetry Integration: MetricsHub pushes collected metrics as OpenTelemetry metrics, following the OpenTelemetry standards, protocol, and semantic conventions, for easy integration with various observability platforms.
  • Sustainability Metrics: Track and report on energy usage and carbon footprint to optimize infrastructure efficiency.
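
As an illustration of the naming involved, the metric descriptions in the Data Collected section refer to names such as hw.fan.speed_ratio, while the metrics themselves are listed as metricshub.hw.fan.speed_ratio. The sketch below assumes that the Datadog name is simply the OpenTelemetry name with a "metricshub." prefix. That mapping is inferred from this page, not confirmed behavior, and the helper names are hypothetical.

```python
# Assumption: Datadog metric names on this page are the OpenTelemetry
# names with a "metricshub." prefix (inferred from the metric list,
# not confirmed by the vendor). Helper names are hypothetical.
PREFIX = "metricshub."


def to_datadog_name(otel_name: str) -> str:
    """Add the prefix to an OpenTelemetry metric name if it is missing."""
    if otel_name.startswith(PREFIX):
        return otel_name
    return PREFIX + otel_name


def to_otel_name(datadog_name: str) -> str:
    """Strip the prefix from a Datadog metric name if it is present."""
    if datadog_name.startswith(PREFIX):
        return datadog_name[len(PREFIX):]
    return datadog_name


if __name__ == "__main__":
    print(to_datadog_name("hw.fan.speed_ratio"))
    print(to_otel_name("metricshub.hw.host.power"))
```

A helper like this can be handy when comparing a dashboard query against the names reported by the agent.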

Supported platforms

  • Server: Fujitsu PRIMERGY, Cisco UCS, Dell PowerEdge, Hitachi Compute, HPE BladeSystem, HPE ProLiant, HP-UX, HPE Alletra 9000, HPE Integrity, HPE SuperDome, HPE Synergy, Huawei E Series, Huawei FusionCube, Huawei FusionServer, Huawei KunLun, Huawei TaiShan, Huawei X Series, Huawei xFusion, IBM POWER, IBM BladeCenter, IBM DataPower, IBM xSeries, Intel, IPMI, Lenovo, Lenovo ThinkSystem, Sun/Oracle Servers, NEC Express 5800, Nvidia, Oracle Networking, Quanta, RedFish, SuperMicro
  • System: Linux, Microsoft Windows, Proxmox, VMware
  • Storage systems: Dell PowerProtect Data Domain, DataDirect Networks, Dell Compellent, Dell EMC PowerMax, Dell PowerStore, Dell PowerVault, Dell EqualLogic, Dell XtremIO, Dot Hill, Dell EMC Celerra, Dell EMC VNX, Dell Isilon, Dell EMC CLARiiON, Dell EMC Unity, Dell EMC Symmetrix, Dell EMC VMAX, Dell EMC VNXe, Dell EMC VPLEX, Fujitsu Eternus, Hitachi VSP, Hitachi AMS, Hitachi HUS, Hitachi HNAS, HPE MSA, HPE Qumulo, HPE EVA, HPE 3PAR, HPE Primera, Huawei OceanStor, IBM DS3000-5000, IBM DS8000, IBM FlashSystem, IBM Storwize, IBM Tape Library, IBM XiV, MacroSan, McData, NetApp, NetApp E-Series, NetApp SolidFire, HPE Nimble, Oracle ZFS, Pure Storage, Quantum, SMI-S, StorageTek Tape Library, Synology
  • Network: HPE Aruba, Cisco Networking, Dell PowerSwitch, Arista, Extreme Networks, F5, HPE Procurve, Juniper, Oracle Networking, any MIB-2 compliant device
  • Databases: Microsoft SQL Server, MySQL, PostgreSQL, Oracle Database
  • Other: APC, Citrix NetScaler, Eaton, Raritan

For the complete list, see the MetricsHub connectors directory.

Dashboards

This MetricsHub integration comes with pre-configured dashboards that display system performance metrics, hardware health, and sustainability KPIs:

  • MetricsHub Hardware Main: Overview of all monitored systems, focusing on key hardware and sustainability metrics.
  • MetricsHub Hardware Site: Metrics specific to a particular site (a data center or a server room) and its monitored hosts.
  • MetricsHub Hardware Host: Metrics associated with the hardware components of a physical host.
  • MetricsHub System Performance: System performance metrics for Linux and Windows systems, including CPU, memory, filesystems, disks, and network.

Data Collected

Metrics

metricshub.incident.count
(gauge)
Count of opened incidents
metricshub.agent.info
(gauge)
MetricsHub agent information
metricshub.connector.status
(gauge)
Connector status
metricshub.host.configured
(gauge)
Indicates whether the monitoring of the host is configured
metricshub.host.up
(gauge)
Indicates whether the host is present and responding
metricshub.host.response_time
(gauge)
Response time of the host
Shown as second
metricshub.hw.status
(gauge)
Status of the monitored device
metricshub.hw.energy
(count)
Energy usage of the monitored device (in joules)
metricshub.hw.power
(gauge)
Power consumption of the monitored device
Shown as watt
metricshub.hw.errors
(count)
Number of detected errors
Shown as error
metricshub.hw.host.ambient_temperature
(gauge)
Host ambient temperature
Shown as degree celsius
metricshub.hw.host.energy
(count)
Energy consumed by this host (in joules)
metricshub.hw.host.heating_margin
(gauge)
Number of degrees before reaching the closest warning threshold
Shown as degree celsius
metricshub.hw.host.power
(gauge)
Power consumed by all the discovered components on the monitored host
Shown as watt
metricshub.hw.battery.charge
(gauge)
Battery charge ratio
Shown as percent
metricshub.hw.battery.time_left
(gauge)
Battery time left
Shown as second
metricshub.hw.blade.power_state
(gauge)
Indicates whether the blade is currently on or off
metricshub.hw.cpu.speed
(gauge)
CPU current speed
Shown as hertz
metricshub.hw.cpu.speed.limit
(gauge)
CPU maximum speed
Shown as hertz
metricshub.hw.cpu_core.speed
(gauge)
CPU core current speed
Shown as hertz
metricshub.hw.cpu_core.utilization
(gauge)
CPU core used time ratio
Shown as percent
metricshub.hw.enclosure.energy
(count)
Total energy consumed by the enclosure (in joules)
metricshub.hw.enclosure.power
(gauge)
Enclosure power consumption
Shown as watt
metricshub.hw.fan.speed_ratio
(gauge)
Fan speed ratio
Shown as percent
metricshub.hw.fan.speed_ratio.limit
(gauge)
Alert threshold for hw.fan.speed_ratio
metricshub.hw.fan.speed
(gauge)
Fan speed
metricshub.hw.fan.speed.limit
(gauge)
Alert threshold for hw.fan.speed
metricshub.hw.gpu.io
(count)
GPU received or transmitted bytes
Shown as byte
metricshub.hw.gpu.memory.limit
(gauge)
GPU memory size
Shown as byte
metricshub.hw.gpu.memory.utilization
(gauge)
GPU memory utilization
Shown as percent
metricshub.hw.gpu.utilization
(gauge)
GPU utilization
Shown as percent
metricshub.hw.gpu.power
(gauge)
GPU power consumption
Shown as watt
metricshub.hw.logical_disk.limit
(gauge)
Logical disk size
Shown as byte
metricshub.hw.logical_disk.usage
(gauge)
Logical disk space usage
Shown as byte
metricshub.hw.logical_disk.utilization
(gauge)
Logical disk space utilization
Shown as percent
metricshub.hw.lun.paths
(gauge)
Number of distinct paths to the remote volume
metricshub.hw.memory.limit
(gauge)
Memory module size
Shown as byte
metricshub.hw.network.bandwidth.utilization
(gauge)
Ratio of the available bandwidth currently in use
Shown as percent
metricshub.hw.network.bandwidth.limit
(gauge)
Speed that the network adapter and its remote counterpart currently use to communicate with each other
Shown as byte
metricshub.hw.network.up
(gauge)
Indicates whether the network card is plugged in to the network
metricshub.hw.network.io
(count)
Network card received or transmitted bytes
Shown as byte
metricshub.hw.network.packets
(count)
Network card received or transmitted packets
Shown as packet
metricshub.hw.other_device.uses
(count)
Number of times this device has been used
Shown as operation
metricshub.hw.other_device.value
(gauge)
Currently reported value of the device
metricshub.hw.physical_disk.endurance_utilization
(gauge)
Endurance utilization of the SSD
Shown as percent
metricshub.hw.physical_disk.size
(gauge)
Physical disk size
Shown as byte
metricshub.hw.power_supply.limit
(gauge)
Maximum power output
Shown as watt
metricshub.hw.power_supply.utilization
(gauge)
Ratio of the power supply’s capacity currently in use
Shown as percent
metricshub.hw.robotics.moves
(count)
Number of move operations that happened during the last collect interval
Shown as operation
metricshub.hw.site.pue
(gauge)
Power Usage Effectiveness (PUE) of the site
metricshub.hw.site.carbon_intensity
(gauge)
Carbon intensity of the electricity provider in grams of CO2 per kWh
metricshub.hw.site.electricity_price
(gauge)
Price of the electricity in dollars per kWh
Shown as dollar
metricshub.hw.tape_drive.operations
(count)
Number of operations performed by the tape drive
Shown as operation
metricshub.hw.temperature
(gauge)
Current temperature reading
Shown as degree celsius
metricshub.hw.temperature.limit
(gauge)
Alert threshold for hw.temperature
Shown as degree celsius
metricshub.hw.vm.power_ratio
(gauge)
Fraction of the physical host's power used to run this VM
Shown as watt
metricshub.hw.vm.power_state
(gauge)
Virtual machine current power state (on / off / standby)
metricshub.hw.voltage
(gauge)
Voltage output
Shown as volt
metricshub.hw.voltage.limit
(gauge)
Alert threshold for hw.voltage
Shown as volt
metricshub.system.cpu.time
(count)
Total CPU seconds broken down by states
Shown as second
metricshub.system.cpu.utilization
(gauge)
CPU time utilization ratio by CPU state during the last collect
metricshub.system.disk.io
(count)
Disk activity in total bytes
Shown as byte
metricshub.system.disk.io_time
(count)
Total time the disk spent active
Shown as second
metricshub.system.disk.operation_time
(count)
Sum of the time each disk operation took to complete
Shown as second
metricshub.system.disk.operations
(count)
Number of disk operations
Shown as operation
metricshub.system.filesystem.usage
(gauge)
Reports a filesystem’s space usage across different states
Shown as byte
metricshub.system.filesystem.utilization
(gauge)
The fraction of the filesystem that is used. It represents how much of the total filesystem capacity has been consumed
Shown as percent
metricshub.system.memory.limit
(gauge)
Total memory available in the system
Shown as byte
metricshub.system.memory.usage
(gauge)
Reports memory in use by state
Shown as byte
metricshub.system.memory.utilization
(gauge)
Memory utilization ratio per memory state
Shown as percent
metricshub.system.network.dropped
(count)
Count of packets that were dropped or discarded even though there were no errors
Shown as packet
metricshub.system.network.errors
(count)
Count of detected network errors
Shown as error
metricshub.system.network.io
(count)
Total number of bytes transmitted or received by the network interface
Shown as byte
metricshub.system.network.packets
(count)
Total number of packets transmitted or received by the network interface
Shown as packet
metricshub.system.network.bandwidth.limit
(gauge)
Link speed of the network interface
Shown as byte
metricshub.system.paging.faults
(count)
Total number of page faults that have occurred in the system
Shown as fault
metricshub.system.paging.operations
(count)
Total number of paging operations performed by the system
Shown as operation
metricshub.system.paging.usage
(count)
Linux swap or Windows pagefile usage
Shown as byte
metricshub.system.paging.utilization
(gauge)
The fraction of the paging (swap) space that is currently used
Shown as percent
metricshub.system.uptime
(gauge)
The time the system has been running
Shown as second
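
The energy and site metrics listed above can be combined into sustainability figures. The following is a minimal sketch of that arithmetic, not the formula used by the bundled dashboards: it assumes energy readings in joules (as in metricshub.hw.host.energy), applies the standard definition of PUE (total facility energy divided by IT equipment energy), and uses carbon intensity in grams of CO2 per kWh and a price per kWh as described for the metricshub.hw.site.* metrics. The function name and the sample values are hypothetical.

```python
JOULES_PER_KWH = 3_600_000  # 1 kWh = 3.6 million joules


def site_footprint(it_energy_joules: float,
                   pue: float,
                   carbon_intensity_g_per_kwh: float,
                   price_per_kwh: float) -> dict:
    """Estimate facility energy, CO2 emissions, and cost for a site.

    Assumption: facility energy = IT energy * PUE (standard PUE
    definition). How MetricsHub itself combines these values is not
    stated on this page.
    """
    it_kwh = it_energy_joules / JOULES_PER_KWH
    facility_kwh = it_kwh * pue
    return {
        "it_kwh": it_kwh,
        "facility_kwh": facility_kwh,
        "co2_grams": facility_kwh * carbon_intensity_g_per_kwh,
        "cost_dollars": facility_kwh * price_per_kwh,
    }


if __name__ == "__main__":
    # Hypothetical sample: 7.2 MJ of IT energy, PUE of 1.5,
    # 400 g CO2 per kWh, 0.25 dollars per kWh.
    print(site_footprint(7_200_000, 1.5, 400.0, 0.25))
```

Because the energy metrics are reported as counts, the value passed as it_energy_joules would typically be the energy accumulated over the period of interest rather than a single sample.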

Support

A subscription to MetricsHub through the Datadog Marketplace grants access to all services provided by the MetricsHub Support Desk.

Further Reading

For more information, visit the MetricsHub website.


This application is made available through the Marketplace and is supported by a Datadog Technology Partner.