- 필수 기능
- 시작하기
- Glossary
- 표준 속성
- Guides
- Agent
- 통합
- 개방형텔레메트리
- 개발자
- Administrator's Guide
- API
- Datadog Mobile App
- CoScreen
- Cloudcraft
- 앱 내
- 서비스 관리
- 인프라스트럭처
- 애플리케이션 성능
- APM
- Continuous Profiler
- 스팬 시각화
- 데이터 스트림 모니터링
- 데이터 작업 모니터링
- 디지털 경험
- 소프트웨어 제공
- 보안
- AI Observability
- 로그 관리
- 관리
Supported OS
Dashboard preview (light)
Dashboard preview (dark)
This check monitors Lustre through the Datadog Agent.
Lustre is a distributed file system commonly used in high-performance computing (HPC) environments. This integration provides comprehensive monitoring of Lustre cluster performance, health, and operations across all node types: clients, metadata servers (MDS), and object storage servers (OSS).
The Datadog Agent can collect many metrics from Lustre clusters, including:
Follow the instructions below to install and configure this check for an Agent running on a host.
The Lustre check is included in the Datadog Agent package. No additional installation is needed on your server.
To configure the Agent check:
Edit the lustre.d/conf.yaml
file, in the conf.d/
folder at the root of your Agent’s configuration directory to start collecting your Lustre performance data. See the sample lustre.d/conf.yaml for all available configuration options.
Add the dd-agent
user to the sudoers file to allow it to run Lustre commands without a password. Edit the sudoers file with visudo
and add:
dd-agent ALL=(ALL) NOPASSWD: /path/to/lctl, /path/to/lnetctl, /path/to/lfs
Note: The Datadog Agent must have sufficient privileges to execute Lustre commands (lctl
, lnetctl
, lfs
). This typically requires running the Agent as root or with appropriate sudo permissions.
On client nodes, the Lustre integration can collect changelog events as structured logs. These logs contain:
operation_type
: The type of filesystem operationtimestamp
: When the operation occurredflags
: Operation flagsmessage
: Detailed operation informationImportant: Changelog users must be registered for changelogs to be collected. Use the lctl changelog_register
command to register changelog users. Refer to the Lustre manual.
To collect Lustre changelogs:
datadog.yaml
file: logs_enabled: true
lustre.d/conf.yaml
file. For example: logs:
- type: integration
source: lustre
service: lustre
lustre.d/conf.yaml
file. enable_changelogs: true
Run the Agent’s status subcommand and look for lustre
under the Checks section.
To uninstall this integration from your Agent, run the following command:
datadog-agent integration remove datadog-lustre
Alternatively, to disable the integration, rename the lustre.d/conf.yaml
file to lustre.d/conf.yaml.example
.
Need help? Contact Datadog Support.
The Lustre integration requires elevated privileges to run Lustre commands. Ensure the Datadog Agent is running with appropriate permissions:
# Check if the Agent user can run Lustre commands
sudo -u dd-agent lctl dl
sudo -u dd-agent sudo lnetctl net show
If the integration cannot automatically detect the node type, specify it explicitly in the configuration:
instances:
- node_type: client # or 'mds' or 'oss'
If expected metrics are not appearing:
For changelog collection on client nodes, ensure changelog users are registered:
# Register a changelog user
lctl changelog_register
# List registered changelog users
lctl changelog_users <filesystem>
Dashboard preview (light)
Dashboard preview (dark)