ibm_spectrum_lsf.can_connect (gauge) | Whether or not the integration can run LSF commands [Always reported] |
ibm_spectrum_lsf.cluster.hosts (gauge) | The number of hosts in the cluster. [Reported by lsclusters] |
ibm_spectrum_lsf.cluster.servers (gauge) | The number of servers in the cluster. [Reported by lsclusters] |
ibm_spectrum_lsf.cluster.status (gauge) | The status of the cluster. [Reported by lsclusters] |
ibm_spectrum_lsf.gpu.ecc (gauge) | Number of ECC errors. [Reported by lsload_gpu] |
ibm_spectrum_lsf.gpu.error (gauge) | Whether or not the GPU is in an error state. [Reported by lsload_gpu] |
ibm_spectrum_lsf.gpu.mem.total (gauge) | The total memory available on the GPU. [Reported by lsload_gpu] |
ibm_spectrum_lsf.gpu.mem.used (gauge) | The total memory used on the GPU. [Reported by lsload_gpu] |
ibm_spectrum_lsf.gpu.mem.utilization (gauge) | The percentage of the GPU’s memory currently in use. [Reported by lsload_gpu] |
ibm_spectrum_lsf.gpu.mode (gauge) | The GPU’s compute mode, 0 is default. [Reported by lsload_gpu] |
ibm_spectrum_lsf.gpu.power (gauge) | Current power draw of the GPU in watts. [Reported by lsload_gpu] Shown as watt |
ibm_spectrum_lsf.gpu.pstate (gauge) | Current performance state of the GPU. [Reported by lsload_gpu] |
ibm_spectrum_lsf.gpu.status (gauge) | Whether or not the GPU is OK. [Reported by lsload_gpu] |
ibm_spectrum_lsf.gpu.temperature (gauge) | The current temperature of the GPU. [Reported by lsload_gpu] Shown as degree celsius |
ibm_spectrum_lsf.gpu.utilization (gauge) | The current GPU utilization. [Reported by lsload_gpu] |
ibm_spectrum_lsf.host.cpu_factor (gauge) | The relative CPU performance factor. [Reported by lshosts] |
ibm_spectrum_lsf.host.is_server (gauge) | Indicates whether the host is a server or client host. [Reported by lshosts] |
ibm_spectrum_lsf.host.max_mem (gauge) | The maximum amount of physical memory available for user processes. [Reported by lshosts] |
ibm_spectrum_lsf.host.max_swap (gauge) | The total available swap space. [Reported by lshosts] |
ibm_spectrum_lsf.host.max_temp (gauge) | The maximum /tmp space in MB configured on a host. [Reported by lshosts] Shown as megabyte |
ibm_spectrum_lsf.host.num_cores (gauge) | The number of cores per processor that is configured on a host. [Reported by lshosts] Shown as core |
ibm_spectrum_lsf.host.num_cpus (gauge) | The number of processors on this host. [Reported by lshosts] |
ibm_spectrum_lsf.host.num_procs (gauge) | The number of physical processors per CPU configured on a host. [Reported by lshosts] |
ibm_spectrum_lsf.host.num_threads (gauge) | The number of threads per core that is configured on a host. [Reported by lshosts] Shown as thread |
ibm_spectrum_lsf.job.completed.details.avg_memory (gauge) | The average memory used by the completed job. [Reported by bhist_details] Shown as megabyte |
ibm_spectrum_lsf.job.completed.details.cpu_average_efficiency (gauge) | The CPU average efficiency percentage of the completed job. [Reported by bhist_details] Shown as percent |
ibm_spectrum_lsf.job.completed.details.cpu_peak (gauge) | The CPU peak value for the completed job. [Reported by bhist_details] |
ibm_spectrum_lsf.job.completed.details.cpu_peak_duration (gauge) | The duration of CPU peak usage for the completed job. [Reported by bhist_details] Shown as second |
ibm_spectrum_lsf.job.completed.details.cpu_peak_efficiency (gauge) | The CPU peak efficiency percentage of the completed job. [Reported by bhist_details] Shown as percent |
ibm_spectrum_lsf.job.completed.details.cpu_time (gauge) | The total CPU time consumed by the completed job. [Reported by bhist_details] Shown as second |
ibm_spectrum_lsf.job.completed.details.exit_code (gauge) | The exit code returned by the completed job. [Reported by bhist_details] |
ibm_spectrum_lsf.job.completed.details.max_memory (gauge) | The maximum memory used by the completed job. [Reported by bhist_details] Shown as megabyte |
ibm_spectrum_lsf.job.completed.details.mem_efficiency (gauge) | The memory efficiency percentage of the completed job. [Reported by bhist_details] Shown as percent |
ibm_spectrum_lsf.job.completed.details.status (gauge) | The status of the completed job (1). Tagged with status:success or status:failure. [Reported by bhist_details] |
ibm_spectrum_lsf.job.completed.details.success (gauge) | Indicates whether the job completed successfully (1) or failed (0). [Reported by bhist_details] |
ibm_spectrum_lsf.job.completed.pending (gauge) | The total amount of time spent by the job in the pending state. [Reported by bhist] Shown as second |
ibm_spectrum_lsf.job.completed.pending_user_suspended (gauge) | The total amount of time spent by the job in the user suspended state. [Reported by bhist] Shown as second |
ibm_spectrum_lsf.job.completed.running (gauge) | The total run time of the job. [Reported by bhist] Shown as second |
ibm_spectrum_lsf.job.completed.system_suspended (gauge) | The total amount of time the job was in the system suspended state. [Reported by bhist] Shown as second |
ibm_spectrum_lsf.job.completed.total (gauge) | The total amount of time spent by the job from submission to completion. [Reported by bhist] Shown as second |
ibm_spectrum_lsf.job.completed.unknown (gauge) | The total amount of time spent by the job in an unknown state. [Reported by bhist] Shown as second |
ibm_spectrum_lsf.job.completed.user_suspended (gauge) | The total amount of time spent by the job in the user suspended state. [Reported by bhist] Shown as second |
ibm_spectrum_lsf.job.cpu_used (gauge) | The CPU used by the job. [Reported by bjobs] |
ibm_spectrum_lsf.job.idle_factor (gauge) | Job idle information (CPU time/runtime) if JOB_IDLE is configured in the queue, and the job has triggered an idle exception. [Reported by bjobs] |
ibm_spectrum_lsf.job.mem (gauge) | Total resident memory usage of all processes in a job. [Reported by bjobs] |
ibm_spectrum_lsf.job.percent_complete (gauge) | The estimated completion percentage of the job. [Reported by bjobs] |
ibm_spectrum_lsf.job.run_time (gauge) | Estimated run time for the job. [Reported by bjobs] Shown as second |
ibm_spectrum_lsf.job.swap (gauge) | Total virtual memory and swap usage of all processes in a job. [Reported by bjobs] |
ibm_spectrum_lsf.job.time_left (gauge) | The estimated run time that the job has remaining. [Reported by bjobs] Shown as second |
ibm_spectrum_lsf.load.cpu.run_queue_length.15m (gauge) | The 15 minute exponentially averaged CPU run queue length. [Reported by lsload] |
ibm_spectrum_lsf.load.cpu.run_queue_length.15s (gauge) | The 15 second exponentially averaged CPU run queue length. [Reported by lsload] |
ibm_spectrum_lsf.load.cpu.run_queue_length.1m (gauge) | The 1 minute exponentially averaged CPU run queue length. [Reported by lsload] |
ibm_spectrum_lsf.load.cpu.utilization (gauge) | The CPU utilization exponentially averaged over the last minute, 0 - 1. [Reported by lsload] |
ibm_spectrum_lsf.load.disk.io (gauge) | the disk I/O rate exponentially averaged over the last minute, in KB per second. [Reported by lsload] Shown as kilobyte |
ibm_spectrum_lsf.load.idle_time (gauge) | On UNIX, the idle time of the host (keyboard is not touched on all logged in sessions), in minutes. On Windows, the it index is based on the time that a screen saver is active on a particular host. [Reported by lsload] Shown as minute |
ibm_spectrum_lsf.load.login_users (gauge) | The number of current login users. [Reported by lsload] |
ibm_spectrum_lsf.load.mem.available_ram (gauge) | The amount of available RAM. [Reported by lsload] Shown as megabyte |
ibm_spectrum_lsf.load.mem.available_swap (gauge) | The amount of available swap space. [Reported by lsload] Shown as megabyte |
ibm_spectrum_lsf.load.mem.free (gauge) | The amount of free space in /tmp, in MB. [Reported by lsload] Shown as megabyte |
ibm_spectrum_lsf.load.mem.paging_rate (gauge) | The memory paging rate exponentially averaged over the last minute, in pages per second. [Reported by lsload] Shown as page |
ibm_spectrum_lsf.load.status (gauge) | Status of the host. [Reported by lsload] |
ibm_spectrum_lsf.perfmon.host.queries.avg (gauge) | The average number of host information queries in the sampling period. [Reported by badmin_perfmon] |
ibm_spectrum_lsf.perfmon.host.queries.current (gauge) | The current number of host information queries in the sampling period. [Reported by badmin_perfmon] |
ibm_spectrum_lsf.perfmon.host.queries.max (gauge) | The max number of host information queries in the sampling period. [Reported by badmin_perfmon] |
ibm_spectrum_lsf.perfmon.host.queries.min (gauge) | The min number of host information queries in the sampling period. [Reported by badmin_perfmon] |
ibm_spectrum_lsf.perfmon.host.queries.total (gauge) | The total number of host information queries in the sampling period. [Reported by badmin_perfmon] |
ibm_spectrum_lsf.perfmon.jobs.accepted_remote.avg (gauge) | The average number of jobs accepted from remote cluster in the sampling period. [Reported by badmin_perfmon] Shown as job |
ibm_spectrum_lsf.perfmon.jobs.accepted_remote.current (gauge) | The current number of jobs accepted from remote cluster in the sampling period. [Reported by badmin_perfmon] Shown as job |
ibm_spectrum_lsf.perfmon.jobs.accepted_remote.max (gauge) | The max number of jobs accepted from remote cluster in the sampling period. [Reported by badmin_perfmon] Shown as job |
ibm_spectrum_lsf.perfmon.jobs.accepted_remote.min (gauge) | The min number of jobs accepted from remote cluster in the sampling period. [Reported by badmin_perfmon] Shown as job |
ibm_spectrum_lsf.perfmon.jobs.accepted_remote.total (gauge) | The total number of jobs accepted from remote cluster in the sampling period. [Reported by badmin_perfmon] Shown as job |
ibm_spectrum_lsf.perfmon.jobs.buckets.avg (gauge) | The average number of scheduler buckets in which jobs are put based on resource requirements and different scheduling policies. [Reported by badmin_perfmon] |
ibm_spectrum_lsf.perfmon.jobs.buckets.current (gauge) | The current number of scheduler buckets in which jobs are put based on resource requirements and different scheduling policies. [Reported by badmin_perfmon] |
ibm_spectrum_lsf.perfmon.jobs.buckets.max (gauge) | The max number of scheduler buckets in which jobs are put based on resource requirements and different scheduling policies. [Reported by badmin_perfmon] |
ibm_spectrum_lsf.perfmon.jobs.buckets.min (gauge) | The min number of scheduler buckets in which jobs are put based on resource requirements and different scheduling policies. [Reported by badmin_perfmon] |
ibm_spectrum_lsf.perfmon.jobs.buckets.total (gauge) | The total number of scheduler buckets in which jobs are put based on resource requirements and different scheduling policies. [Reported by badmin_perfmon] |
ibm_spectrum_lsf.perfmon.jobs.completed.avg (gauge) | The average amount of jobs completed in the sampling period. [Reported by badmin_perfmon] Shown as job |
ibm_spectrum_lsf.perfmon.jobs.completed.current (gauge) | The amount of jobs completed in the sampling period. [Reported by badmin_perfmon] Shown as job |
ibm_spectrum_lsf.perfmon.jobs.completed.max (gauge) | The max amount of jobs completed in the sampling period. [Reported by badmin_perfmon] Shown as job |
ibm_spectrum_lsf.perfmon.jobs.completed.min (gauge) | The min amount of jobs completed in the sampling period. [Reported by badmin_perfmon] Shown as job |
ibm_spectrum_lsf.perfmon.jobs.completed.total (gauge) | The total amount of jobs completed in the sampling period. [Reported by badmin_perfmon] Shown as job |
ibm_spectrum_lsf.perfmon.jobs.dispatched.avg (gauge) | The average number of jobs dispatched in the sampling period. [Reported by badmin_perfmon] Shown as job |
ibm_spectrum_lsf.perfmon.jobs.dispatched.current (gauge) | The number of jobs dispatched. [Reported by badmin_perfmon] Shown as job |
ibm_spectrum_lsf.perfmon.jobs.dispatched.max (gauge) | The max number of jobs dispatched in the sampling period. [Reported by badmin_perfmon] Shown as job |
ibm_spectrum_lsf.perfmon.jobs.dispatched.min (gauge) | The min number of jobs dispatched in the sampling period. [Reported by badmin_perfmon] Shown as job |
ibm_spectrum_lsf.perfmon.jobs.dispatched.total (gauge) | The total number of jobs dispatched in the sampling period. [Reported by badmin_perfmon] Shown as job |
ibm_spectrum_lsf.perfmon.jobs.queries.avg (gauge) | The average number of job queries in the sampling period. [Reported by badmin_perfmon] |
ibm_spectrum_lsf.perfmon.jobs.queries.current (gauge) | The number of job queries in the sampling period. [Reported by badmin_perfmon] |
ibm_spectrum_lsf.perfmon.jobs.queries.max (gauge) | The max number of job queries in the sampling period. [Reported by badmin_perfmon] |
ibm_spectrum_lsf.perfmon.jobs.queries.min (gauge) | The min number of job queries in the sampling period. [Reported by badmin_perfmon] |
ibm_spectrum_lsf.perfmon.jobs.queries.total (gauge) | The total number of job queries in the sampling period. [Reported by badmin_perfmon] |
ibm_spectrum_lsf.perfmon.jobs.reordered.avg (gauge) | The average number of jobs reordered in the sampling period, that is, the number of jobs that reused the resource allocation of a finished job. [Reported by badmin_perfmon] Shown as job |
ibm_spectrum_lsf.perfmon.jobs.reordered.current (gauge) | The number of jobs reordered in the sampling period, that is, the number of jobs that reused the resource allocation of a finished job. [Reported by badmin_perfmon] Shown as job |
ibm_spectrum_lsf.perfmon.jobs.reordered.max (gauge) | The max number of jobs reordered in the sampling period, that is, the number of jobs that reused the resource allocation of a finished job. [Reported by badmin_perfmon] Shown as job |
ibm_spectrum_lsf.perfmon.jobs.reordered.min (gauge) | The min number of jobs reordered in the sampling period, that is, the number of jobs that reused the resource allocation of a finished job. [Reported by badmin_perfmon] Shown as job |
ibm_spectrum_lsf.perfmon.jobs.reordered.total (gauge) | The total number of jobs reordered in the sampling period, that is, the number of jobs that reused the resource allocation of a finished job. [Reported by badmin_perfmon] Shown as job |
ibm_spectrum_lsf.perfmon.jobs.scheduling_interval.avg (gauge) | The average scheduling interval in the sampling period. [Reported by badmin_perfmon] Shown as second |
ibm_spectrum_lsf.perfmon.jobs.scheduling_interval.current (gauge) | The current scheduling interval in the sampling period. [Reported by badmin_perfmon] Shown as second |
ibm_spectrum_lsf.perfmon.jobs.scheduling_interval.max (gauge) | The max scheduling interval in the sampling period. [Reported by badmin_perfmon] Shown as second |
ibm_spectrum_lsf.perfmon.jobs.scheduling_interval.min (gauge) | The min scheduling interval in the sampling period. [Reported by badmin_perfmon] Shown as second |
ibm_spectrum_lsf.perfmon.jobs.scheduling_interval.total (gauge) | The total scheduling interval in the sampling period. [Reported by badmin_perfmon] Shown as second |
ibm_spectrum_lsf.perfmon.jobs.sent_remote.avg (gauge) | The average number of jobs sent to remote cluster in the sampling period. [Reported by badmin_perfmon] Shown as job |
ibm_spectrum_lsf.perfmon.jobs.sent_remote.current (gauge) | The number of jobs sent to remote cluster in the sampling period. [Reported by badmin_perfmon] Shown as job |
ibm_spectrum_lsf.perfmon.jobs.sent_remote.max (gauge) | The max number of jobs sent to remote cluster in the sampling period. [Reported by badmin_perfmon] Shown as job |
ibm_spectrum_lsf.perfmon.jobs.sent_remote.min (gauge) | The avminerage number of jobs sent to remote cluster in the sampling period. [Reported by badmin_perfmon] Shown as job |
ibm_spectrum_lsf.perfmon.jobs.sent_remote.total (gauge) | The total number of jobs sent to remote cluster in the sampling period. [Reported by badmin_perfmon] Shown as job |
ibm_spectrum_lsf.perfmon.jobs.submission_requests.avg (gauge) | The average number of job submission requests in the sampling period. [Reported by badmin_perfmon] Shown as request |
ibm_spectrum_lsf.perfmon.jobs.submission_requests.current (gauge) | The number of job submission requests in the sampling period. [Reported by badmin_perfmon] Shown as request |
ibm_spectrum_lsf.perfmon.jobs.submission_requests.max (gauge) | The max number of job submission requests in the sampling period. [Reported by badmin_perfmon] Shown as request |
ibm_spectrum_lsf.perfmon.jobs.submission_requests.min (gauge) | The min number of job submission requests in the sampling period. [Reported by badmin_perfmon] Shown as request |
ibm_spectrum_lsf.perfmon.jobs.submission_requests.total (gauge) | The total number of job submission requests in the sampling period. [Reported by badmin_perfmon] Shown as request |
ibm_spectrum_lsf.perfmon.jobs.submitted.avg (gauge) | The average number of jobs submitted in the sampling period. [Reported by badmin_perfmon] Shown as job |
ibm_spectrum_lsf.perfmon.jobs.submitted.current (gauge) | The number of jobs submitted in the sampling period. [Reported by badmin_perfmon] Shown as job |
ibm_spectrum_lsf.perfmon.jobs.submitted.max (gauge) | The max number of jobs submitted in the sampling period. [Reported by badmin_perfmon] Shown as job |
ibm_spectrum_lsf.perfmon.jobs.submitted.min (gauge) | The min number of jobs submitted in the sampling period. [Reported by badmin_perfmon] Shown as job |
ibm_spectrum_lsf.perfmon.jobs.submitted.total (gauge) | The total number of jobs submitted in the sampling period. [Reported by badmin_perfmon] Shown as job |
ibm_spectrum_lsf.perfmon.mbatchd.processed_requests.avg (gauge) | The average number of queries handled by mbatchd in the sampling period. [Reported by badmin_perfmon] Shown as request |
ibm_spectrum_lsf.perfmon.mbatchd.processed_requests.current (gauge) | The number of queries handled by mbatchd in the sampling period. [Reported by badmin_perfmon] Shown as request |
ibm_spectrum_lsf.perfmon.mbatchd.processed_requests.max (gauge) | The max number of queries handled by mbatchd in the sampling period. [Reported by badmin_perfmon] Shown as request |
ibm_spectrum_lsf.perfmon.mbatchd.processed_requests.min (gauge) | The min number of queries handled by mbatchd in the sampling period. [Reported by badmin_perfmon] Shown as request |
ibm_spectrum_lsf.perfmon.mbatchd.processed_requests.total (gauge) | The total number of queries handled by mbatchd in the sampling period. [Reported by badmin_perfmon] Shown as request |
ibm_spectrum_lsf.perfmon.memory.utilization.current (gauge) | Current memory utilization. [Reported by badmin_perfmon] |
ibm_spectrum_lsf.perfmon.memory.utilization.total (gauge) | Total memory utilization. [Reported by badmin_perfmon] |
ibm_spectrum_lsf.perfmon.queue.queries.avg (gauge) | The average number of queue queries in the sampling period. [Reported by badmin_perfmon] |
ibm_spectrum_lsf.perfmon.queue.queries.current (gauge) | The number of queue queries in the sampling period. [Reported by badmin_perfmon] |
ibm_spectrum_lsf.perfmon.queue.queries.max (gauge) | The max number of queue queries in the sampling period. [Reported by badmin_perfmon] |
ibm_spectrum_lsf.perfmon.queue.queries.min (gauge) | The min number of queue queries in the sampling period. [Reported by badmin_perfmon] |
ibm_spectrum_lsf.perfmon.queue.queries.total (gauge) | The total number of queue queries in the sampling period. [Reported by badmin_perfmon] |
ibm_spectrum_lsf.perfmon.scheduler.host_matches.avg (gauge) | The average number of hosts matching the resource criteria for a job. [Reported by badmin_perfmon] Shown as host |
ibm_spectrum_lsf.perfmon.scheduler.host_matches.current (gauge) | The number of hosts matching the resource criteria for a job. [Reported by badmin_perfmon] Shown as host |
ibm_spectrum_lsf.perfmon.scheduler.host_matches.max (gauge) | The max number of hosts matching the resource criteria for a job. [Reported by badmin_perfmon] Shown as host |
ibm_spectrum_lsf.perfmon.scheduler.host_matches.min (gauge) | The min number of hosts matching the resource criteria for a job. [Reported by badmin_perfmon] Shown as host |
ibm_spectrum_lsf.perfmon.scheduler.host_matches.total (gauge) | The total number of hosts matching the resource criteria for a job in the sampling period. [Reported by badmin_perfmon] Shown as host |
ibm_spectrum_lsf.perfmon.slots.utilization.current (gauge) | The current slot utilization. [Reported by badmin_perfmon] |
ibm_spectrum_lsf.perfmon.slots.utilization.total (gauge) | The total slot utilization of the sampling period. [Reported by badmin_perfmon] |
ibm_spectrum_lsf.queue.is_active (gauge) | Whether or not jobs in the queue can be started. [Reported by bqueues] |
ibm_spectrum_lsf.queue.is_open (gauge) | Whether or not the queue can accept jobs. [Reported by bqueues] |
ibm_spectrum_lsf.queue.max_jobs (gauge) | The maximum number of job slots that can be used by the jobs from the queue. These job slots are used by dispatched jobs that are not yet finished, and by pending jobs that reserve slots. [Reported by bqueues] |
ibm_spectrum_lsf.queue.max_jobs_per_host (gauge) | The maximum number of job slots a host can allocate from this queue. [Reported by bqueues] |
ibm_spectrum_lsf.queue.max_jobs_per_processor (gauge) | The maximum number of job slots a processor can process from the queue. [Reported by bqueues] |
ibm_spectrum_lsf.queue.max_jobs_per_user (gauge) | The maximum number of job slots each user can use for jobs in the queue. [Reported by bqueues] |
ibm_spectrum_lsf.queue.num_job_slots (gauge) | The total number of slots for jobs in the queue. [Reported by bqueues] |
ibm_spectrum_lsf.queue.pending (gauge) | The total number of tasks for all pending jobs in the queue. [Reported by bqueues] Shown as job |
ibm_spectrum_lsf.queue.priority (gauge) | The priority of the queue. The larger the value, the higher the priority. [Reported by bqueues] |
ibm_spectrum_lsf.queue.running (gauge) | The total number of tasks for all running jobs in the queue. If the -alloc option is used, the total is allocated slots for the jobs in the queue. [Reported by bqueues] Shown as task |
ibm_spectrum_lsf.queue.suspended (gauge) | The total number of tasks for all suspended jobs in the queue. [Reported by bqueues] Shown as task |
ibm_spectrum_lsf.server.gpu.num_gpus (gauge) | The total number of GPUs. [Reported by bhosts_gpu] |
ibm_spectrum_lsf.server.gpu.num_gpus_alloc (gauge) | The current total number of GPUs that are allocated to be used by a job. [Reported by bhosts_gpu] |
ibm_spectrum_lsf.server.gpu.num_gpus_exclusive_alloc (gauge) | The current total number of GPUs that are allocated to be used exclusive by the job. [Reported by bhosts_gpu] |
ibm_spectrum_lsf.server.gpu.num_gpus_exclusive_available (gauge) | The current total number of GPUs that are used exclusive by the job. [Reported by bhosts_gpu] |
ibm_spectrum_lsf.server.gpu.num_gpus_jexclusive_alloc (gauge) | The total number of GPUs allocated exclusively for a job. [Reported by bhosts_gpu] |
ibm_spectrum_lsf.server.gpu.num_gpus_shared_alloc (gauge) | The total number of GPUs allocated but shared. [Reported by bhosts_gpu] |
ibm_spectrum_lsf.server.gpu.num_gpus_shared_available (gauge) | The current total number of GPUs that are available for concurrent use by multiple jobs. [Reported by bhosts_gpu] |
ibm_spectrum_lsf.server.max_jobs (gauge) | The maximum number of job slots available. A -1 indicates no limit. [Reported by bhosts] Shown as job |
ibm_spectrum_lsf.server.num_jobs (gauge) | The number of tasks for all jobs that are dispatched to the host. The NJOBS value includes running, suspended, and chunk jobs. [Reported by bhosts] Shown as task |
ibm_spectrum_lsf.server.reserved (gauge) | The number of tasks for all pending jobs with reserved slots on the host. [Reported by bhosts] Shown as task |
ibm_spectrum_lsf.server.running (gauge) | The number of tasks for all running jobs on the host. [Reported by bhosts] |
ibm_spectrum_lsf.server.slots_per_user (gauge) | The maximum number of job slots that the host can process on a per user basis. A -1 indicates no limit. [Reported by bhosts] |
ibm_spectrum_lsf.server.status (gauge) | The status of the host and the sbatchd daemon. Batch jobs can be dispatched only to hosts with an ok status. 1 if ok, 0 otherwise. [Reported by bhosts] |
ibm_spectrum_lsf.server.suspended (gauge) | The number of tasks for all system suspended jobs on the host. [Reported by bhosts] |
ibm_spectrum_lsf.server.user_suspended (gauge) | The number of tasks for all user suspended jobs on the host. Jobs can be suspended by the user or by the LSF administrator. [Reported by bhosts] |
ibm_spectrum_lsf.slots.backfill.available (gauge) | The available slots for backfill jobs. [Reported by bslots] |
ibm_spectrum_lsf.slots.runtime_limit (gauge) | The runtime limit for the backfill slots. [Reported by bslots] |