S.M.A.R.T is a monitoring system included in computer hard disk drives (HDDs) and solid-state drives (SSDs) that detects and reports on various indicators of drive reliability to help anticipate hardware failures.

Why use the S.M.A.R.T Telegraf plugin?

Collecting and understanding drive health across your infrastructure or ecosystem can help identify hardware problems, predict drive failure, and assist with proactive maintenance. This data can help identify which drives may require replacement before failures occur, reducing potential data loss or system downtime.

How to monitor S.M.A.R.T. using the Telegraf plugin

You will want to be sure to update permissions for your system so that the plugin can access S.M.A.R.T data. Once you’ve done that, the plugin configuration is straightforward. You can set several options within the plugin config to meet your monitoring needs.

The SMART Input Plugin gets metrics using the command line utility smartctl for SMART (Self-Monitoring, Analysis and Reporting Technology) storage devices. SMART is a monitoring system included in computer hard disk drives (HDDs) and solid-state drives (SSDs), which include most modern ATA/SATA, SCSI/SAS and NVMe disks. The plugin detects and reports on various indicators of drive reliability, with the intent of enabling the anticipation of hardware failures.

Key S.M.A.R.T monitoring metrics to monitor

Some of the most important S.M.A.R.T. monitoring metrics that you should proactively monitor include:

  • smart_device:
    • tags:
      • capacity
      • device
      • enabled
      • model
      • serial_no
      • wwn
    • fields:
      • exit_status
      • health_ok
      • read_error_rate
      • seek_error
      • temp_c
      • udma_crc_errors
  • smart_attribute:
    • tags:
      • capacity
      • device
      • enabled
      • fail
      • flags
      • id
      • model
      • name
      • serial_no
      • wwn
    • fields:
      • exit_status
      • raw_value
      • threshold
      • value
      • worst

For more information, please check out the documentation.

Project URL   Documentation

Related resources

HddTemp

The Hddtemp Input Plugin reads data from `hddtemp` daemons.

Kernel

The Kernel Input Plugin gathers kernel statistics from `/proc/stat`.

Ethtool

The Ethtool Telegraf Plugin gathers ethernet device statistics.

Scroll to Top