SMART: Disk maintenance

Offline_Uncorrectable

by ross at 04:38:01 on November 2, 2011

If you have enabled the smartd daemon, one day you may receive an email like this:

This email was generated by the smartd daemon running on:

   host name: slim.local
  DNS domain: local
  NIS domain: 

The following warning/error was logged by the smartd daemon:

Device: /dev/ad6, 1 Offline uncorrectable sectors


For details see host's SYSLOG.

You can also use the smartctl utility for further investigation.
No additional email messages about this problem will be sent.

Check SMART values:

# smartctl -A /dev/ad6
smartctl 5.41 2011-06-09 r3365 [FreeBSD 8.2-STABLE amd64] (local build)
Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net

=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   200   200   051    Pre-fail  Always       -       0
  3 Spin_Up_Time            0x0003   184   176   021    Pre-fail  Always       -       5791
  4 Start_Stop_Count        0x0032   099   099   000    Old_age   Always       -       1102
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x000e   200   200   051    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   073   073   000    Old_age   Always       -       20299
 10 Spin_Retry_Count        0x0012   100   100   051    Old_age   Always       -       0
 11 Calibration_Retry_Count 0x0012   100   100   051    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   099   099   000    Old_age   Always       -       1098
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       605
193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       1102
194 Temperature_Celsius     0x0022   104   091   000    Old_age   Always       -       46
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0012   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0010   200   200   000    Old_age   Offline      -       1
199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       2
200 Multi_Zone_Error_Rate   0x0008   200   200   051    Old_age   Offline      -       0
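When this check is scripted, the raw value of a single attribute can be pulled out of the `smartctl -A` output with awk. The following is a minimal sketch, assuming the column layout shown above (RAW_VALUE is the 10th field); `smart_raw_value` is a hypothetical helper name, not part of smartmontools.

```shell
#!/bin/sh
# Print the RAW_VALUE column (10th field) of one SMART attribute.
# Reads `smartctl -A` output on stdin; $1 is the attribute ID, e.g. 198.
# smart_raw_value is a hypothetical helper name used only for illustration.
smart_raw_value() {
    awk -v id="$1" '$1 == id { print $10; exit }'
}

# Example (needs root and smartmontools installed):
#   smartctl -A /dev/ad6 | smart_raw_value 198
```

Reading from stdin keeps the helper independent of the device name, so it also works on saved output.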

Offline_Uncorrectable is the number of sectors that the drive tried to correct on its own but could not. Running the offline test should make the drive test those sectors again and attempt to fix them. Not all drives support this, though.

# smartctl -t offline /dev/ad6
smartctl 5.41 2011-06-09 r3365 [FreeBSD 8.2-STABLE amd64] (local build)
Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net

=== START OF OFFLINE IMMEDIATE AND SELF-TEST SECTION ===
Sending command: "Execute SMART off-line routine immediately in off-line mode".
Drive command "Execute SMART off-line routine immediately in off-line mode" successful.
Testing has begun.
Please wait 13200 seconds for test to complete.
Test will complete after Tue Nov  1 14:07:39 2011

Use smartctl -X to abort test.
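The wait time is reported in seconds, which is awkward to read. A small sketch of the conversion to hours and minutes follows; `fmt_duration` is a hypothetical helper name. On many drives `smartctl -c /dev/ad6` also reports the offline data collection status, which may help to tell whether the routine is still running.

```shell
#!/bin/sh
# Convert a duration in seconds to hours and minutes.
# fmt_duration is a hypothetical helper name used only for illustration.
fmt_duration() {
    secs=$1
    printf '%dh %02dm\n' $((secs / 3600)) $((secs % 3600 / 60))
}

fmt_duration 13200   # prints "3h 40m"
```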

When the test has completed, check the attributes again:

# smartctl -A /dev/ad6
smartctl 5.41 2011-06-09 r3365 [FreeBSD 8.2-STABLE amd64] (local build)
Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net

=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   200   200   051    Pre-fail  Always       -       0
  3 Spin_Up_Time            0x0003   184   176   021    Pre-fail  Always       -       5783
  4 Start_Stop_Count        0x0032   099   099   000    Old_age   Always       -       1103
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x000e   200   200   051    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   073   073   000    Old_age   Always       -       20308
 10 Spin_Retry_Count        0x0012   100   100   051    Old_age   Always       -       0
 11 Calibration_Retry_Count 0x0012   100   100   051    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   099   099   000    Old_age   Always       -       1099
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       606
193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       1103
194 Temperature_Celsius     0x0022   114   091   000    Old_age   Always       -       36
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0012   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0010   200   200   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       2
200 Multi_Zone_Error_Rate   0x0008   200   200   051    Old_age   Offline      -       0

The Offline_Uncorrectable counter (attribute 198) is back to zero.
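To see at a glance which raw values changed between two runs, the two outputs can be saved to files and compared. This is only a sketch under the same column-layout assumption (attribute name in field 2, RAW_VALUE in field 10); `smart_raw_diff` and the file names are hypothetical.

```shell
#!/bin/sh
# List attributes whose RAW_VALUE differs between two saved `smartctl -A` outputs.
# $1 is the earlier snapshot, $2 the later one.
# smart_raw_diff is a hypothetical helper name used only for illustration.
smart_raw_diff() {
    awk '
        # First file: remember the raw value of every attribute row.
        NR == FNR { if ($1 ~ /^[0-9]+$/) before[$1] = $10; next }
        # Second file: report rows whose raw value is different.
        $1 ~ /^[0-9]+$/ && ($1 in before) && before[$1] != $10 {
            print $2 ": " before[$1] " -> " $10
        }
    ' "$1" "$2"
}

# Example (needs root and smartmontools installed):
#   smartctl -A /dev/ad6 > before.txt
#   ... run the offline test and wait for it to finish ...
#   smartctl -A /dev/ad6 > after.txt
#   smart_raw_diff before.txt after.txt
```

Counters such as Power_On_Hours will normally show up as well, since they keep increasing while the test runs.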

 
