Unnecessary and excessive replacement of ZFS disks with transient checksum errors (Doc ID 1956117.1)

Last updated on JULY 29, 2016

Applies to:

Solaris Operating System - Version 11.1 to 11.2 [Release 11.0]
Oracle Solaris on x86-64 (64-bit)
Oracle Solaris on SPARC (64-bit)

Symptoms

Fault Management (FMA) reports transient checksum errors, which causes excessive replacement of disks.

The FMA 'fmadm faulty' output shows:

TIME            EVENT-ID                              MSG-ID         SEVERITY
--------------- ------------------------------------  -------------- ---------
Nov 02 07:10:00 100d8c54-8b37-4a2a-d90d-9df6775912f9  DISK-8000-D5   Major

Problem Status    : resolved
Diag Engine       : fmd / 1.2
System
   Manufacturer  : unknown
   Name          : unknown
   Part_Number   : unknown
   Serial_Number : unknown

System Component
   Manufacturer  : Supermicro
   Name          : X9DRH-7TF-7F-iTF-iF
   Part_Number   : 1234567890
   Serial_Number : 1234567890
   Host_ID       : 00c48984

----------------------------------------
Suspect 1 of 1 :
  Fault class : fault.io.scsi.disk.csum-zfs.transient
  Certainty   : 100%
  Affects     : dev:///:devid=id1,sd@n5000c50056c4c287//scsi_vhci/disk@g5000c50056c4c287

  FRU
    Location         : "SLOT 48 3B  "
    Manufacturer     : SEAGATE
    Name             : ST4000NM0023
    Part_Number      : SEAGATE-ST4000NM0023
    Revision         : 0003
    Serial_Number    : Z1Z10WAH00009347WQGA
    Chassis
       Manufacturer  : NEWISYS
       Name          : NEWISYS-NDS-4600-JD
       Part_Number   : unknown
       Serial_Number : 500093d00131c000
       Status        : repaired

Description : There have been excessive transient ZFS checksum errors on this
             disk.
Response    : A hot-spare disk may have been activated.

Impact      : If a hot spare is available it will be brought online and during
             this time I/O could be impacted. If a hot spare isn't available
             then I/O could be lost and data corruption is possible.

Action      : Use 'fmadm faulty' to provide a more detailed view of this event.
             Please refer to the associated reference document at
             http://support.oracle.com/msg/DISK-8000-D5 for the latest service
             procedures and policies regarding this diagnosis.


--------------- ------------------------------------  -------------- ---------
TIME            EVENT-ID                              MSG-ID         SEVERITY
--------------- ------------------------------------  -------------- ---------
Nov 02 07:07:18 1f206ea7-1236-6530-9137-ae1e07444b9e  DISK-8000-D5   Major

Problem Status    : resolved
Diag Engine       : fmd / 1.2
System
   Manufacturer  : unknown
   Name          : unknown
   Part_Number   : unknown
   Serial_Number : unknown

System Component
   Manufacturer  : Supermicro
   Name          : X9DRH-7TF-7F-iTF-iF
   Part_Number   : 1234567890
   Serial_Number : 1234567890
   Host_ID       : 00c48984
----------------------------------------
Suspect 1 of 1 :
  Fault class : fault.io.scsi.disk.csum-zfs.transient
  Certainty   : 100%
  Affects     : dev:///:devid=id1,sd@n5000c50056b2e893//scsi_vhci/disk@g5000c50056b2e893

  FRU
    Location         : "SLOT  4 03  "
    Manufacturer     : SEAGATE
    Name             : ST4000NM0023
    Part_Number      : SEAGATE-ST4000NM0023
    Revision         : 0003
    Serial_Number    : Z1Z0WXN800009347VVYJ
    Chassis
       Manufacturer  : NEWISYS
       Name          : NEWISYS-NDS-4600-JD
       Part_Number   : unknown
       Serial_Number : 500093d00131c000
       Status        : repaired

Description : There have been excessive transient ZFS checksum errors on this
             disk.

Response    : A hot-spare disk may have been activated.

Impact      : If a hot spare is available it will be brought online and during
             this time I/O could be impacted. If a hot spare isn't available
             then I/O could be lost and data corruption is possible.

Action      : Use 'fmadm faulty' to provide a more detailed view of this event.
             Please refer to the associated reference document at
             http://support.oracle.com/msg/DISK-8000-D5 for the latest service
             procedures and policies regarding this diagnosis.
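
As a rough illustration only, and assuming the 'fmadm faulty' output has been saved as text in the layout shown above, a short Python sketch could list the disks flagged with the fault.io.scsi.disk.csum-zfs.transient class. The function name transient_checksum_suspects and the SAMPLE excerpt are hypothetical and are not part of any Oracle tool. The field names (Fault class, Location, Name, Serial_Number) are taken from the output in this note, and other Solaris releases may format them differently.

```python
import re

# Fault class exactly as it appears in the output above.
TRANSIENT_CLASS = "fault.io.scsi.disk.csum-zfs.transient"

CLASS_RE = re.compile(r"Fault class\s*:\s*(\S+)")
FIELD_RE = re.compile(r"(Location|Name|Serial_Number)\s*:\s*(.*)")


def transient_checksum_suspects(text):
    """Return one dict per suspect whose fault class is TRANSIENT_CLASS.

    Hypothetical helper: it only understands the text layout shown in this note.
    """
    suspects = []
    current = None       # suspect currently being filled in
    in_chassis = False   # True while reading the nested Chassis fields
    for raw in text.splitlines():
        line = raw.strip()
        match = CLASS_RE.match(line)
        if match:
            current = {"fault_class": match.group(1)}
            suspects.append(current)
            in_chassis = False
            continue
        if current is None:
            continue  # still in the System / System Component header
        if line.startswith("Description"):
            current = None  # the FRU details for this suspect are finished
            continue
        if line == "Chassis":
            in_chassis = True  # skip chassis Name / Serial_Number
            continue
        if in_chassis:
            continue
        match = FIELD_RE.match(line)
        if match:
            value = match.group(2).strip().strip('"').strip()
            current[match.group(1).lower()] = value
    return [s for s in suspects if s["fault_class"] == TRANSIENT_CLASS]


# Excerpt of the first event above, shortened for illustration.
SAMPLE = """\
Suspect 1 of 1 :
  Fault class : fault.io.scsi.disk.csum-zfs.transient
  Certainty   : 100%

  FRU
    Location         : "SLOT 48 3B  "
    Manufacturer     : SEAGATE
    Name             : ST4000NM0023
    Serial_Number    : Z1Z10WAH00009347WQGA
    Chassis
       Name          : NEWISYS-NDS-4600-JD
       Serial_Number : 500093d00131c000

Description : There have been excessive transient ZFS checksum errors on this
"""

if __name__ == "__main__":
    for suspect in transient_checksum_suspects(SAMPLE):
        print(suspect.get("location"), suspect.get("name"),
              suspect.get("serial_number"))
```

A listing like this can help confirm which slots were flagged with the transient fault class before any disk is pulled.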

 

Cause
