Unnecessary and excessive replacment of disks of ZFS with transient checksum errors
(Doc ID 1956117.1)
Last updated on JANUARY 13, 2023
Applies to:
Solaris Operating System - Version 11.1 to 11.2 [Release 11.0]Oracle Solaris on x86-64 (64-bit)
Oracle Solaris on SPARC (64-bit)
Symptoms
Transient checksum errors are reported from Fault Management (FMA) which caused excessive replacment of disks.
FMA fmadm -faulty logs show
TIME EVENT-ID MSG-ID SEVERITY
--------------- ------------------------------------ -------------- ---------
Nov 02 07:10:00 100d8c54-8b37-4a2a-d90d-9df6775912f9 DISK-8000-D5 Major
Problem Status : resolved
Diag Engine : fmd / 1.2
System
Manufacturer : unknown
Name : unknown
Part_Number : unknown
Serial_Number : unknown
System Component
Manufacturer : Supermicro
Name : X9DRH-7TF-7F-iTF-iF
Part_Number : 1234567890
Serial_Number : 1234567890
Host_ID : 00c48984
----------------------------------------
Suspect 1 of 1 :
Fault class : fault.io.scsi.disk.csum-zfs.transient
Certainty : 100%
Affects : dev:///:devid=id1,sd@n5000c50056c4c287//scsi_vhci/disk@g5000c50
056c4c287
FRU
Location : "SLOT 48 3B "
Manufacturer : SEAGATE
Name : ST4000NM0023
Part_Number : SEAGATE-ST4000NM0023
Revision : 0003
Serial_Number : Z1Z10WAH00009347WQGA
Chassis
Manufacturer : NEWISYS
Name : NEWISYS-NDS-4600-JD
Part_Number : unknown
Serial_Number : 500093d00131c000
Status : repaired
Description : There have been excessive transient ZFS checksum errors on this
disk.
Response : A hot-spare disk may have been activated.
Impact : If a hot spare is available it will be brought online and during
this time I/O could be impacted. If a hot spare isn't available
then I/O could be lost and data corruption is possible.
Action : Use 'fmadm faulty' to provide a more detailed view of this event.
Please refer to the associated reference document at
http://support.oracle.com/msg/DISK-8000-D5 for the latest service
procedures and policies regarding this diagnosis.
--------------- ------------------------------------ -------------- ---------
TIME EVENT-ID MSG-ID SEVERITY
--------------- ------------------------------------ -------------- ---------
Nov 02 07:07:18 1f206ea7-1236-6530-9137-ae1e07444b9e DISK-8000-D5 Major
Problem Status : resolved
Diag Engine : fmd / 1.2
System
Manufacturer : unknown
Name : unknown
Part_Number : unknown
Serial_Number : unknown
System Component
Manufacturer : Supermicro
Name : X9DRH-7TF-7F-iTF-iF
Part_Number : 1234567890
Serial_Number : 1234567890
Host_ID : 00c48984
----------------------------------------
Suspect 1 of 1 :
Fault class : fault.io.scsi.disk.csum-zfs.transient
Certainty : 100%
Affects : dev:///:devid=id1,sd@n5000c50056b2e893//scsi_vhci/disk@g5000c50
056b2e893
FRU
Location : "SLOT 4 03 "
Manufacturer : SEAGATE
Name : ST4000NM0023
Part_Number : SEAGATE-ST4000NM0023
Revision : 0003
Serial_Number : Z1Z0WXN800009347VVYJ
Chassis
Manufacturer : NEWISYS
Name : NEWISYS-NDS-4600-JD
Part_Number : unknown
Serial_Number : 500093d00131c000
Status : repaired
Description : There have been excessive transient ZFS checksum errors on this
disk.
Response : A hot-spare disk may have been activated.
Impact : If a hot spare is available it will be brought online and during
this time I/O could be impacted. If a hot spare isn't available
then I/O could be lost and data corruption is possible.
Action : Use 'fmadm faulty' to provide a more detailed view of this event.
Please refer to the associated reference document at
http://support.oracle.com/msg/DISK-8000-D5 for the latest service
procedures and policies regarding this diagnosis.
Cause
To view full details, sign in with your My Oracle Support account. |
|
Don't have a My Oracle Support account? Click to get started! |
In this Document
Symptoms |
Cause |
Solution |
References |