Fault Management Daemon (fmd) Threads May Hang In ct_barrier_acquire() Causing IO Related Commands To Hang Or A System Hang (Doc ID 2097459.1)

Last updated on AUGUST 01, 2016

Applies to:

Solaris Operating System - Version 11 11/11 to 11.2 [Release 11.0]
Information in this document applies to any platform.

Symptoms

If the described issue occurs, the either IO related commands such as zpool, zfs, fmadm, cfgadm, prtconf, devfsadm, format, ls, etc may appear hung and never return output.  Live analysis using mdb or offline analysis of a LIVE or FULL system crash dump should indicate the IO related commands are waiting for a condition variable (cv) from ndi_devi_enter().  The Solaris Crash Dump Analysis Tool (SCAT) will automatically scan and display what the top cv owner is when dumping the user-land command thread.  This should point to another thread waiting in ct_barrier_acquire() as the following example shows.

 

Changes

 FMD is handling a storage issue.  The trigger is usually when storage disappears/re-appears.

Cause

Sign In with your My Oracle Support account

Don't have a My Oracle Support account? Click to get started

My Oracle Support provides customers with access to over a
Million Knowledge Articles and hundreds of Community platforms