Solaris Cluster 3.x or 4.x How to Establish Permanent guds Performance Monitoring to cover Probe Timeouts from SUNWsczone or other Agent / Data Service (Doc ID 1914195.1)

Last updated on NOVEMBER 29, 2016

Applies to:

Solaris Cluster - Version 3.2 12/06 to OSC 4.3 [Release 3.2 to 4.3]
Oracle Solaris on x86-64 (64-bit)
Oracle Solaris on SPARC (64-bit)

Goal

The Solaris Cluster HA data service for Solaris zones (package SUNWsczone) sometimes gets into probe timeouts causing an unwanted restart of the zone. In many of these cases no exact root cause can be provided due to missing diagnostic data covering the probe timeout. These service request mostly end up in a prophylactic raise of the probe timeout as addressed in:
    RFE 15756457: SUNBT7114158 Increase the default timeout of the probe for Container Agent.

This document describes how to set up 24x7 guds-monitoring to trap  probe timeouts from the HA container agent with suitable diagnostic data, without piling up huge amounts of performance data.

 

Solution

Sign In with your My Oracle Support account

Don't have a My Oracle Support account? Click to get started

My Oracle Support provides customers with access to over a
Million Knowledge Articles and hundreds of Community platforms