My Oracle Support Banner

Introducing Cluster Health Monitor (IPD/OS) (Doc ID 736752.1)

Last updated on JUNE 18, 2023

Applies to:

Oracle Database - Standard Edition - Version 10.1.0.2 to 11.1.0.7 [Release 10.1 to 11.1]
Oracle Database Cloud Schema Service - Version N/A and later
Oracle Database Backup Service - Version N/A and later
Oracle Database Cloud Service - Version N/A and later
Oracle Cloud Infrastructure - Database Service - Version N/A and later
Generic Linux

Details

What is Cluster Health Monitor (IPD/OS)?

Cluster Health Monitor - also known as IPD/OS is a set of tools to collect Operating system performance data periodically and automatically. The data is stored for both online and offline analysis.

Please refer to the Cluster Health Monitor (CHM) FAQ <Document 1328466.1> for more information about Cluster Health Monitor

Where can I get latest copy of Cluster Health Monitor?

The latest copy of Cluster Health Monitor (IPD/OS) is always with the install image if the version is 11.2.0.2 or greater.  Please note that Cluster Health Monitor is available on only selected platforms.

For Linux , the pre-11.2.0.2 version of Cluster Health Monitor (IPD/OS) can be downloaded from :

  http://www.oracle.com/technetwork/database/options/clustering/downloads/index.html

Cluster Health Monitor Diagnostic Data Collection Process

The tool collects OS performance data. This data can be used to tune Single Instance, RAC performance tuning. It can also be used to find out root cause for Oracle Clusterware eviction especially ones caused by scheduler issues or high CPU loads. Generic performance collection tools sometimes have trouble collecting data when the OS gets very busy. This is where Cluster Health Monitor comes into play.

Why Cluster Health Monitor ?

Oracle Clusterware & Oracle database performance/node reboot due to lack of CPU/Memory resources cause Customers to ask how to monitor their OS. Some customers have rudimentary scripts that utilize vmstat, mpstat but they are often not collected at regular intervals. In some cases, we have seen customers collect this once per hour which does not make it very useful when the node is hung/evited via reboot in the middle of the hour. OSwatcher did a wonderful job of making the data collection uniform with uniform collection intervals. Cluster Health Monitor extends OSwatcher by ensuring it is always scheduled and collects data points while providing a client GUI to view current load.

What platforms can I run the Cluster Health Monitor?

The Cluster Health Monitor is NOT available for Itanium platform (Linux, Windows, and HP Itanium) on all version.

11.2.0.1 and earlier: Linux only (download from OTN)
11.2.0.2: Solaris (Sparc and x86-64) and Linux
11.2.0.3: AIX, Solaris (Sparc and x86-64), Linux , and Windows

Actions

Installation

For OTN version of Cluster Health Monitor, the complete steps to install the tool is explained in the readme file shipped with the product

For 11.2.0.2 or later version, the cluster health monitor is installed automatically when Grid Infrastructure (aka CRS) is installed.  The resource name for Cluster Health Monitor is ora.crf that is managed by ohasd.

Usage

The tool can be used by Customers to monitor their nodes online or offline. Generally when working with Oracle support, the data is viewed offline.

Please note that $ORACRF_HOME is /usr/lib/oracrf if Cluster Health Monitor is from OTN
and $ORACRF_HOME is GI_HOME if Cluster Health Monitor is installed with Grid Infrastructure (11.2.0.2 or later)

Non-GUI Mode (preferred for gathering the data)

The $ORACRF_HOME/bin/oclumon command can be used to get the load information.
Execute oclumon -h option to see the help

For help from command line : oclumon <verb> -h
For help in interactive mode : <verb> -h
Currently supported verbs are :
showtrail, showobjects, dumpnodeview, manage, version, debug, quit and help

There are various attributes that can be used to find out the performance problem.

Some useful attributes that can be passed to oclumon are
  1. Showobjects


    Make sure <your-directory> has more than 2Gb space to create file<your-filename>
    Zip or compress <your-filename> before uploading to the Service Request.

    Also update the SR with the information when (date and time) you have observed a specific issue.

    Contacts

    To view full details, sign in with your My Oracle Support account.

    Don't have a My Oracle Support account? Click to get started!


    In this Document
    Details
     What is Cluster Health Monitor (IPD/OS)?
     Where can I get latest copy of Cluster Health Monitor?
     Cluster Health Monitor Diagnostic Data Collection Process
     Why Cluster Health Monitor ?
     What platforms can I run the Cluster Health Monitor?
    Actions
     Installation
     Usage
     Data Collection
    Contacts
     Scalability RAC Community
    References

    My Oracle Support provides customers with access to over a million knowledge articles and a vibrant support community of peers and Oracle experts.