RP/TUX 6.4, 6.5, 6.51, 7.1, 8.0 - BB deadlock when network partitioned and app servers killed

(Doc ID 770773.1)

Last updated on NOVEMBER 04, 2016

Applies to:

Oracle Tuxedo / Tuxedo / 6.4, 6.5x, 7.1, 8.0
Information in this document applies to any platform

Goal

PRODUCT: TUXEDO
VERSION: 6.4, 7.1, 8.0 
PLATFORM: SOLARIS
ISSUE:
The customer is using MP configuration with network failover/failback.  After the primary network interface is brought
down,   they try to shutdown some of their application servers.  Since MIB request is not working,  they kill the
servers by "kill pid" (SIGTERM).   This leads to a BB dead lock situation,  with pages of the following
errors in ULOG:

        BRIDGE.9206: WARN: Could not lock USEM
        BRIDGE.9206: LIBTUX_CAT:746: ERROR: Cannot lock Bulletin Board
       
This problem seems to related to secondary event brokers running in the slave node,  it does not happen using simple
testcase without TMUSREVT/TMSYSEVT.

Solution

Sign In with your My Oracle Support account

Don't have a My Oracle Support account? Click to get started

My Oracle Support provides customers with access to over a
Million Knowledge Articles and hundreds of Community platforms