RP/MSQ 4.0A(Unix/NT) - RP45 reconnect fails with mulithomed systems if downtime is greater than 2 minutes. (Doc ID 768823.1)

Last updated on NOVEMBER 03, 2016

Applies to:

Oracle MessageQ / MessageQ / V4.0A/UNIX,NT
Information in this document applies to any platform

Goal

In an XGROUP which has multiple IP addresses, suppose a host which is set Initiate=Y loses its connection.  If this
disconnected condition lasts for more than 2 minutes, then the host will not try to reconnect any longer.  This
appears to be a bug introduced in RP45 for MessageQ V4.0A for Digital UNIX.

Host frsv_C1
G023            23  frsv_C1_es0 Y  2000000    -1  5  5 25000 TCPIP 30000
G023            23  frsv_C1_es1 Y  2000000    -1  5  5 25000 TCPIP 30000
G024            24  frsv_C2_es0 Y  2000000    -1  5  5 25000 TCPIP 30000
G024            24  frsv_C2_es1 Y  2000000    -1  5  5 25000 TCPIP 30000

Host frsv_C2
G023            23  frsv_C1_es0 N  2000000    -1  5  5 25000 TCPIP 30000
G023            23  frsv_C1_es1 N  2000000    -1  5  5 25000 TCPIP 30000
G024            24  frsv_C2_es0 Y  2000000    -1  5  5 25000 TCPIP 30000
G024            24  frsv_C2_es1 Y  2000000    -1  5  5 25000 TCPIP 30000

A connection is established between frsv_C1_es0 and frsv_C2_es0.  If it is lost by e.g. shutdown of dmq on frsv_C2,
then frsv_C1 sends reconnection packets to frsv_C2, because frsv_C1 is set Initiate=Y.  Problem is that this
reconnection attempt lasts for only 2 minutes.  If frsv_C2's dmq gets back up after a longer downtime than 2 minutes,
the connection cannot be re-established.  We see this problem on both es0 and es1.

RP42  and 44 don't appear to have tjhis problem.  This also  works fine with an XGROUP with only one IP address.

Solution

Sign In with your My Oracle Support account

Don't have a My Oracle Support account? Click to get started

My Oracle Support provides customers with access to over a
Million Knowledge Articles and hundreds of Community platforms