AIX: ARP Cache Not Updated After HAIP Failover
(Doc ID 2831749.1)
Last updated on JULY 20, 2024
Applies to:
Oracle Database - Enterprise Edition - Version 19.3.0.0.0 and laterIBM AIX on POWER Systems (64-bit)
Symptoms
On 19C GI environment on IBM platform, if there are multiple private networks and one of the network interface is down and then up, communication problems can happen on HAIP layer, ASM & DB instances may hang or crash, the issue can cause around 20 minutes outage.
Main Symptoms:
In ohasd agent trace of node 2 , HAIP [169.254.xxx.236] failed over from [en3] to [en4] at [08:45:03], and then failed back from [en4] to [en3] 5 seconds later at [08:45:08], ARP cache was also cleaned up successfully, everything looks fine.
In the ARP table of node 1, [169.254.xxx.236] changed from [1a:16:xx:xx:14:17] to [1a:16:xx:xx:14:18] at [08:45:29], this is expected as the HAIP failed over from [en3] to [en4], however, until 20 minutes later at [09:06:02], it changed back to [1a:16:xx:xx:14:17]. HAIP couldn't work during this time, which caused ASM & DB communication problems.
Changes
No changes.
Cause
To view full details, sign in with your My Oracle Support account. |
|
Don't have a My Oracle Support account? Click to get started! |
In this Document
Symptoms |
Changes |
Cause |
Solution |
References |