Global InfiniBand commands, such as "ibnetdiscover" or "ibswitches", fail with "mad_rpc" warning messages
(Doc ID 1505294.1)
Last updated on OCTOBER 15, 2020
Applies to:
Oracle Exalogic Elastic Cloud X2-2 Full Rack - Version X2 and later
Linux x86
Symptoms
Symptoms include very slow responses to global InfiniBand commands (ibnetdiscover, ibswitches), along with errors such as the following:
ibwarn: [31013] mad_rpc: _do_madrpc failed; dport (DR path slid 0; dlid 0; 0,26,13)
ibwarn: [31013] discover: can't reach node DR path slid 0; dlid 0; 0,26,13 port 16
ibwarn: [31013] _do_madrpc: recv failed: Connection timed out
ibwarn: [31013] mad_rpc: _do_madrpc failed; dport (DR path slid 0; dlid 0; 0,26,13,18)
ibwarn: [31013] handle_port: NodeInfo on DR path slid 0; dlid 0; 0,26,13,18 failed, skipping port
ibwarn: [31013] _do_madrpc: recv failed: Invalid argument
ibwarn: [31013] mad_rpc: _do_madrpc failed; dport (DR path slid 0; dlid 0; 0,26,13)
ibwarn: [31013] discover: can't reach node DR path slid 0; dlid 0; 0,26,13 port 19
ibwarn: [31013] mad_rpc: _do_madrpc failed; dport (DR path slid 0; dlid 0; 0,26,13,33)
ibwarn: [31013] handle_port: NodeInfo on DR path slid 0; dlid 0; 0,26,13,33 failed, skipping port
ibwarn: [31013] mad_rpc: _do_madrpc failed; dport (DR path slid 0; dlid 0; 0,26,13,34)
ibwarn: [31013] handle_port: NodeInfo on DR path slid 0; dlid 0; 0,26,13,34 failed, skipping port
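When the output is long, it can help to see which directed-route (DR) paths recur in the warnings. The following is a minimal sketch, not part of the original note: it assumes the ibwarn lines have been captured as text, and the function name failing_dr_paths is hypothetical. It only counts how often each DR path string appears; it does not diagnose the cause.

```python
import re
from collections import Counter

# Captures the comma-separated hop list that follows
# "DR path slid <n>; dlid <n>; " in an ibwarn line, e.g. "0,26,13".
DR_PATH = re.compile(r"DR path slid \d+; dlid \d+; (\d+(?:,\d+)*)")


def failing_dr_paths(lines):
    """Count how often each DR path appears in ibwarn lines (hypothetical helper)."""
    counts = Counter()
    for line in lines:
        # Ignore anything that is not an ibwarn message.
        if not line.startswith("ibwarn:"):
            continue
        match = DR_PATH.search(line)
        # Lines such as "recv failed: Connection timed out" carry no DR path.
        if match:
            counts[match.group(1)] += 1
    return counts


if __name__ == "__main__":
    # Sample lines copied from the symptoms shown above.
    sample = [
        "ibwarn: [31013] mad_rpc: _do_madrpc failed; dport (DR path slid 0; dlid 0; 0,26,13)",
        "ibwarn: [31013] discover: can't reach node DR path slid 0; dlid 0; 0,26,13 port 16",
        "ibwarn: [31013] _do_madrpc: recv failed: Connection timed out",
        "ibwarn: [31013] mad_rpc: _do_madrpc failed; dport (DR path slid 0; dlid 0; 0,26,13,18)",
    ]
    for path, count in failing_dr_paths(sample).most_common():
        print(path, count)
```

In the sample above, every failing path shares the prefix 0,26,13, which suggests looking first at the node reached through that route.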
Cause